Abstract
Objective
To use Rasch analysis to examine the measurement properties of the 23-item version of the Work Instability Scale (WIS-23) in a sample of worker compensation claimants with upper extremity disorders.
Design
Secondary data analysis on the data retrieved from a cross-sectional study.
Setting
Tertiary care hospital.
Participants
Patients (N=392) attending a specialty clinic for workers with upper limb injuries at a tertiary hospital were prospectively enrolled.
Interventions
Not applicable.
Main Outcome Measures
WIS-23.
Results
The study sample contained 392 participants between the ages of 19 and 73 years (mean, 47.0±10.5y). There were 148 (37.8%) women, 182 (46.4%) men, and 62 (15.8%) participants for whom sex identification was unavailable. The initial WIS data analysis showed significant misfit from the Rasch model (item-trait interaction: χ2=293.52; P<.0001). Item removal and splitting were performed to improve the model fit, resulting in a 20-item scale that met all assumptions (χ2=160.42; P=.008), including unidimensionality, local independence of items, and the absence of differential item function based on age, sex of respondents, employment type, and affected upper extremity area across all tested factors.
Conclusion
With the application of Rasch analysis, we refined the WIS-23 to produce a 20-item WIS for work-related upper extremity disorders (WIS-WREUD). The 20-item WIS-WREUD demonstrated excellent item and person fit, unidimensionality, acceptable person separation index, and local independency. The WIS-20 may provide better measurement properties, although longitudinal psychometric evaluations are needed.
Keywords: Arthritis, rheumatoid, Occupational health, Presenteeism, Rehabilitation, Work, Work performance
List of abbreviations: DIF, differential item functioning; ICC, item characteristic curve; LD, local dependency; OA, osteoarthritis; PCA, principal component analysis; PSI, person separation index; RA, rheumatoid arthritis; WD, work disability; WI, work instability; WIS-23, Work Instability Scale 23-item version; WIS-WRUED, Work Instability Scale for work-related upper extremity disorders
The assessment of health-related at-work limitations is essential for clinical researchers to evaluate the effect of occupational injuries.1, 2, 3 Previous research suggests that productivity loss at work contributes to a considerable amount of indirect economic costs.4 Upper extremity injuries are commonly considered as the major source of work disability.5,6 Some clients have complicated career trajectories that include career changes and job adjustments that may relate to a mismatch between person’s functional capabilities and the demands of the job. This is defined as work instability (WI).7, 8, 9 WI can be related to a variety of factors, including decreased capability related to aging or disease or increased capability due to treatment effects. Work instability can also occur when the demands of the job change.
The WI Scale (WIS-23) was originally developed as a WI classification system for individuals with rheumatoid arthritis (RA) to support timely and appropriate management, including vocational rehabilitation, psychosocial support, and clinical intervention for job retention.7,10 It consists of 23 dichotomous items (yes or no) describing specific experiences or situations that provide indications of WI. Examples of individual items include: “I have pain or stiffness all the time at work.” The total score is a sum between 0 and 23, with higher scores indicating greater WI. Three levels are used to classify the total score into low WI (<10), indicating low risk of work disability (WD); moderate WI (10-17), corresponding to medium level of WD; and high WI (>17), indicating those individuals having high risk of WD.7 Workplace modification would be recommended for individuals (4 of 5) with a moderate level of WI and is critical for those (19 of 20) who have a high level of WI.7
Previous studies have affirmed acceptable reliability and construct validity under classical test theory in populations with RA and osteoarthritis (OA).7,11 In addition, a qualitative interview captured key themes of WIS-23 covering job flexibility, good working relationships, and symptom control.11 Several formats of the WIS have been developed based on different diagnoses and populations, including upper extremity disorders,12 manual workers,13 brain injuries,14 and OA.10
A large study involving 2092 participants concluded that individuals with work-related injuries had a higher risk of absence from work, mobility-related functional problems, disability, and impaired functioning related to anxiety or depression.15 Workers who experience musculoskeletal pain across various regions will seek diverse resources of health care, in comparison to those with an explicit health condition.13 This suggests the need for a universal version of WIS for work-related upper extremity disorders (WIS-WRUED).
Rasch analysis, is an alternative strategy for the evaluation of structural validity. Rasch analysis enables examination of the assumption of unidimensionality and structure of rating scales to convert the ordinal scale of individual items into interval scaling.16,17 To compute a total score, the response options should demonstrate interval level scaling, also known as the scaling assumption.17 Where outcome measures are not developed using Rasch modeling, they can be retrospectively evaluated for fit to the Rasch model, which often results in modifications to the questionnaire to obtain fit. Previous studies evaluated the WIS-23 using Rasch analysis and found a significant deviation from the Rasch model fit.10,12
Therefore, the purpose of this study was to apply Rasch analysis to (1) examine to what extent the rating scale of WIS-23 fits to the Rasch model by inspecting the test fit statistic and ordering of item thresholds; (2) examine the differential item functioning based on age, sex of respondents, employment type, and affected upper extremity area and exploring solutions by altering the rating scale; and (3) test the construct validity of all subscales of the Work Limitations Questionnaire by examining the unidimensionality and local dependency.
Methods
The study dataset was prospectively collected from a specialty clinic for workers with upper limb injuries in a tertiary hospital in Canada. The package of patient-reported outcome measures including WIS-23 was sent to patients shortly before their initial clinic assessment and completed immediately prior to attending clinic or at the initial clinic visit. Research ethics board approval was obtained for the clinical database, and patients provided informed written consent to have data used in research.
Rasch analysis included tests of unidimensionality fit of residual, ordering of item thresholds, person separation index (PSI), differential item functioning (DIF), and local independence of items. The analysis was performed using RUMM 2030 professional suite software.a The significance level was set at 0.05, with Bonferroni correction applied when multiple comparisons were made. To facilitate stable analyses, sample sizes of 250 participants were required.18,19
Rasch analysis
Test of fit
The test of fit quantifies to what extent items from the outcome measure meet the expectations of the Rasch model. Fit statistics can be checked at both overall and individual item levels. For the overall fit, the P value from a chi-square test of item-trait interaction should be nonsignificant according to the critical value.17,20 The item–person interaction statistics were then transformed to approximate a z score following a standardized normal distribution. We expected a mean of approximately 0 and a standard deviation of 1 to characterize the normal distribution.17 At the individual level, a fit residual localized within ±2.5 logits represented an adequate fit for the model.17
Threshold
The threshold refers to the point between 2 response categories at which either response is equally probable. Disordered thresholds mean that the respondents fail to meaningfully discriminate between response options or that options are potentially confusing. Threshold maps and categorical probability curves were used for visual inspection of this phenomenon. Where needed, we attempted to resolve this problem by collapsing adjacent categories and reversing the order of response options.17
Targeting
Scale-to-sample targeting reflects the extent to which the items can measure the whole range of an individual’s ability level. The person item threshold distribution displays the relative difficulty (item locations) and relative ability (person location) on the same ruler of logits. The better the ranges match each other, the greater the potential for precise person measurement. Poor targeting often results in floor or ceiling effects, indicating that patients at the extremes cannot be differentiated from each other nor can change be measured in terms of lower (floor) or higher (ceiling) future scores.20,21 A scale is considered well targeted if the difference between person and item means would be less than 1 logit unit.10
DIF
DIF occurs when individual groups of patients within the study sample (men vs women), respond differently to an item given the equal level of characteristic being measured. For instance, men and women with equal level of work disability may respond systematically different to an item measuring completeness of job demands.22 Two types of DIF can be identified. Uniform DIF is where the group shows a consistent systematic difference in their responses to an item. The standard approach of splitting the item for individual groups can be used to address such issue. Nonuniform DIF results from random differences (eg, responses to individual item vary across levels of the ability for subgroups). Currently, there is no solution for nonuniform DIF, except item removal.23 DIF was examined on age, sex of respondents, employment type, and affected upper extremity area using both critical statistical values using a Bonferroni adjusted P value of .0007 (.05/23∗3) and visual inspection by item characteristic curve (ICC).17 The visual inspection was facilitated by plotting the item characteristic curve along with the person trait for given person factors. Under an ideal situation, there is no difference in ICCs for different groups, indicating that participants with identical level of ability have equal probabilities of affirming a given item.
Dimensionality
The basic assumption of the Rasch model, unidimensionality, was checked through the principal component analysis (PCA) under item response theory. After the rescoring of individual response options and any resultant item reduction, the PCA was be revisited as confirmation of the unidimensionality.24 We set the number of significant t tests at 5% of the total comparisons as the indicator of multidimensionality.
Local independence
Residual pattern refers to the standardized person-item differences between the observed data and expected response generated by the model for every person’s response to individual item. The Rasch model requires no residual pattern existing in the data, which is named as local independence. Such residual pattern was examined by PCA. A residual correlation between any 2 items greater than 0.2 above the average correlation would appear to indicate the violation of local independence, as local dependency (LD).25,26 Potential reasons for the appearance of LD are response dependency and multidimensionality.27 Item deletion and the creation of testlets to bundle the dependent items was used as a potential solution to address the LD.25,28
Reliability
The PSI indicates the precision of the estimate for each person and was used to evaluate the internal consistency under the Rasch model. The acceptable value was set as 0.7 to establish that the scale is reliable to distinguish between at least 2 groups.29, 30, 31 In addition to the PSI, the traditional Cronbach’s alpha was provided by the RUMM 2030 program; 0.8 was adopted as the satisfactory value.32
Results
Study participants
The study sample contained 392 participants with full responses. The age of participants ranged from 19 to 73 years old (mean ± SD, 47.0±10.5y). There were 148 (37.8%) women, 182 (46.4%) men, and 62 (15.8%) participants missing sex information. Within the study sample, 56.4% of the total subjects engaged in labor work, 16.8% engaged in office work, and 7.9% engaged in a job that included both labor and office work. Due to the nature of Rasch analysis, continuous descriptive variables were transferred to categorical data. The continuous age variable was then recoded into 2 groups according to the median value. Specifically, code 1 represented the group between 19 and 49 years old, and code 2 represented those between 50 and 73 years old. A full summary of participant demographic and clinical characteristics is listed in table 1.
Table 1.
Personal Factor | Classification | Frequency | Percentage |
---|---|---|---|
Sex | Women | 148 | 37.8 |
Men | 182 | 46.4 | |
Missing | 62 | 15.8 | |
Affected side | Left | 119 | 30.4 |
Right | 177 | 45.2 | |
Both | 76 | 19.4 | |
Missing | 20 | 5.0 | |
Job | Labor | 221 | 56.4 |
Office | 66 | 16.8 | |
Mixed | 31 | 7.9 | |
Missing | 74 | 18.9 | |
Injury region | Wrist and hand | 99 | 25.3 |
Elbow and forearm | 57 | 14.5 | |
Shoulder and arm | 171 | 43.6 | |
Upper extremity (CRPS) | 12 | 3.1 | |
Other | 6 | 1.5 | |
Missing | 47 | 12.0 | |
Age, y | Mean ± SD | Median | Range |
47.0±10.5 | 49 | 19-73 | |
Abbreviation: CRPS, complex regional pain syndrome. |
Test of fit
The rating scale model was selected for the current analysis due to the unified dichotomous response options over all 23 times.27 The initial evaluation of the overall questionnaire with 23 items demonstrated poor overall fit to the Rasch model in the substantial deviation in the standardized item fit residual statistic (mean ± SD, –0.27±1.9). The significant chi-square test (χ2=293.52; df=138; P<.001) for item-trait interaction revealed misfit from Rasch model. Table 2 lists the overall summary of fit statistics. To locate problematic items that may cause misfit issues, we checked the individual item fit statistics. Items 1 and 23 were misfitting as the fit residual values were greater than 2.5. Individual item descriptions with Rasch solutions are shown in table 3.
Table 2.
Analysis | Sample Size |
Item Fit Residual |
Person Fit Residual |
Chi-Square Interaction |
PSI | Alpha | Unidimensionality T Tests |
|||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
With ext | Without ext | Mean ± SD | Mean ± SD | Value | df | P Value | No. of significant tests | N | % | |||
Original 23-item | 392 | 384 | –0.27 ± 1.9 | –0.23 ± 0.78 | 293.52 | 138 | <.0001 | 0.85 | 0.87 | 19 | 384 | 4.95 |
Revised 20-item | 392 | 381 | –0.38 ± 1.25 | –0.25 ± 0.73 | 160.42 | 120 | .008 | 0.85 | 0.87 | 12 | 381 | 3.15 |
Abbreviation: ext: extreme data. NOTE. The original work instability index contains 23 items. To achieve the Rasch model fit, 4 items, including items 1 and 23, were removed. Item 19 was deleted due to the issue of nonuniform DIF in the job classification. Item 11 was separated for men and women, and items 18 and 22 were separated for different affected regions due to the presence of uniform DIF. No disordered thresholds were found in the original or revised versions. After Bonferroni correction, the significant P value was .0027.
Table 3.
Item Description | Solution | Rationale | |
---|---|---|---|
1 | I can get my job done, I’m just a lot slower | Remove | Fit residual 5.92 |
2 | If I don't reduce my hours, I may have to give up work | ||
3 | I am very worried about my ability to keep working | ||
4 | I have pain or stiffness all the time at work | ||
5 | I don't have the stamina to work like I used to | ||
6 | I have used my holiday so that I don't have to go on sick leave | ||
7 | I push myself to go to work because I don't want to give in to my shoulder/elbow problem | ||
8 | Sometimes I can't face being at work all day | ||
9 | I have to say no to certain things at work | ||
10 | I've got to watch how much I do certain things at work | ||
11 | I have great difficulty opening some of the doors at work | Item split | Uniform DIF identified in sex |
12 | I have to allow myself extra time to do some jobs | ||
13 | It's very frustrating because I can't always do things at work | ||
14 | I feel I may have to give up work | ||
15 | I get on with work but afterwards I have a lot of pain | ||
16 | When I’m feeling tired all the time, work's a grind | ||
17 | I'd like another job, but I am restricted as to what I can do | ||
18 | I'm getting up earlier because of my shoulder/elbow problem | Item split | Uniform DIF identified in injury region |
19 | I get very stiff at work | Remove | Nonuniform DIF identified in job |
20 | I'm finding my job is about all I can manage | ||
21 | The stress of my job makes my shoulder/elbow problem flare | ||
22 | I'm finding any pressure on my hands is a problem | Item split | Uniform DIF identified in injury region |
23 | I get good days and bad days at work | Remove | Fit residual 2.86 |
Threshold
The inspection of the threshold map of WIS-23 suggested that no disordered issues were present, as expected given the dichotomous response option. Figure 1 shows the threshold map of the original WIS.
Targeting
Figure 2 shows the targeting of the WIS-23. The mean of person logits is equal to 0.24 (<1 logit), indicating that the scale item difficulty matches the abilities of the sample. In addition, the floor and ceiling effects were not detected because only 6% (<10 %) of the participants were on either end of the spectrum.
DIF
The personal factors considered as potential sources of DIF (bias) were sex, age (19-49y and 50-73y), affected sides (left, right, bilateral), injured body area (wrist and hand, elbow and forearm, shoulder and arm, entire upper extremity), and 3 job categories (labor, office, and mixed). Uniform DIF due to sex bias was detected on item 11, as the curve representing expected value of item response for women was higher than the curve for men across most person locations (person ability). Figure 3 shows the uniform-DIF of item 11. Similar uniform DIFs were identified on items 18 and 22 for injured body area due to selection bias. Item 19 was removed due to the nonuniform DIF identified in different job categories. Table 3 lists the individual item description with Rasch solutions.
Dimensionality
The WIS-23 met the assumption of unidimensionality as 4.95% (<5%) of the independent t tests were found to be significant at the 5% level (see table 2 for the overall summary of fit statistics).
Local independence
None of the item-to-item correlations exceeded the cutoff value, and the assumption of local independence was met in the WIS-23 (Supplemental Table S1, available online only at http://www.archives-pmr.org/).
Reliability
The PSI value was equal to 0.85 for the WIS-23. The Cronbach’s alpha was calculated as 0.87 without missing data points.
Revisiting the Rasch statistics
After removal of 3 items with unresolvable issues, including item misfit (items 1 and 23) and nonuniform DIF (item 19), the revised WIS questionnaire contained 20 items. The reanalyzed WIS-20 showed acceptable level of overall fit to the Rasch model (mean ± SD, –0.38±1.25) with a nonsignificant chi-square test (χ2=160.42; df=120; P=.008) for item-trait interaction (see table 2). No disordered thresholds were revealed during the reanalysis (fig 4) A mean value of 0.14 logits (<1 logit) revealed good targeting of the revised WIS. Flooring and ceiling effects were absent in the 20-item version (fig 5). The revised version was in accordance with the assumptions of dimensionality and local independency. Both PSI and reliability statistics remained the same after deleting 3 items (see table 2). Therefore, we conducted a logit transformation of the 20-item WIS summed scores to provide interval-level scaling. The conversions are presented in table 4.
Table 4.
Summed Score | Logit | Interval |
---|---|---|
0 | –4.493 | 0.0 |
1 | –3.47 | 2.3 |
2 | –2.736 | 4.0 |
3 | –2.21 | 5.2 |
4 | –1.785 | 6.2 |
5 | –1.42 | 7.0 |
6 | –1.091 | 7.7 |
7 | –0.788 | 8.4 |
8 | –0.503 | 9.1 |
9 | –0.23 | 9.7 |
10 | 0.035 | 10.3 |
11 | 0.299 | 10.9 |
12 | 0.564 | 11.5 |
13 | 0.838 | 12.1 |
14 | 1.126 | 12.8 |
15 | 1.438 | 13.5 |
16 | 1.787 | 14.3 |
17 | 2.192 | 15.2 |
18 | 2.691 | 16.3 |
19 | 3.372 | 17.9 |
20 | 4.303 | 20.0 |
Discussion
Our Rasch analysis of WIS-23 supports a 20-item version and provides a transformation of interval level scaling in injured workers with upper extremity conditions. This complements the findings from a previous study by Tang et al,12 in which they recommended a 17-item WIS specifically for the upper extremity. They excluded items 1 and 23 among others, which were also recommended to be removed by the current Rasch analysis.
Our initial analysis indicated that there were no disordered thresholds. This was an expected finding as dichotomous response options are not susceptible to disordered thresholds, as reported in a previous Rasch analysis study.12
PSI values of >0.70 and >0.85 are considered acceptable for group and individual use, respectively. The PSI value for the 20-item WIS was 0.85, indicating that the WIS can discriminate at both levels and show similar discrimination as the WIS-23 (PSI=0.83).12
We had to delete 2 misfitting items, including items 1 (“I can get my job done, I’m just a lot slower”) and 23 (“I get good and bad days at work”), as they had fit residual greater than ±2.5. These items also misfit in a previous Rasch analysis,12 which increases our confidence that there may be problems with these items that warrant their removal. Item 1 is a double-barreled question, as an answer of “no” could mean that the respondent cannot do their job tasks (eg, modified work) or that they can and are not slower (eg, can do normally but have pain). Double-barreled questions are insufficient in terms of content validity. Thus, our Rasch findings that question this item reinforce the need for careful content validity before items are included in a measure and provide justification for removal of the item. Item 23 was designed to indicate the fluctuation of symptoms during day-to-day work but not necessarily for respondents in the recovery phase.12 These findings are similar to previous Rasch analyses. Because a single piece of evidence is too preliminary to justify large-scale adoption of a revised measure, this level of conceptual and independent verification is needed. We suggest that these combined concerns justify the removal of these 2 items.
Prior studies have suggested removal of items 8, 13, 20, and 22 to improve the Rasch model fit. However, in the current study, we found no significant deviation from the model fit and retained these items. Because work instability is a complex phenomenon and the questionnaire is already quickly answered due to its yes/no response options, we believe that an overly shortened item may not be sufficiently inclusive of the problems individuals experience. Therefore, where items do function well, we would prefer to retain them. Some differences between Rasch analyses in different studies can be expected due to differences in the health conditions, occupations, and demographics of different study samples. For example, item 22 (“I'm finding any pressure on my hands is a problem”) performed well in our sample in which 25.1% of the injured workers had claims related to the hand and wrist, but may not have been as relevant to patients in a previous study that only included patients with shoulder injuries.12 As we see different Rasch analyses being performed on the same instrument, we have come to appreciate that variances in results are to be expected. Therefore, researchers wanting to optimize measurement in their sample should choose a Rasch solution that is most similar to their population. Permanent changes to measures should be considered where there are consistent findings that items do not perform well.
The third item eliminated from the 23-item version to create our 20-item WIS was item 19 (“I get very stiff at work”) since it exhibited nonuniform DIF based on job classification. Usually, nonuniform DIF occurs when the ICC indicates that the participants from different groups with identical ability levels have different probabilities of correctly responding to an item. In the current case, we classified jobs as labor, office work, and mixed. The onset of stiffness could vary between individuals depending on what type of activities they perform. For example, an office worker who works in front of a computer can develop spinal stiffness due to sustained postures, whereas a person with a manual job might have knee and hand stiffness that arises from overuse of muscles and joints. These could be quite different in nature and severity. Because the question is not very specific in terms of the source of stiffness, different jobs could have a different type of response or bias. However, unless this is a stable finding across studies, we are less certain of the recommendation for permanent removal.
Our study identified uniform DIF for item 11 (“I have great difficulty opening some of the doors at work”) for men and women. This did not result in removal of the item because we recognize that differences in response patterns could be due to the biological variations in strength, muscle fibers, etc33 because the percentage of strength required for men and women to open a standard door would be different. Uniform DIF for areas of injury was also observed in items 18 (“I'm getting up earlier because of my shoulder/elbow problem”) and 22 (“I'm finding any pressure on my hands is a problem”) in the current study. A similar trend was observed in a previous study by Tang et al,12 including uniform DIF for items 18 and 22 between rheumatoid arthritis patients and OA patients. This indicates that patients with different areas of injury in the upper extremity will respond differently to these 2 questions (items 18 and 22). Hence, these 2 items should be separated for different subgroups of patients with various areas of injury.
The transformation of the summed score to an interval-level score in our revised version enables the analysis of within- and between-subject differences such as analysis of variance since interval level scaling is an assumption for mathematical manipulations. Ordinal measures performed by many self-report instruments cannot guarantee that the intervals between different response options are equivalent and, therefore, it would be inappropriate to perform mathematical operations on these scores. We recognize that this is routinely done. However, it violates the assumptions of statistical tests and is a potential source of error.
Study limitations
A limitation of Rasch analysis is that it can be used to identify problematic items but cannot identify why an item does not perform well. We know that many currently used outcome measures do not have publications that clearly articulate the steps taken to ensure content validity of the items. Issues such as lack of clarity of items, double-barreled questions, and lack of relevance are surprisingly common in currently used outcome measures. Therefore, it is possible that Rasch analysis will indicate the need to delete an item that could be rehabilitated. Some differences exist across different Rasch solutions and only those items that consistently demonstrate measurement problems can be confidently removed34 to avoid multiple conflicting versions of outcome measures. However, because shortened measures that remove nonfunctioning items can lessen burden and improve measurement, they should be adopted where sufficient evidence exists.
Implications
Further research should include cognitive interviewing and qualitative methods to further examine the content validity and clarity of the WIS items to complement statistical approaches and comparison of different Rasch solutions in the same sample to further examine proposed alternate forms, including complementary statistical analyses such as factor analyses and classic psychometric properties (eg, responsiveness). A comparative approach across samples and versions is needed to provide robust evidence on the “best” version of the WIS. At present, researchers should use the version of the WIS that provides the best measurement properties for their study sample.
Conclusions
In conclusion, we found through Rasch analysis that 3 problematic items of the WIS-23 could be removed to perform a well-functioning interval level scaled WIS-20 in injured workers with upper extremity musculoskeletal conditions. The 20-item WIS-WREUD demonstrated excellent item and person fit, unidimensionality, acceptable person separation index, and local independency.
Supplier
-
a.
RUMM 2030; RUMM Laboratory Pty Ltd.
Footnotes
Dr MacDermid was supported by a CIHR Chair in Gender, Work and Health and the Dr James Roth Research Chair in Musculoskeletal Measurement and Knowledge Translation.
Disclosures: none.
Supplementary Data
References
- 1.Lerner D., Amick B.C., 3rd, Rogers W.H., Malspeis S., Bungay K., Cynn D. The Work Limitations Questionnaire. Med Care. 2001;39:72–85. doi: 10.1097/00005650-200101000-00009. [DOI] [PubMed] [Google Scholar]
- 2.Lerner D., Reed J.I., Massarotti E. The Work Limitations Questionnaire’s validity and reliability among patients with osteoarthritis. J Clin Epidemiol. 2002;55:197–208. doi: 10.1016/s0895-4356(01)00424-3. [DOI] [PubMed] [Google Scholar]
- 3.Escorpizo R., Cieza A., Beaton D., Boonen A. Content comparison of worker productivity questionnaires in arthritis and musculoskeletal conditions using the international classification of functioning, disability, and health framework. J Occup Rehabil. 2009;19:382–397. doi: 10.1007/s10926-009-9193-0. [DOI] [PubMed] [Google Scholar]
- 4.Koopman C., Pelletier K.R., Murray J.F. Stanford Presenteeism Scale: health status and employee productivity. J Occup Environ Med. 2002;44:14–20. doi: 10.1097/00043764-200201000-00004. [DOI] [PubMed] [Google Scholar]
- 5.Baldwin M.L., Butler R.J. Upper extremity disorders in the workplace: costs and outcomes beyond the first return to work. J Occup Rehabil. 2006;16:303–323. doi: 10.1007/s10926-006-9043-2. [DOI] [PubMed] [Google Scholar]
- 6.Workplace Safety and Insurance Board Workplace Safety and Insurance Board 2018 annual report. https://www.wsib.ca/en/2018-annual-report-highlights Available at: Accessed January 27, 2021.
- 7.Gilworth G., Chamberlain M.A., Harvey A. Development of a work instability scale for rheumatoid arthritis. Arthritis Rheum. 2003;49:349–354. doi: 10.1002/art.11114. [DOI] [PubMed] [Google Scholar]
- 8.Van Oostrom S.H., Driessen M.T., De Vet H.C.W. Workplace interventions for preventing work disability. Cochrane Database Syst Rev. 2009;2:CD006955. doi: 10.1002/14651858.CD006955.pub2. [DOI] [PubMed] [Google Scholar]
- 9.Dorman P. The economics of safety, health, and well-being at work: an overview. https://www.ilo.org/wcmsp5/groups/public/---ed_protect/---protrav/---safework/documents/publication/wcms_110382.pdf Available at: Accessed January 27, 2021.
- 10.Tang K., Beaton D.E., Lacaille D. The Work Instability Scale for Rheumatoid Arthritis (RA-WIS): does it work in osteoarthritis? Qual Life Res. 2010;19:1057–1068. doi: 10.1007/s11136-010-9656-y. [DOI] [PubMed] [Google Scholar]
- 11.Tang K., Beaton D.E., Boonen A., Gignac M.A.M., Bombardier C. Measures of work disability and productivity: Rheumatoid Arthritis Specific Work Productivity Survey (WPS-RA), Workplace Activity Limitations Scale (WALS), Work Instability Scale for Rheumatoid Arthritis (RA-WIS), Work Limitations Questionnaire (WLQ), and Work Productivity and Activity Impairment Questionnaire (WPAI) Arthritis Care Res. 2011;63:S337–S349. doi: 10.1002/acr.20633. [DOI] [PubMed] [Google Scholar]
- 12.Tang K., Beaton D.E., Gignac M.A.M., Bombardier C. Rasch analysis informed modifications to the Work Instability Scale for Rheumatoid Arthritis for use in work-related upper limb disorders. J Clin Epidemiol. 2011;64:1242–1251. doi: 10.1016/j.jclinepi.2011.02.002. [DOI] [PubMed] [Google Scholar]
- 13.Gilworth G., Smyth M.G., Smith J., Tennant A. The manual work instability scale: development and validation. Occup Med (Lond) 2016;66:300–304. doi: 10.1093/occmed/kqv217. [DOI] [PubMed] [Google Scholar]
- 14.Gilworth G., Carey A., Eyres S. Screening for job loss: development of a work instability scale for traumatic brain injury. Brain Inj. 2006;20:835–843. doi: 10.1080/02699050600832221. [DOI] [PubMed] [Google Scholar]
- 15.Lilley R., Davie G., Langley J., Ameratunga S., Derrett S. Do outcomes differ between work and non-work-related injury in a universal injury compensation system? Findings from the New Zealand Prospective Outcomes of Injury Study. BMC Public Health. 2013;13:1–9. doi: 10.1186/1471-2458-13-995. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Cano S.J., Klassen A.F., Scott A.M., Cordeiro P.G., Pusic A.L. The BREAST-Q: further validation in independent clinical samples. Plast Reconstr Surg. 2012;129:293–302. doi: 10.1097/PRS.0b013e31823aec6b. [DOI] [PubMed] [Google Scholar]
- 17.Pallant J.F., Tennant A. An introduction to the Rasch measurement model: an example using the Hospital Anxiety and Depression Scale (HADS) Br J Clin Psychol. 2007;46:1–18. doi: 10.1348/014466506x96931. [DOI] [PubMed] [Google Scholar]
- 18.Chen W.H., Lenderking W., Jin Y., Wyrwich K.W., Gelhorn H., Revicki D.A. Is Rasch model analysis applicable in small sample size pilot studies for assessing item characteristics? An example using PROMIS pain behavior item bank data. Qual Life Res. 2014;23:485–493. doi: 10.1007/s11136-013-0487-5. [DOI] [PubMed] [Google Scholar]
- 19.Smith A.B., Rush R., Fallowfield L.J., Velikova G., Sharpe M. Rasch fit statistics and sample size considerations for polytomous data. BMC Med Res Methodol. 2008;8:33. doi: 10.1186/1471-2288-8-33. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Jerosch-Herold C, Chester R, Shepstone L, Vincent JI, MacDermid JC. An evaluation of the structural validity of the shoulder pain and disability index (SPADI) using the Rasch model. Qual Life Res 2018;27:389-400. [DOI] [PubMed]
- 21.McHorney C.A., Tarlov A.R. Individual-patient monitoring in clinical practice: are available health status surveys adequate? Qual Life Res. 1995;4:293–307. doi: 10.1007/BF01593882. [DOI] [PubMed] [Google Scholar]
- 22.Tang K., Beaton D.E., Amick B.C., 3rd, Hogg-Johnson H., Côté P., Loisel P. Confirmatory factor analysis of the Work Limitations Questionnaire (WLQ-25 ) in workers’ compensation claimants with chronic upper-limb disorders. 2013;23:228–238. doi: 10.1007/s10926-012-9397-6. [DOI] [PubMed] [Google Scholar]
- 23.Kersten P., White P.J., Tennant A. Is the pain visual analogue scale linear and responsive to change? An exploration using rasch analysis. PLoS One. 2014;9 doi: 10.1371/journal.pone.0099485. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Packham T., Macdermid J.C. Measurement properties of the patient-rated wrist and hand evaluation: Rasch analysis of responses from a traumatic hand injury population. J Hand Ther. 2013;26:216–224. doi: 10.1016/j.jht.2012.12.006. [DOI] [PubMed] [Google Scholar]
- 25.Christensen K.B., Makransky G., Horton M. Critical values for Yen’s Q3: identification of local dependence in the Rasch model using residual correlations. Appl Psychol Meas. 2017;41:178–194. doi: 10.1177/0146621616677520. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Walton D.M., MacDermid J.C. A brief 5-item version of the Neck Disability Index shows good psychometric properties. Health Qual Life Outcomes. 2013;11:108. doi: 10.1186/1477-7525-11-108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Nilsson Å.L., Tennant A. Past and present issues in Rasch analysis: the Functional Independence Measure (FIMTM) revisited. J Rehabil Med. 2011;43:884–891. doi: 10.2340/16501977-0871. [DOI] [PubMed] [Google Scholar]
- 28.Gothwal V.K., Wright T., Lamoureux E.L., Pesudovs K. Psychometric properties of visual functioning index using Rasch analysis. Acta Ophthalmol. 2010;88:797–803. doi: 10.1111/j.1755-3768.2009.01562.x. [DOI] [PubMed] [Google Scholar]
- 29.Koopmans L., Bernaards C.M., Hildebrandt V.H., van Buuren S., van der Beek A.J., de Vet H.C.W. Improving the Individual Work Performance Questionnaire using Rasch analysis. J Appl Meas. 2014;15:160–175. [PubMed] [Google Scholar]
- 30.Lamoureux E.L., Pallant J.F., Pesudovs K., Hassell J.B., Keeffe J.E. The impact of vision impairment questionnaire: an evaluation of its measurement properties using Rasch analysis. Investig Ophthalmol Vis Sci. 2006;47:4732–4741. doi: 10.1167/iovs.06-0220. [DOI] [PubMed] [Google Scholar]
- 31.Tavakol M., Dennick R. Making sense of Cronbach’s alpha. Int J Med Educ. 2011;2:53–55. doi: 10.5116/ijme.4dfb.8dfd. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Lambert S.D., Pallant J.F., Boyes A.W., King M.T., Britton B., Girgis A. A Rasch analysis of the Hospital Anxiety and Depression Scale (HADS) among cancer survivors. Psychol Assess. 2013;25:379–390. doi: 10.1037/a0031154. [DOI] [PubMed] [Google Scholar]
- 33.Miller E.A., MacDougall J.D., Tarnopolsky M.A., Sale D.G. Gender differences in strength and muscle fiber characteristics. Eur J Appl Physiol Occup Physiol. 1993;66:254–262. doi: 10.1007/BF00235103. [DOI] [PubMed] [Google Scholar]
- 34.Lu Z., MacDermid J.C., Nazari G. Agreement between original and Rasch-approved Neck Disability Index. BMC Med Res Methodol. 2020;20:180. doi: 10.1186/s12874-020-01069-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.