Evaluation of the Structural Validity of the Work Instability Scale Using the Rasch Model

Ze Lu; Joshua I Vincent; Joy C MacDermid

doi:10.1016/j.arrct.2021.100103

. 2021 Jan 13;3(1):100103. doi: 10.1016/j.arrct.2021.100103

Evaluation of the Structural Validity of the Work Instability Scale Using the Rasch Model

Ze Lu ^a,^b,^∗, Joshua I Vincent ^b,^c, Joy C MacDermid ^a,^b,^c

PMCID: PMC7984990 PMID: 33778476

Abstract

Objective

To use Rasch analysis to examine the measurement properties of the 23-item version of the Work Instability Scale (WIS-23) in a sample of worker compensation claimants with upper extremity disorders.

Design

Secondary data analysis on the data retrieved from a cross-sectional study.

Setting

Tertiary care hospital.

Participants

Patients (N=392) attending a specialty clinic for workers with upper limb injuries at a tertiary hospital were prospectively enrolled.

Interventions

Not applicable.

Main Outcome Measures

WIS-23.

Results

The study sample contained 392 participants between the ages of 19 and 73 years (mean, 47.0±10.5y). There were 148 (37.8%) women, 182 (46.4%) men, and 62 (15.8%) participants for whom sex identification was unavailable. The initial WIS data analysis showed significant misfit from the Rasch model (item-trait interaction: χ²=293.52; P<.0001). Item removal and splitting were performed to improve the model fit, resulting in a 20-item scale that met all assumptions (χ²=160.42; P=.008), including unidimensionality, local independence of items, and the absence of differential item function based on age, sex of respondents, employment type, and affected upper extremity area across all tested factors.

Conclusion

With the application of Rasch analysis, we refined the WIS-23 to produce a 20-item WIS for work-related upper extremity disorders (WIS-WREUD). The 20-item WIS-WREUD demonstrated excellent item and person fit, unidimensionality, acceptable person separation index, and local independency. The WIS-20 may provide better measurement properties, although longitudinal psychometric evaluations are needed.

Keywords: Arthritis, rheumatoid, Occupational health, Presenteeism, Rehabilitation, Work, Work performance

List of abbreviations: DIF, differential item functioning; ICC, item characteristic curve; LD, local dependency; OA, osteoarthritis; PCA, principal component analysis; PSI, person separation index; RA, rheumatoid arthritis; WD, work disability; WI, work instability; WIS-23, Work Instability Scale 23-item version; WIS-WRUED, Work Instability Scale for work-related upper extremity disorders

The assessment of health-related at-work limitations is essential for clinical researchers to evaluate the effect of occupational injuries.1, 2, 3 Previous research suggests that productivity loss at work contributes to a considerable amount of indirect economic costs.⁴ Upper extremity injuries are commonly considered as the major source of work disability.⁵^,⁶ Some clients have complicated career trajectories that include career changes and job adjustments that may relate to a mismatch between person’s functional capabilities and the demands of the job. This is defined as work instability (WI).7, 8, 9 WI can be related to a variety of factors, including decreased capability related to aging or disease or increased capability due to treatment effects. Work instability can also occur when the demands of the job change.

The WI Scale (WIS-23) was originally developed as a WI classification system for individuals with rheumatoid arthritis (RA) to support timely and appropriate management, including vocational rehabilitation, psychosocial support, and clinical intervention for job retention.⁷^,¹⁰ It consists of 23 dichotomous items (yes or no) describing specific experiences or situations that provide indications of WI. Examples of individual items include: “I have pain or stiffness all the time at work.” The total score is a sum between 0 and 23, with higher scores indicating greater WI. Three levels are used to classify the total score into low WI (<10), indicating low risk of work disability (WD); moderate WI (10-17), corresponding to medium level of WD; and high WI (>17), indicating those individuals having high risk of WD.⁷ Workplace modification would be recommended for individuals (4 of 5) with a moderate level of WI and is critical for those (19 of 20) who have a high level of WI.⁷

Previous studies have affirmed acceptable reliability and construct validity under classical test theory in populations with RA and osteoarthritis (OA).⁷^,¹¹ In addition, a qualitative interview captured key themes of WIS-23 covering job flexibility, good working relationships, and symptom control.¹¹ Several formats of the WIS have been developed based on different diagnoses and populations, including upper extremity disorders,¹² manual workers,¹³ brain injuries,¹⁴ and OA.¹⁰

A large study involving 2092 participants concluded that individuals with work-related injuries had a higher risk of absence from work, mobility-related functional problems, disability, and impaired functioning related to anxiety or depression.¹⁵ Workers who experience musculoskeletal pain across various regions will seek diverse resources of health care, in comparison to those with an explicit health condition.¹³ This suggests the need for a universal version of WIS for work-related upper extremity disorders (WIS-WRUED).

Rasch analysis, is an alternative strategy for the evaluation of structural validity. Rasch analysis enables examination of the assumption of unidimensionality and structure of rating scales to convert the ordinal scale of individual items into interval scaling.¹⁶^,¹⁷ To compute a total score, the response options should demonstrate interval level scaling, also known as the scaling assumption.¹⁷ Where outcome measures are not developed using Rasch modeling, they can be retrospectively evaluated for fit to the Rasch model, which often results in modifications to the questionnaire to obtain fit. Previous studies evaluated the WIS-23 using Rasch analysis and found a significant deviation from the Rasch model fit.¹⁰^,¹²

Therefore, the purpose of this study was to apply Rasch analysis to (1) examine to what extent the rating scale of WIS-23 fits to the Rasch model by inspecting the test fit statistic and ordering of item thresholds; (2) examine the differential item functioning based on age, sex of respondents, employment type, and affected upper extremity area and exploring solutions by altering the rating scale; and (3) test the construct validity of all subscales of the Work Limitations Questionnaire by examining the unidimensionality and local dependency.

Methods

The study dataset was prospectively collected from a specialty clinic for workers with upper limb injuries in a tertiary hospital in Canada. The package of patient-reported outcome measures including WIS-23 was sent to patients shortly before their initial clinic assessment and completed immediately prior to attending clinic or at the initial clinic visit. Research ethics board approval was obtained for the clinical database, and patients provided informed written consent to have data used in research.

Rasch analysis included tests of unidimensionality fit of residual, ordering of item thresholds, person separation index (PSI), differential item functioning (DIF), and local independence of items. The analysis was performed using RUMM 2030 professional suite software.a The significance level was set at 0.05, with Bonferroni correction applied when multiple comparisons were made. To facilitate stable analyses, sample sizes of 250 participants were required.¹⁸^,¹⁹

Rasch analysis

Test of fit

The test of fit quantifies to what extent items from the outcome measure meet the expectations of the Rasch model. Fit statistics can be checked at both overall and individual item levels. For the overall fit, the P value from a chi-square test of item-trait interaction should be nonsignificant according to the critical value.¹⁷^,²⁰ The item–person interaction statistics were then transformed to approximate a z score following a standardized normal distribution. We expected a mean of approximately 0 and a standard deviation of 1 to characterize the normal distribution.¹⁷ At the individual level, a fit residual localized within ±2.5 logits represented an adequate fit for the model.¹⁷

Threshold

The threshold refers to the point between 2 response categories at which either response is equally probable. Disordered thresholds mean that the respondents fail to meaningfully discriminate between response options or that options are potentially confusing. Threshold maps and categorical probability curves were used for visual inspection of this phenomenon. Where needed, we attempted to resolve this problem by collapsing adjacent categories and reversing the order of response options.¹⁷

Targeting

Scale-to-sample targeting reflects the extent to which the items can measure the whole range of an individual’s ability level. The person item threshold distribution displays the relative difficulty (item locations) and relative ability (person location) on the same ruler of logits. The better the ranges match each other, the greater the potential for precise person measurement. Poor targeting often results in floor or ceiling effects, indicating that patients at the extremes cannot be differentiated from each other nor can change be measured in terms of lower (floor) or higher (ceiling) future scores.²⁰^,²¹ A scale is considered well targeted if the difference between person and item means would be less than 1 logit unit.¹⁰

DIF

DIF occurs when individual groups of patients within the study sample (men vs women), respond differently to an item given the equal level of characteristic being measured. For instance, men and women with equal level of work disability may respond systematically different to an item measuring completeness of job demands.²² Two types of DIF can be identified. Uniform DIF is where the group shows a consistent systematic difference in their responses to an item. The standard approach of splitting the item for individual groups can be used to address such issue. Nonuniform DIF results from random differences (eg, responses to individual item vary across levels of the ability for subgroups). Currently, there is no solution for nonuniform DIF, except item removal.²³ DIF was examined on age, sex of respondents, employment type, and affected upper extremity area using both critical statistical values using a Bonferroni adjusted P value of .0007 (.05/23∗3) and visual inspection by item characteristic curve (ICC).¹⁷ The visual inspection was facilitated by plotting the item characteristic curve along with the person trait for given person factors. Under an ideal situation, there is no difference in ICCs for different groups, indicating that participants with identical level of ability have equal probabilities of affirming a given item.

Dimensionality

The basic assumption of the Rasch model, unidimensionality, was checked through the principal component analysis (PCA) under item response theory. After the rescoring of individual response options and any resultant item reduction, the PCA was be revisited as confirmation of the unidimensionality.²⁴ We set the number of significant t tests at 5% of the total comparisons as the indicator of multidimensionality.

Local independence

Residual pattern refers to the standardized person-item differences between the observed data and expected response generated by the model for every person’s response to individual item. The Rasch model requires no residual pattern existing in the data, which is named as local independence. Such residual pattern was examined by PCA. A residual correlation between any 2 items greater than 0.2 above the average correlation would appear to indicate the violation of local independence, as local dependency (LD).²⁵^,²⁶ Potential reasons for the appearance of LD are response dependency and multidimensionality.²⁷ Item deletion and the creation of testlets to bundle the dependent items was used as a potential solution to address the LD.²⁵^,²⁸

Reliability

The PSI indicates the precision of the estimate for each person and was used to evaluate the internal consistency under the Rasch model. The acceptable value was set as 0.7 to establish that the scale is reliable to distinguish between at least 2 groups.29, 30, 31 In addition to the PSI, the traditional Cronbach’s alpha was provided by the RUMM 2030 program; 0.8 was adopted as the satisfactory value.³²

Results

Study participants

The study sample contained 392 participants with full responses. The age of participants ranged from 19 to 73 years old (mean ± SD, 47.0±10.5y). There were 148 (37.8%) women, 182 (46.4%) men, and 62 (15.8%) participants missing sex information. Within the study sample, 56.4% of the total subjects engaged in labor work, 16.8% engaged in office work, and 7.9% engaged in a job that included both labor and office work. Due to the nature of Rasch analysis, continuous descriptive variables were transferred to categorical data. The continuous age variable was then recoded into 2 groups according to the median value. Specifically, code 1 represented the group between 19 and 49 years old, and code 2 represented those between 50 and 73 years old. A full summary of participant demographic and clinical characteristics is listed in table 1.

Table 1.

Demographics of the total sample (N=392)

Personal Factor	Classification	Frequency	Percentage
Sex	Women	148	37.8
	Men	182	46.4
	Missing	62	15.8
Affected side	Left	119	30.4
	Right	177	45.2
	Both	76	19.4
	Missing	20	5.0

Job	Labor	221	56.4
	Office	66	16.8
	Mixed	31	7.9
	Missing	74	18.9

Injury region	Wrist and hand	99	25.3
	Elbow and forearm	57	14.5
	Shoulder and arm	171	43.6
	Upper extremity (CRPS)	12	3.1
	Other	6	1.5
	Missing	47	12.0

Age, y	Mean ± SD	Median	Range
	47.0±10.5	49	19-73
Abbreviation: CRPS, complex regional pain syndrome.

Open in a new tab

Test of fit

The rating scale model was selected for the current analysis due to the unified dichotomous response options over all 23 times.²⁷ The initial evaluation of the overall questionnaire with 23 items demonstrated poor overall fit to the Rasch model in the substantial deviation in the standardized item fit residual statistic (mean ± SD, –0.27±1.9). The significant chi-square test (χ²=293.52; df=138; P<.001) for item-trait interaction revealed misfit from Rasch model. Table 2 lists the overall summary of fit statistics. To locate problematic items that may cause misfit issues, we checked the individual item fit statistics. Items 1 and 23 were misfitting as the fit residual values were greater than 2.5. Individual item descriptions with Rasch solutions are shown in table 3.

Table 2.

Overall summary of Rasch statistics

Analysis	Sample Size		Item Fit Residual	Person Fit Residual	Chi-Square Interaction			PSI	Alpha	Unidimensionality T Tests
Analysis	With ext	Without ext	Mean ± SD	Mean ± SD	Value	df	P Value	PSI	Alpha	No. of significant tests	N	%
Original 23-item	392	384	–0.27 ± 1.9	–0.23 ± 0.78	293.52	138	<.0001	0.85	0.87	19	384	4.95
Revised 20-item	392	381	–0.38 ± 1.25	–0.25 ± 0.73	160.42	120	.008	0.85	0.87	12	381	3.15

Open in a new tab

Abbreviation: ext: extreme data. NOTE. The original work instability index contains 23 items. To achieve the Rasch model fit, 4 items, including items 1 and 23, were removed. Item 19 was deleted due to the issue of nonuniform DIF in the job classification. Item 11 was separated for men and women, and items 18 and 22 were separated for different affected regions due to the presence of uniform DIF. No disordered thresholds were found in the original or revised versions. After Bonferroni correction, the significant P value was .0027.

Table 3.

Individual item description with Rasch solutions

	Item Description	Solution	Rationale
1	I can get my job done, I’m just a lot slower	Remove	Fit residual 5.92
2	If I don't reduce my hours, I may have to give up work
3	I am very worried about my ability to keep working
4	I have pain or stiffness all the time at work
5	I don't have the stamina to work like I used to
6	I have used my holiday so that I don't have to go on sick leave
7	I push myself to go to work because I don't want to give in to my shoulder/elbow problem
8	Sometimes I can't face being at work all day
9	I have to say no to certain things at work
10	I've got to watch how much I do certain things at work
11	I have great difficulty opening some of the doors at work	Item split	Uniform DIF identified in sex
12	I have to allow myself extra time to do some jobs
13	It's very frustrating because I can't always do things at work
14	I feel I may have to give up work
15	I get on with work but afterwards I have a lot of pain
16	When I’m feeling tired all the time, work's a grind
17	I'd like another job, but I am restricted as to what I can do
18	I'm getting up earlier because of my shoulder/elbow problem	Item split	Uniform DIF identified in injury region
19	I get very stiff at work	Remove	Nonuniform DIF identified in job
20	I'm finding my job is about all I can manage
21	The stress of my job makes my shoulder/elbow problem flare
22	I'm finding any pressure on my hands is a problem	Item split	Uniform DIF identified in injury region
23	I get good days and bad days at work	Remove	Fit residual 2.86

Open in a new tab

Threshold

The inspection of the threshold map of WIS-23 suggested that no disordered issues were present, as expected given the dichotomous response option. Figure 1 shows the threshold map of the original WIS.

Fig 1 — The original thresholds map for the WIS-23. All thresholds are ordered and displayed on the map.

Targeting

Figure 2 shows the targeting of the WIS-23. The mean of person logits is equal to 0.24 (<1 logit), indicating that the scale item difficulty matches the abilities of the sample. In addition, the floor and ceiling effects were not detected because only 6% (<10 %) of the participants were on either end of the spectrum.

Fig 2 — Person-item distribution map for the WIS-23. The mean of person logits is equal to 0.24 (<1 logit), indicating that the scale item difficulty matches the abilities of the sample. In addition, the floor and ceiling effects were not detected as only 6% (<10%) of the participants were on either end of the spectrum.

DIF

The personal factors considered as potential sources of DIF (bias) were sex, age (19-49y and 50-73y), affected sides (left, right, bilateral), injured body area (wrist and hand, elbow and forearm, shoulder and arm, entire upper extremity), and 3 job categories (labor, office, and mixed). Uniform DIF due to sex bias was detected on item 11, as the curve representing expected value of item response for women was higher than the curve for men across most person locations (person ability). Figure 3 shows the uniform-DIF of item 11. Similar uniform DIFs were identified on items 18 and 22 for injured body area due to selection bias. Item 19 was removed due to the nonuniform DIF identified in different job categories. Table 3 lists the individual item description with Rasch solutions.

Fig 3 — Uniform DIF detected on item 11 across male vs female groups. The visual inspection was facilitated by plotting the item characteristic curve along with the person trait for given person factors.

Dimensionality

The WIS-23 met the assumption of unidimensionality as 4.95% (<5%) of the independent t tests were found to be significant at the 5% level (see table 2 for the overall summary of fit statistics).

Local independence

None of the item-to-item correlations exceeded the cutoff value, and the assumption of local independence was met in the WIS-23 (Supplemental Table S1, available online only at http://www.archives-pmr.org/).

Reliability

The PSI value was equal to 0.85 for the WIS-23. The Cronbach’s alpha was calculated as 0.87 without missing data points.

Revisiting the Rasch statistics

After removal of 3 items with unresolvable issues, including item misfit (items 1 and 23) and nonuniform DIF (item 19), the revised WIS questionnaire contained 20 items. The reanalyzed WIS-20 showed acceptable level of overall fit to the Rasch model (mean ± SD, –0.38±1.25) with a nonsignificant chi-square test (χ²=160.42; df=120; P=.008) for item-trait interaction (see table 2). No disordered thresholds were revealed during the reanalysis (fig 4) A mean value of 0.14 logits (<1 logit) revealed good targeting of the revised WIS. Flooring and ceiling effects were absent in the 20-item version (fig 5). The revised version was in accordance with the assumptions of dimensionality and local independency. Both PSI and reliability statistics remained the same after deleting 3 items (see table 2). Therefore, we conducted a logit transformation of the 20-item WIS summed scores to provide interval-level scaling. The conversions are presented in table 4.

Fig 4 — The reassessment of the thresholds shown by the ordered threshold map for the revised WIS. All thresholds are ordered and displayed on the map.

Fig 5 — Person-item distribution map for the revised 20-item version of the WIS. The mean of person logits is equal to 0.14 (<1 logit), indicating that the scale item difficulty matches the abilities of the sample. In addition, the floor and ceiling effects were not detected as only 8% (<10%) of the participants were on either end of the spectrum.

Table 4.

Transformation of WRUED-WIS raw scores to interval-level scores on a logit scale summed

Summed Score	Logit	Interval
0	–4.493	0.0
1	–3.47	2.3
2	–2.736	4.0
3	–2.21	5.2
4	–1.785	6.2
5	–1.42	7.0
6	–1.091	7.7
7	–0.788	8.4
8	–0.503	9.1
9	–0.23	9.7
10	0.035	10.3
11	0.299	10.9
12	0.564	11.5
13	0.838	12.1
14	1.126	12.8
15	1.438	13.5
16	1.787	14.3
17	2.192	15.2
18	2.691	16.3
19	3.372	17.9
20	4.303	20.0

Open in a new tab

Discussion

Our Rasch analysis of WIS-23 supports a 20-item version and provides a transformation of interval level scaling in injured workers with upper extremity conditions. This complements the findings from a previous study by Tang et al,¹² in which they recommended a 17-item WIS specifically for the upper extremity. They excluded items 1 and 23 among others, which were also recommended to be removed by the current Rasch analysis.

Our initial analysis indicated that there were no disordered thresholds. This was an expected finding as dichotomous response options are not susceptible to disordered thresholds, as reported in a previous Rasch analysis study.¹²

PSI values of >0.70 and >0.85 are considered acceptable for group and individual use, respectively. The PSI value for the 20-item WIS was 0.85, indicating that the WIS can discriminate at both levels and show similar discrimination as the WIS-23 (PSI=0.83).¹²

We had to delete 2 misfitting items, including items 1 (“I can get my job done, I’m just a lot slower”) and 23 (“I get good and bad days at work”), as they had fit residual greater than ±2.5. These items also misfit in a previous Rasch analysis,¹² which increases our confidence that there may be problems with these items that warrant their removal. Item 1 is a double-barreled question, as an answer of “no” could mean that the respondent cannot do their job tasks (eg, modified work) or that they can and are not slower (eg, can do normally but have pain). Double-barreled questions are insufficient in terms of content validity. Thus, our Rasch findings that question this item reinforce the need for careful content validity before items are included in a measure and provide justification for removal of the item. Item 23 was designed to indicate the fluctuation of symptoms during day-to-day work but not necessarily for respondents in the recovery phase.¹² These findings are similar to previous Rasch analyses. Because a single piece of evidence is too preliminary to justify large-scale adoption of a revised measure, this level of conceptual and independent verification is needed. We suggest that these combined concerns justify the removal of these 2 items.

Prior studies have suggested removal of items 8, 13, 20, and 22 to improve the Rasch model fit. However, in the current study, we found no significant deviation from the model fit and retained these items. Because work instability is a complex phenomenon and the questionnaire is already quickly answered due to its yes/no response options, we believe that an overly shortened item may not be sufficiently inclusive of the problems individuals experience. Therefore, where items do function well, we would prefer to retain them. Some differences between Rasch analyses in different studies can be expected due to differences in the health conditions, occupations, and demographics of different study samples. For example, item 22 (“I'm finding any pressure on my hands is a problem”) performed well in our sample in which 25.1% of the injured workers had claims related to the hand and wrist, but may not have been as relevant to patients in a previous study that only included patients with shoulder injuries.¹² As we see different Rasch analyses being performed on the same instrument, we have come to appreciate that variances in results are to be expected. Therefore, researchers wanting to optimize measurement in their sample should choose a Rasch solution that is most similar to their population. Permanent changes to measures should be considered where there are consistent findings that items do not perform well.

The third item eliminated from the 23-item version to create our 20-item WIS was item 19 (“I get very stiff at work”) since it exhibited nonuniform DIF based on job classification. Usually, nonuniform DIF occurs when the ICC indicates that the participants from different groups with identical ability levels have different probabilities of correctly responding to an item. In the current case, we classified jobs as labor, office work, and mixed. The onset of stiffness could vary between individuals depending on what type of activities they perform. For example, an office worker who works in front of a computer can develop spinal stiffness due to sustained postures, whereas a person with a manual job might have knee and hand stiffness that arises from overuse of muscles and joints. These could be quite different in nature and severity. Because the question is not very specific in terms of the source of stiffness, different jobs could have a different type of response or bias. However, unless this is a stable finding across studies, we are less certain of the recommendation for permanent removal.

Our study identified uniform DIF for item 11 (“I have great difficulty opening some of the doors at work”) for men and women. This did not result in removal of the item because we recognize that differences in response patterns could be due to the biological variations in strength, muscle fibers, etc³³ because the percentage of strength required for men and women to open a standard door would be different. Uniform DIF for areas of injury was also observed in items 18 (“I'm getting up earlier because of my shoulder/elbow problem”) and 22 (“I'm finding any pressure on my hands is a problem”) in the current study. A similar trend was observed in a previous study by Tang et al,¹² including uniform DIF for items 18 and 22 between rheumatoid arthritis patients and OA patients. This indicates that patients with different areas of injury in the upper extremity will respond differently to these 2 questions (items 18 and 22). Hence, these 2 items should be separated for different subgroups of patients with various areas of injury.

The transformation of the summed score to an interval-level score in our revised version enables the analysis of within- and between-subject differences such as analysis of variance since interval level scaling is an assumption for mathematical manipulations. Ordinal measures performed by many self-report instruments cannot guarantee that the intervals between different response options are equivalent and, therefore, it would be inappropriate to perform mathematical operations on these scores. We recognize that this is routinely done. However, it violates the assumptions of statistical tests and is a potential source of error.

Study limitations

A limitation of Rasch analysis is that it can be used to identify problematic items but cannot identify why an item does not perform well. We know that many currently used outcome measures do not have publications that clearly articulate the steps taken to ensure content validity of the items. Issues such as lack of clarity of items, double-barreled questions, and lack of relevance are surprisingly common in currently used outcome measures. Therefore, it is possible that Rasch analysis will indicate the need to delete an item that could be rehabilitated. Some differences exist across different Rasch solutions and only those items that consistently demonstrate measurement problems can be confidently removed³⁴ to avoid multiple conflicting versions of outcome measures. However, because shortened measures that remove nonfunctioning items can lessen burden and improve measurement, they should be adopted where sufficient evidence exists.

Implications

Further research should include cognitive interviewing and qualitative methods to further examine the content validity and clarity of the WIS items to complement statistical approaches and comparison of different Rasch solutions in the same sample to further examine proposed alternate forms, including complementary statistical analyses such as factor analyses and classic psychometric properties (eg, responsiveness). A comparative approach across samples and versions is needed to provide robust evidence on the “best” version of the WIS. At present, researchers should use the version of the WIS that provides the best measurement properties for their study sample.

Conclusions

In conclusion, we found through Rasch analysis that 3 problematic items of the WIS-23 could be removed to perform a well-functioning interval level scaled WIS-20 in injured workers with upper extremity musculoskeletal conditions. The 20-item WIS-WREUD demonstrated excellent item and person fit, unidimensionality, acceptable person separation index, and local independency.

Supplier

a.
RUMM 2030; RUMM Laboratory Pty Ltd.

Footnotes

Dr MacDermid was supported by a CIHR Chair in Gender, Work and Health and the Dr James Roth Research Chair in Musculoskeletal Measurement and Knowledge Translation.

Disclosures: none.

Supplementary Data

Supplemental Table S1

mmc1.pdf^{(204.7KB, pdf)}

References

1.Lerner D., Amick B.C., 3rd, Rogers W.H., Malspeis S., Bungay K., Cynn D. The Work Limitations Questionnaire. Med Care. 2001;39:72–85. doi: 10.1097/00005650-200101000-00009. [DOI] [PubMed] [Google Scholar]
2.Lerner D., Reed J.I., Massarotti E. The Work Limitations Questionnaire’s validity and reliability among patients with osteoarthritis. J Clin Epidemiol. 2002;55:197–208. doi: 10.1016/s0895-4356(01)00424-3. [DOI] [PubMed] [Google Scholar]
3.Escorpizo R., Cieza A., Beaton D., Boonen A. Content comparison of worker productivity questionnaires in arthritis and musculoskeletal conditions using the international classification of functioning, disability, and health framework. J Occup Rehabil. 2009;19:382–397. doi: 10.1007/s10926-009-9193-0. [DOI] [PubMed] [Google Scholar]
4.Koopman C., Pelletier K.R., Murray J.F. Stanford Presenteeism Scale: health status and employee productivity. J Occup Environ Med. 2002;44:14–20. doi: 10.1097/00043764-200201000-00004. [DOI] [PubMed] [Google Scholar]
5.Baldwin M.L., Butler R.J. Upper extremity disorders in the workplace: costs and outcomes beyond the first return to work. J Occup Rehabil. 2006;16:303–323. doi: 10.1007/s10926-006-9043-2. [DOI] [PubMed] [Google Scholar]
6.Workplace Safety and Insurance Board Workplace Safety and Insurance Board 2018 annual report. https://www.wsib.ca/en/2018-annual-report-highlights Available at: Accessed January 27, 2021.
7.Gilworth G., Chamberlain M.A., Harvey A. Development of a work instability scale for rheumatoid arthritis. Arthritis Rheum. 2003;49:349–354. doi: 10.1002/art.11114. [DOI] [PubMed] [Google Scholar]
8.Van Oostrom S.H., Driessen M.T., De Vet H.C.W. Workplace interventions for preventing work disability. Cochrane Database Syst Rev. 2009;2:CD006955. doi: 10.1002/14651858.CD006955.pub2. [DOI] [PubMed] [Google Scholar]
9.Dorman P. The economics of safety, health, and well-being at work: an overview. https://www.ilo.org/wcmsp5/groups/public/---ed_protect/---protrav/---safework/documents/publication/wcms_110382.pdf Available at: Accessed January 27, 2021.
10.Tang K., Beaton D.E., Lacaille D. The Work Instability Scale for Rheumatoid Arthritis (RA-WIS): does it work in osteoarthritis? Qual Life Res. 2010;19:1057–1068. doi: 10.1007/s11136-010-9656-y. [DOI] [PubMed] [Google Scholar]
11.Tang K., Beaton D.E., Boonen A., Gignac M.A.M., Bombardier C. Measures of work disability and productivity: Rheumatoid Arthritis Specific Work Productivity Survey (WPS-RA), Workplace Activity Limitations Scale (WALS), Work Instability Scale for Rheumatoid Arthritis (RA-WIS), Work Limitations Questionnaire (WLQ), and Work Productivity and Activity Impairment Questionnaire (WPAI) Arthritis Care Res. 2011;63:S337–S349. doi: 10.1002/acr.20633. [DOI] [PubMed] [Google Scholar]
12.Tang K., Beaton D.E., Gignac M.A.M., Bombardier C. Rasch analysis informed modifications to the Work Instability Scale for Rheumatoid Arthritis for use in work-related upper limb disorders. J Clin Epidemiol. 2011;64:1242–1251. doi: 10.1016/j.jclinepi.2011.02.002. [DOI] [PubMed] [Google Scholar]
13.Gilworth G., Smyth M.G., Smith J., Tennant A. The manual work instability scale: development and validation. Occup Med (Lond) 2016;66:300–304. doi: 10.1093/occmed/kqv217. [DOI] [PubMed] [Google Scholar]
14.Gilworth G., Carey A., Eyres S. Screening for job loss: development of a work instability scale for traumatic brain injury. Brain Inj. 2006;20:835–843. doi: 10.1080/02699050600832221. [DOI] [PubMed] [Google Scholar]
15.Lilley R., Davie G., Langley J., Ameratunga S., Derrett S. Do outcomes differ between work and non-work-related injury in a universal injury compensation system? Findings from the New Zealand Prospective Outcomes of Injury Study. BMC Public Health. 2013;13:1–9. doi: 10.1186/1471-2458-13-995. [DOI] [PMC free article] [PubMed] [Google Scholar]
16.Cano S.J., Klassen A.F., Scott A.M., Cordeiro P.G., Pusic A.L. The BREAST-Q: further validation in independent clinical samples. Plast Reconstr Surg. 2012;129:293–302. doi: 10.1097/PRS.0b013e31823aec6b. [DOI] [PubMed] [Google Scholar]
17.Pallant J.F., Tennant A. An introduction to the Rasch measurement model: an example using the Hospital Anxiety and Depression Scale (HADS) Br J Clin Psychol. 2007;46:1–18. doi: 10.1348/014466506x96931. [DOI] [PubMed] [Google Scholar]
18.Chen W.H., Lenderking W., Jin Y., Wyrwich K.W., Gelhorn H., Revicki D.A. Is Rasch model analysis applicable in small sample size pilot studies for assessing item characteristics? An example using PROMIS pain behavior item bank data. Qual Life Res. 2014;23:485–493. doi: 10.1007/s11136-013-0487-5. [DOI] [PubMed] [Google Scholar]
19.Smith A.B., Rush R., Fallowfield L.J., Velikova G., Sharpe M. Rasch fit statistics and sample size considerations for polytomous data. BMC Med Res Methodol. 2008;8:33. doi: 10.1186/1471-2288-8-33. [DOI] [PMC free article] [PubMed] [Google Scholar]
20.Jerosch-Herold C, Chester R, Shepstone L, Vincent JI, MacDermid JC. An evaluation of the structural validity of the shoulder pain and disability index (SPADI) using the Rasch model. Qual Life Res 2018;27:389-400. [DOI] [PubMed]
21.McHorney C.A., Tarlov A.R. Individual-patient monitoring in clinical practice: are available health status surveys adequate? Qual Life Res. 1995;4:293–307. doi: 10.1007/BF01593882. [DOI] [PubMed] [Google Scholar]
22.Tang K., Beaton D.E., Amick B.C., 3rd, Hogg-Johnson H., Côté P., Loisel P. Confirmatory factor analysis of the Work Limitations Questionnaire (WLQ-25 ) in workers’ compensation claimants with chronic upper-limb disorders. 2013;23:228–238. doi: 10.1007/s10926-012-9397-6. [DOI] [PubMed] [Google Scholar]
23.Kersten P., White P.J., Tennant A. Is the pain visual analogue scale linear and responsive to change? An exploration using rasch analysis. PLoS One. 2014;9 doi: 10.1371/journal.pone.0099485. [DOI] [PMC free article] [PubMed] [Google Scholar]
24.Packham T., Macdermid J.C. Measurement properties of the patient-rated wrist and hand evaluation: Rasch analysis of responses from a traumatic hand injury population. J Hand Ther. 2013;26:216–224. doi: 10.1016/j.jht.2012.12.006. [DOI] [PubMed] [Google Scholar]
25.Christensen K.B., Makransky G., Horton M. Critical values for Yen’s Q3: identification of local dependence in the Rasch model using residual correlations. Appl Psychol Meas. 2017;41:178–194. doi: 10.1177/0146621616677520. [DOI] [PMC free article] [PubMed] [Google Scholar]
26.Walton D.M., MacDermid J.C. A brief 5-item version of the Neck Disability Index shows good psychometric properties. Health Qual Life Outcomes. 2013;11:108. doi: 10.1186/1477-7525-11-108. [DOI] [PMC free article] [PubMed] [Google Scholar]
27.Nilsson Å.L., Tennant A. Past and present issues in Rasch analysis: the Functional Independence Measure (FIMTM) revisited. J Rehabil Med. 2011;43:884–891. doi: 10.2340/16501977-0871. [DOI] [PubMed] [Google Scholar]
28.Gothwal V.K., Wright T., Lamoureux E.L., Pesudovs K. Psychometric properties of visual functioning index using Rasch analysis. Acta Ophthalmol. 2010;88:797–803. doi: 10.1111/j.1755-3768.2009.01562.x. [DOI] [PubMed] [Google Scholar]
29.Koopmans L., Bernaards C.M., Hildebrandt V.H., van Buuren S., van der Beek A.J., de Vet H.C.W. Improving the Individual Work Performance Questionnaire using Rasch analysis. J Appl Meas. 2014;15:160–175. [PubMed] [Google Scholar]
30.Lamoureux E.L., Pallant J.F., Pesudovs K., Hassell J.B., Keeffe J.E. The impact of vision impairment questionnaire: an evaluation of its measurement properties using Rasch analysis. Investig Ophthalmol Vis Sci. 2006;47:4732–4741. doi: 10.1167/iovs.06-0220. [DOI] [PubMed] [Google Scholar]
31.Tavakol M., Dennick R. Making sense of Cronbach’s alpha. Int J Med Educ. 2011;2:53–55. doi: 10.5116/ijme.4dfb.8dfd. [DOI] [PMC free article] [PubMed] [Google Scholar]
32.Lambert S.D., Pallant J.F., Boyes A.W., King M.T., Britton B., Girgis A. A Rasch analysis of the Hospital Anxiety and Depression Scale (HADS) among cancer survivors. Psychol Assess. 2013;25:379–390. doi: 10.1037/a0031154. [DOI] [PubMed] [Google Scholar]
33.Miller E.A., MacDougall J.D., Tarnopolsky M.A., Sale D.G. Gender differences in strength and muscle fiber characteristics. Eur J Appl Physiol Occup Physiol. 1993;66:254–262. doi: 10.1007/BF00235103. [DOI] [PubMed] [Google Scholar]
34.Lu Z., MacDermid J.C., Nazari G. Agreement between original and Rasch-approved Neck Disability Index. BMC Med Res Methodol. 2020;20:180. doi: 10.1186/s12874-020-01069-w. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplemental Table S1

mmc1.pdf^{(204.7KB, pdf)}

[bib1] 1.Lerner D., Amick B.C., 3rd, Rogers W.H., Malspeis S., Bungay K., Cynn D. The Work Limitations Questionnaire. Med Care. 2001;39:72–85. doi: 10.1097/00005650-200101000-00009. [DOI] [PubMed] [Google Scholar]

[bib2] 2.Lerner D., Reed J.I., Massarotti E. The Work Limitations Questionnaire’s validity and reliability among patients with osteoarthritis. J Clin Epidemiol. 2002;55:197–208. doi: 10.1016/s0895-4356(01)00424-3. [DOI] [PubMed] [Google Scholar]

[bib3] 3.Escorpizo R., Cieza A., Beaton D., Boonen A. Content comparison of worker productivity questionnaires in arthritis and musculoskeletal conditions using the international classification of functioning, disability, and health framework. J Occup Rehabil. 2009;19:382–397. doi: 10.1007/s10926-009-9193-0. [DOI] [PubMed] [Google Scholar]

[bib4] 4.Koopman C., Pelletier K.R., Murray J.F. Stanford Presenteeism Scale: health status and employee productivity. J Occup Environ Med. 2002;44:14–20. doi: 10.1097/00043764-200201000-00004. [DOI] [PubMed] [Google Scholar]

[bib5] 5.Baldwin M.L., Butler R.J. Upper extremity disorders in the workplace: costs and outcomes beyond the first return to work. J Occup Rehabil. 2006;16:303–323. doi: 10.1007/s10926-006-9043-2. [DOI] [PubMed] [Google Scholar]

[bib6] 6.Workplace Safety and Insurance Board Workplace Safety and Insurance Board 2018 annual report. https://www.wsib.ca/en/2018-annual-report-highlights Available at: Accessed January 27, 2021.

[bib7] 7.Gilworth G., Chamberlain M.A., Harvey A. Development of a work instability scale for rheumatoid arthritis. Arthritis Rheum. 2003;49:349–354. doi: 10.1002/art.11114. [DOI] [PubMed] [Google Scholar]

[bib8] 8.Van Oostrom S.H., Driessen M.T., De Vet H.C.W. Workplace interventions for preventing work disability. Cochrane Database Syst Rev. 2009;2:CD006955. doi: 10.1002/14651858.CD006955.pub2. [DOI] [PubMed] [Google Scholar]

[bib9] 9.Dorman P. The economics of safety, health, and well-being at work: an overview. https://www.ilo.org/wcmsp5/groups/public/---ed_protect/---protrav/---safework/documents/publication/wcms_110382.pdf Available at: Accessed January 27, 2021.

[bib10] 10.Tang K., Beaton D.E., Lacaille D. The Work Instability Scale for Rheumatoid Arthritis (RA-WIS): does it work in osteoarthritis? Qual Life Res. 2010;19:1057–1068. doi: 10.1007/s11136-010-9656-y. [DOI] [PubMed] [Google Scholar]

[bib11] 11.Tang K., Beaton D.E., Boonen A., Gignac M.A.M., Bombardier C. Measures of work disability and productivity: Rheumatoid Arthritis Specific Work Productivity Survey (WPS-RA), Workplace Activity Limitations Scale (WALS), Work Instability Scale for Rheumatoid Arthritis (RA-WIS), Work Limitations Questionnaire (WLQ), and Work Productivity and Activity Impairment Questionnaire (WPAI) Arthritis Care Res. 2011;63:S337–S349. doi: 10.1002/acr.20633. [DOI] [PubMed] [Google Scholar]

[bib12] 12.Tang K., Beaton D.E., Gignac M.A.M., Bombardier C. Rasch analysis informed modifications to the Work Instability Scale for Rheumatoid Arthritis for use in work-related upper limb disorders. J Clin Epidemiol. 2011;64:1242–1251. doi: 10.1016/j.jclinepi.2011.02.002. [DOI] [PubMed] [Google Scholar]

[bib13] 13.Gilworth G., Smyth M.G., Smith J., Tennant A. The manual work instability scale: development and validation. Occup Med (Lond) 2016;66:300–304. doi: 10.1093/occmed/kqv217. [DOI] [PubMed] [Google Scholar]

[bib14] 14.Gilworth G., Carey A., Eyres S. Screening for job loss: development of a work instability scale for traumatic brain injury. Brain Inj. 2006;20:835–843. doi: 10.1080/02699050600832221. [DOI] [PubMed] [Google Scholar]

[bib15] 15.Lilley R., Davie G., Langley J., Ameratunga S., Derrett S. Do outcomes differ between work and non-work-related injury in a universal injury compensation system? Findings from the New Zealand Prospective Outcomes of Injury Study. BMC Public Health. 2013;13:1–9. doi: 10.1186/1471-2458-13-995. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib16] 16.Cano S.J., Klassen A.F., Scott A.M., Cordeiro P.G., Pusic A.L. The BREAST-Q: further validation in independent clinical samples. Plast Reconstr Surg. 2012;129:293–302. doi: 10.1097/PRS.0b013e31823aec6b. [DOI] [PubMed] [Google Scholar]

[bib17] 17.Pallant J.F., Tennant A. An introduction to the Rasch measurement model: an example using the Hospital Anxiety and Depression Scale (HADS) Br J Clin Psychol. 2007;46:1–18. doi: 10.1348/014466506x96931. [DOI] [PubMed] [Google Scholar]

[bib18] 18.Chen W.H., Lenderking W., Jin Y., Wyrwich K.W., Gelhorn H., Revicki D.A. Is Rasch model analysis applicable in small sample size pilot studies for assessing item characteristics? An example using PROMIS pain behavior item bank data. Qual Life Res. 2014;23:485–493. doi: 10.1007/s11136-013-0487-5. [DOI] [PubMed] [Google Scholar]

[bib19] 19.Smith A.B., Rush R., Fallowfield L.J., Velikova G., Sharpe M. Rasch fit statistics and sample size considerations for polytomous data. BMC Med Res Methodol. 2008;8:33. doi: 10.1186/1471-2288-8-33. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib20] 20.Jerosch-Herold C, Chester R, Shepstone L, Vincent JI, MacDermid JC. An evaluation of the structural validity of the shoulder pain and disability index (SPADI) using the Rasch model. Qual Life Res 2018;27:389-400. [DOI] [PubMed]

[bib21] 21.McHorney C.A., Tarlov A.R. Individual-patient monitoring in clinical practice: are available health status surveys adequate? Qual Life Res. 1995;4:293–307. doi: 10.1007/BF01593882. [DOI] [PubMed] [Google Scholar]

[bib22] 22.Tang K., Beaton D.E., Amick B.C., 3rd, Hogg-Johnson H., Côté P., Loisel P. Confirmatory factor analysis of the Work Limitations Questionnaire (WLQ-25 ) in workers’ compensation claimants with chronic upper-limb disorders. 2013;23:228–238. doi: 10.1007/s10926-012-9397-6. [DOI] [PubMed] [Google Scholar]

[bib23] 23.Kersten P., White P.J., Tennant A. Is the pain visual analogue scale linear and responsive to change? An exploration using rasch analysis. PLoS One. 2014;9 doi: 10.1371/journal.pone.0099485. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib24] 24.Packham T., Macdermid J.C. Measurement properties of the patient-rated wrist and hand evaluation: Rasch analysis of responses from a traumatic hand injury population. J Hand Ther. 2013;26:216–224. doi: 10.1016/j.jht.2012.12.006. [DOI] [PubMed] [Google Scholar]

[bib25] 25.Christensen K.B., Makransky G., Horton M. Critical values for Yen’s Q3: identification of local dependence in the Rasch model using residual correlations. Appl Psychol Meas. 2017;41:178–194. doi: 10.1177/0146621616677520. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib26] 26.Walton D.M., MacDermid J.C. A brief 5-item version of the Neck Disability Index shows good psychometric properties. Health Qual Life Outcomes. 2013;11:108. doi: 10.1186/1477-7525-11-108. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib27] 27.Nilsson Å.L., Tennant A. Past and present issues in Rasch analysis: the Functional Independence Measure (FIMTM) revisited. J Rehabil Med. 2011;43:884–891. doi: 10.2340/16501977-0871. [DOI] [PubMed] [Google Scholar]

[bib28] 28.Gothwal V.K., Wright T., Lamoureux E.L., Pesudovs K. Psychometric properties of visual functioning index using Rasch analysis. Acta Ophthalmol. 2010;88:797–803. doi: 10.1111/j.1755-3768.2009.01562.x. [DOI] [PubMed] [Google Scholar]

[bib29] 29.Koopmans L., Bernaards C.M., Hildebrandt V.H., van Buuren S., van der Beek A.J., de Vet H.C.W. Improving the Individual Work Performance Questionnaire using Rasch analysis. J Appl Meas. 2014;15:160–175. [PubMed] [Google Scholar]

[bib30] 30.Lamoureux E.L., Pallant J.F., Pesudovs K., Hassell J.B., Keeffe J.E. The impact of vision impairment questionnaire: an evaluation of its measurement properties using Rasch analysis. Investig Ophthalmol Vis Sci. 2006;47:4732–4741. doi: 10.1167/iovs.06-0220. [DOI] [PubMed] [Google Scholar]

[bib31] 31.Tavakol M., Dennick R. Making sense of Cronbach’s alpha. Int J Med Educ. 2011;2:53–55. doi: 10.5116/ijme.4dfb.8dfd. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib32] 32.Lambert S.D., Pallant J.F., Boyes A.W., King M.T., Britton B., Girgis A. A Rasch analysis of the Hospital Anxiety and Depression Scale (HADS) among cancer survivors. Psychol Assess. 2013;25:379–390. doi: 10.1037/a0031154. [DOI] [PubMed] [Google Scholar]

[bib33] 33.Miller E.A., MacDougall J.D., Tarnopolsky M.A., Sale D.G. Gender differences in strength and muscle fiber characteristics. Eur J Appl Physiol Occup Physiol. 1993;66:254–262. doi: 10.1007/BF00235103. [DOI] [PubMed] [Google Scholar]

[bib34] 34.Lu Z., MacDermid J.C., Nazari G. Agreement between original and Rasch-approved Neck Disability Index. BMC Med Res Methodol. 2020;20:180. doi: 10.1186/s12874-020-01069-w. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Evaluation of the Structural Validity of the Work Instability Scale Using the Rasch Model

Ze Lu, Msc

Joshua I Vincent, MPT, PhD

Joy C MacDermid, PT, PhD

Abstract

Objective

Design

Setting

Participants

Interventions

Main Outcome Measures

Results

Conclusion

Methods

Rasch analysis

Test of fit

Threshold

Targeting

DIF

Dimensionality

Local independence

Reliability

Results

Study participants

Table 1.

Test of fit

Table 2.

Table 3.

Threshold

Fig 1.

Targeting

Fig 2.

DIF

Fig 3.

Dimensionality

Local independence

Reliability

Revisiting the Rasch statistics

Fig 4.

Fig 5.

Table 4.

Discussion

Study limitations

Implications

Conclusions

Supplier

Footnotes

Supplementary Data

References

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases