Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2022 Jul 1.
Published in final edited form as: Arch Phys Med Rehabil. 2021 Mar 5;102(7):1317–1323. doi: 10.1016/j.apmr.2021.02.014

Crosswalking the Patient-Reported Outcomes Measurement Information System Physical Function, Pain Interference, and Pain Intensity Scores to the Roland-Morris Disability Questionnaire and the Oswestry Disability Index

Maria Orlando Edelen 1, Anthony Rodriguez 2, Patricia Herman 2, Ron D Hays 3
PMCID: PMC8263492  NIHMSID: NIHMS1695069  PMID: 33684368

Abstract

Objective:

To link scores from two condition-specific measures for chronic lower back pain (CLBP), the Oswestry Disability Index (ODI) and the Roland-Morris Disability Questionnaire (RMDQ), to Patient Reported Outcomes Measurement Information System (PROMIS®) physical function, pain interference, and pain intensity scores.

Design:

Ordinary least squares (OLS) regression analyses of existing data to link the PROMIS scores with the ODI and RMDQ.

Setting:

NA

Participants:

Data are from three samples of adults with CLBP: The Center for Excellence in Research for Complementary and Integrative Health (CERC) Study (N=1677); the Assessment of Chiropractic Treatment for Low Back Pain and Smoking Cessation in Military Active Duty Personnel (ACT; N=384); and the pain subsample of the PROMIS 1 Wave 2 Pain and Depression study (PROMIS 1 W2; N=218).

Interventions:

NA

Main Outcome Measures:

PROMIS physical function, pain interference, and pain intensity (CERC, ACT and PROMIS 1 W2); ODI (CERC and PROMIS 1 W2); and RMDQ (ACT and PROMIS 1 W2).

Results:

In predicting PROMIS scores, the ODI model R2 values ranged from 0.26-0.56 and the RMDQ model R2 values ranged from 0.13-0.50. ODI and RMDQ models were least precise in predicting the PROMIS pain intensity score (R2 values ranged from 0.13-0.41) relative to the other PROMIS scores. Models with the three PROMIS scores as predictors yielded R2 values ranging from 0.64 – 0.68 and 0.46 – 0.58 for the ODI and RMDQ respectively. Models using combined data from two studies (i.e., PROMIS 1 W2 and ACT, or PROMIS 1 W2 and CERC) tended to be more precise than models using only a single study sample.

Conclusion:

Model results reported here to can be used translate PROMIS physical function, pain interference, and pain intensity scores to and from the ODI and RMDQ. The empirical linkages can facilitate comparisons across CLBP interventions and broaden interpretation of study results.

Keywords: Chronic lower back pain; Patient reported outcome measures; Oswestry Disability Index; Roland-Morris Disability Questionnaire; Patient Reported Outcomes Measurement Information System (PROMIS®); Physical function, pain interference, and pain intensity scores; Score linking; Score crosswalks


A variety of patient-reported outcome measures (PROMs) have been used to evaluate the effectiveness of interventions for chronic low back pain (CLBP).1-10 The Oswestry Disability Index (ODI)11 and the Roland-Morris Disability Questionnaire (RMDQ)12 are two PROMs used specifically for CLBP. The ODI is focused on disability across a range of domains (e.g., physical function, pain, sleep), while the RMDQ is focused on limitations in physical function due to back pain.13

The U.S. National Institutes of Health (NIH) has encouraged routine use of the Patient-Reported Outcomes Measurement Information System (PROMIS®) measures. PROMIS measures, developed within an item response theory framework,14, 15 have demonstrated strong psychometric properties in the general population and across a number of clinical subgroups.16 The PROMIS-2917 profile measure assesses pain intensity, physical function, fatigue, pain interference, depressive symptoms, anxiety, ability to participate in social roles and activities, and sleep disturbance.

Although all PROMIS-29 scales are potentially useful, the physical function, pain interference, and pain intensity constructs are most relevant to assess limitations directly due to CLBP and potential improvements in functioning due to therapeutic interventions. PROMIS physical function has also been shown to perform at least as well as the RMDQ among CLBP patients.18 Additionally, an NIH Research Taskforce proposed the Impact Stratification Score (ISS) as the sum of PROMIS-29 physical function, pain interference, and pain intensity scores.19

Given the popularity of the ODI and RMDQ,20 the usefulness of PROMIS measures such as the PROMIS-29 will be significantly enhanced with the availability of empirical crosswalks from one set of scores to the other so that CLBP researchers using PROMIS scores can interpret their results in the context of familiar outcomes. Crosswalks will also serve to help CLBP researchers integrate results from studies using the ODI and RMDQ.

As part of a larger study on chronic back pain,20 this paper reports results on a preliminary set of empirical links between the ODI and RMDQ and the three PROMIS scores in the ISS. In addition to facilitating crosswalks for study comparisons, results from this paper can be used in the interpretation of meta-analyses of studies.21, 22

Using data from three large samples of adults with CLBP, we examine associations of the three PROMIS scores (physical function, pain interference, pain intensity) with the ODI and the RMDQ. We then derive equations to predict these PROMIS scores from the ODI and the RMDQ as well as equations to predict the ODI and the RMDQ from the PROMIS scores.

Methods

Data sources

The Center for Excellence in Research for Complementary and Integrative Health (CERC)23, 24 data were collected on patients being treated for CLBP and chronic neck pain (CNP; n=2024) and included three subgroups: CLBP only (n=518), CNP only (n=347), and CLBP + CNP (n=1159). For the present study, we use only the baseline data and exclude those indicating the presence of CNP only (n=347) resulting in a sample of 1677. The mean age of the 1677 respondents was 49 (SD=15) ranging from 21 to 95 years of age and 71% female. The sample was predominantly non-Hispanic White (83%) with relatively low rates for Hispanic White (3%), non-Hispanic: Black (2%), Asian (2%), Native Alaskan (2%), and multi-racial (3%); and Other (5%). Assessment of Chiropractic Treatment for Low Back Pain and Smoking Cessation in Military Active Duty Personnel (ACT)25-27 data (n=750) were collected longitudinally on active military personnel participating in chiropractic clinical trials for low back pain (LBP) and comprised three subgroups: chronic LBP (CLBP; n=384), acute LBP (n=287), and subacute LBP (n=79). This dataset has multiple timepoints, but only the baseline data and the CLBP subgroup are used in this paper. The mean age of the CLBP subgroup of participants at baseline was 32 (SD=8) ranging from 18 to 50 years of age and included 24.7% females. The respondents classified themselves as non-Hispanic: White (53%), Black (18%), Asian (5%), other (3%); Hispanic (18%); or unspecified (2%).

The pain subsample of the PROMIS 1 Wave 2 Pain and Depression study (PROMIS 1 W2):16, 28 (n=218) was developed to test the utility of computer adaptive tests (CATs) with individuals diagnosed with back pain with or without sciatica of at least 6 weeks duration who were scheduled for epidural steroid injections. The median age group was 55-59 years and included 56% females. The sample consisted of non-Hispanic: Whites (84%), Blacks (3%), Asian (1%), Native Hawaiian or Pacific Islander (0.5%), multi-racial (6%); Hispanic: Whites (1%), multi-racial (4%); or unknown (1%).

Measures

PROMIS scales – included in CERC, ACT and PROMIS 1 W2 datasets. The CERC and ACT include the PROMIS-29 whereas the PROMIS 1 W2 sample assesses PROMIS scores via CAT.

Physical Function:

Capability of physical activities including functioning of upper and lower extremities, and central regions (neck, back), and instrumental activities of daily living, such as running errands.29 Scores are reported on a T-score metric based on a general population mean of 50 (SD=10). Higher scores indicate better functioning.

Pain Interference:

Consequences of pain on relevant aspects of one’s life. This includes the extent to which pain hinders engagement with social, cognitive, emotional, physical, and recreational activities.30 Scores are reported on a T-score metric based on a U.S. general population mean of 50 (SD=10). Higher scores indicate more pain interference.

Pain Intensity:

A single item measure reflecting how much a person hurts, on average over the past 7 days on a scale from 0-10. Higher scores indicate higher pain intensity. Although the 0–10 item has been collapsed to five categories (0 = 1; 1–3 = 2; 4–6 = 3; 7–9 = 4; 10 = 5) in some prior PROMIS studies,31, 32 we preserved the 11 categories of information. For the PROMIS 1 Wave 2 dataset, average pain was asked for both back and leg pain (i.e., In the past 7 days, when you had BACK pain, how would you rate your average pain; and In the past 7 days, when you had LEG pain, how would you rate your average pain). All respondents had either back pain only or both back and leg pain and answered both questions. For the purposes of these analyses, the maximum score of the two was selected.

ODI – included in CERC and PROMIS 1 W2 datasets.

The ODI is a measure of functional disability consisting of 10 items assessing pain intensity, personal care, lifting, walking, sitting, standing, sleeping, sex life (if applicable), social life, and traveling. Response options range from 0 to 5 with higher scores indicating greater disability.

The scale is scored by summing scores across all items, dividing the total score by the maximum possible (e.g., 50 if all 10 items are answered) and then multiplying by 100. The ODI score can be classified as: <20 = minimal disability, 21-40 = moderate, 41-60 = severe, 61-80 = crippled, and 81-100 = bedbound or exaggerating symptoms.11

RMDQ – included in ACT and PROMIS 1 W2 datasets.

The RMDQ is a 24-item measure assessing physical disability due to lower back pain. The items were chosen from the Sickness Impact Profile (SIP)33 and reflect physical function content directly relevant to lower back pain. Items are dichotomous and yield a total summary score ranging from 0 (no disability) to 24 (maximum disability).

Analyses

The analyses characterize relationships between the three PROMIS scores (physical function, pain interference, pain intensity) and the ODI and RMDQ scores. Following standard recommendations,34, 35 a minimum bivariate correlation between scales of 0.80 is required for latent variable equating approaches such as IRT co-calibration. For correlations lower than 0.80, ordinary least squares (OLS) regression models with linear equating36, 37 are recommended. OLS models were evaluated in terms of R2, and the normalized mean absolute error (NMAE). The NMAE provides information regarding the average deviation between the observed and predicted scores divided by the standard deviation of the observed score. Lower values of the NMAE indicate better performance.

We first evaluated the ODI and RMDQ in predicting the three PROMIS scores (physical function, pain interference, pain intensity). Next, we evaluated the three PROMIS scores in predicting the ODI and RMDQ. We estimated all models within each sample for all respondents with non-missing scores. We also report results for the combined samples to provide a reference for more heterogeneous patient groups with a wide range of disability. Further, we report the full model equations so that they can be applied by researchers to predict PROMIS scores from ODI and RMDQ and to predict ODI and RMDQ from PROMIS scores.

Results

Mean scores and correlations among the PROMs are shown in Tables 1 and 2, respectively, for the three samples individually and combined. Respondents in all three samples report lower than average physical functioning and higher than average pain interference (relative to the U.S. population mean of 50). Notably, the PROMIS 1 W2 sample displays lower physical functioning and higher pain interference than the other two samples. This trend of more disability among the PROMIS 1 W2 sample is also evident in the pain intensity scores, ODI disability scores, and RMDQ scores. When ACT and CERC samples are combined with PROMIS 1 W2, scores reflect more disability than in CERC or ACT alone.

Table 1.

Mean and standard deviation of PROMIS Physical Function, Pain Interference, and Pain Intensity scores, Oswestry Disability Index (ODI) scores, and Roland-Morris Disability Questionnaire (RMDQ) scores for each study sample and for combined datasets

PF PI Avg Pain ODI RMDQ
PROMIS I W2 (n=218) 37.5 (5.7) 64.2 (6.6) 6.5 (1.9) 37.4 (15.7) 13.0 (5.6)*
CERC (n=1677) 45.4 (7.2) 55.1 (7.3) 3.9 (2.0) 20.5 (12.8) NA
ACT (n=384) 44.1 (6.7) 58.9 (6.3) 5.5 (1.9) NA 9.0 (5.4)
PROMIS + CERC (n=1,895) 44.4 (7.5) 56.3 (7.8) 4.2 (2.2) 22.4 (14.2) NA
PROMIS + ACT (n=563) 42.0 (7.1) 60.5 (6.8) 5.2 (2.1) NA 10.3 (5.7)

NOTE: CERC= Center for Excellence in Research for Complementary and Integrative Health; ACT= Assessment of Chiropractic Treatment for Low Back Pain and Smoking Cessation in Military Active Duty Personnel; PROMIS I W2 = PROMIS 1 Wave 2 Pain and Depression study. NA=Not Administered; PF = PROMIS Physical Function T score; PI = PROMIS Pain Interference T score; Avg Pain = PROMIS Pain Intensity score.

*

Only n=179 of the PROMIS 1 W2 sample completed the RMDQ.

Table 2.

Product-moment correlations and sample sizes among PROMIS Physical Function, Pain Interference, and Average Pain Intensity scores, Oswestry Disability Index (ODI) scores, and Roland-Morris Disability Questionnaire (RMDQ) Scores in each of three samples individually and combined.

PF PI Avg Pain RMDQ PROMIS I W2
ODI PROMIS I W2 −.69
(n=212)
.73
(n=212)
.51
(n=212)
.68
(n=175)
ODI CERC −.71
(n=1527)
.71
(n=1527)
.59
(n=1674)
NA
ODI CERC+PROMIS 1 W2 −.74
(n=1739)
.75
(n=1739)
.64
(n=1886)
NA
RMDQ PROMIS I W2 −.61
(n=179)
.61
(n=179)
.37
(n=179)
NA
RMDQ ACT −.69
(n=384)
.67
(n=384)
.36
(n=384)
NA
RMDQ ACT +PROMIS 1 W2 −.71
(n=563)
.69
(n=563)
.45
(n=563)
NA

NOTE: All correlations p<.001. CERC= Center for Excellence in Research for Complementary and Integrative Health; ACT= Assessment of Chiropractic Treatment for Low Back Pain and Smoking Cessation in Military Active Duty Personnel; PROMIS I W2 = PROMIS 1 Wave 2 Pain and Depression study; NA=Not Applicable; PF = PROMIS Physical Function T score; PI = PROMIS Pain Interference T score; Avg Pain = PROMIS Pain Intensity score. PF, PI, and Avg Pain scores are from the samples noted in the corresponding first column (e.g. the PF score that is correlated with the ODI from PROMIS I W2 comes from the PROMIS I W2 sample).

The product-moment correlations among the PROMs (Table 2) show that the ODI and RMDQ are more strongly associated with PROMIS physical function and pain interference than with pain intensity, and this is especially true for the RMDQ. The association between the ODI and RMDQ is close in magnitude to that between the ODI and PROMIS physical function. Also noteworthy is the consistent increase in inter-relationships for combined samples relative to individual samples. This is due to the increase in the range of values from combining datasets. Because none of the correlations meet the 0.80 criterion for latent variable equating, we use OLS with linear equating to link the scales.

Predicting PROMIS scores from the ODI and RMDQ total scores

ODI Total score models

We estimated models predicting the three PROMIS scores from the ODI total score using data from PROMIS 1 W2 and CERC samples (see Table 3). In these models, R2 values ranged from 0.26-0.56 with NMAE ranging from 0.57 to 0.75 standard deviations. Models based on combined data performed better than individual sample models, and models predicting PROMIS physical function and pain interference scales were better than those predicting pain intensity. Specifically, for physical function, overall R2 ranged from 0.47 – 0.55 and NMAE ranged from 0.58 to 0.63. Similar patterns were observed for pain interference. The ODI score did not perform well in predicting PROMIS pain intensity, where R2 values ranged from 0.26-0.41 and NMAE was relatively high (0.66 – 0.75).

Table 3.

Model equation, R2 and NMAE for regressions predicting PROMIS Physical Function, Pain Interference, and Average Pain Intensity scores from Oswestry Disability Index for the PROMIS I W2 and CERC samples, individually and combined.

PROMIS Score Sample Equation R2 NMAE
Physical Function PROMIS I W2 (n=212) 46.762 – 0.249 x ODI total score 0.47 0.63
CERC (n=1527) 53.622 – 0.401 x ODI total score 0.51 0.60
CERC+PROMIS 1 W2 (n=1739) 53.216 – 0.389 x ODI total score 0.55 0.58
Pain Interference PROMIS I W2 (n=212) 52.957 + 0.301 x ODI total score 0.55 0.57
CERC (n=1527) 46.865 + 0.403 x ODI total score 0.50 0.62
CERC+PROMIS 1 W2 (n=1739) 47.019 + 0.408 x ODI total score 0.56 0.57
Pain Intensity PROMIS I W2 (n=212)  4.133 + 0.062 x ODI total score 0.26 0.75
CERC (n=1674) 1.932 + 0.095 x ODI total score 0.35 0.70
CERC+PROMIS 1 W2 (n=1886) 1.959 + 0.099 x ODI total score 0.41 0.66

NMAE = Normalized Mean Absolute Error; CERC= CERC= Center for Excellence in Research for Complementary and Integrative Health; PROMIS I W2 = PROMIS 1 Wave 2 Pain and Depression study. R2 and NMAE are adjusted using linear equating to address regression to the mean.

Table 3 provides regression results for each model for use in translating scores from one scale to another, although the models predicting PROMIS pain intensity from ODI are limited by the low level of explained variance.

RMDQ Total score models

Results from models predicting the three PROMIS scores from the RMDQ total score using data from PROMIS 1 W2 and ACT samples individually and combined are shown in Table 4. Across all models, R2 values ranged from 0.13-0.50 with NMAE ranging from 0.57 to 0.89 standard deviations. Although the pattern of results is like that shown for the ODI models, the PROMIS scores were less accurately predicted by the RMDQ than the ODI. However, like the ODI results, the RMDQ was a stronger predictor of PROMIS physical function and pain interference scores than pain intensity. Specifically, for physical function the overall R2 ranged from 0.37 to 0.50 and for pain interference, R2 ranged from 0.37-0.47. Moreover, NMAE ranged from 0.57 to 0.69 and 0.62 to 0.69 standard deviations for physical function and pain interference, respectively. In contrast, pain intensity models yielded R2 ranging from 0.13-0.20. and NMAE ranged from 0.83 to 0.87. In contrast to ODI results, models involving PROMIS 1 W2 data did not perform better than those using ACT data. However, the combined sample did perform better than using PROMIS 1 W2 or ACT alone. Table 4 provides regression results for each model for use in translating scores from one scale to another, although the models predicting PROMIS pain intensity from RMDQ may not provide enough precision to be useful.

Table 4.

Model equation, R2 and NMAE for regressions predicting PROMIS Physical Function, Pain Interference, and Average Pain Intensity scores from Roland-Morris Disability Questionnaire in PROMIS 1 W2 and ACT samples individually and combined.

PROMIS Score Sample Equation R2 NMAE
Physical Function PROMIS I W2 (n=179) 45.928 – 0.640 x RMDQ total score 0.37 0.69
ACT (n=384) 51.928 – 0.869 x RMDQ total score 0.48 0.57
ACT+PROMIS 1 W2 (n=563) 51.063 – 0.879 x RMDQ total score 0.50 0.62
Pain Interference PROMIS I W2 (n=179) 54.762 + 0.707 x RMDQ total score 0.37 0.69
ACT (n=384) 51.814 + 0.784 x RMDQ total score 0.45 0.64
ACT+PROMIS 1 W2 (n=563) 52.154 + 0.811 x RMDQ total score 0.47 0.62
Average Pain PROMIS I W2 (n=179) 4.746 + 0.129 x RMDQ total score 0.13 0.87
Intensity ACT (n=384) 3.449 + 0.126 x RMDQ total score 0.13 0.89
ACT+PROMIS 1 W2 (n=563) 3.510 + 0.161 x RMDQ total score 0.20 0.83

NOTE: NMAE = Normalized Mean Absolute Error; PROMIS I W2 = PROMIS 1 Wave 2 Pain and Depression study; ACT= Assessment of Chiropractic Treatment for Low Back Pain and Smoking Cessation in Military Active Duty Personnel. R2 and NMAE are adjusted using linear equating to address regression to the mean.

Predicting ODI from PROMIS scales

The three PROMIS scores do reasonably well in predicting the ODI; Table 5 shows results of these models for CERC and PROMIS 1 W2 samples individually and combined. There is little variation in the precision of the prediction of ODI score according to sample, but the model using data from the combined sample yields the highest R2 (0.68). Across all models, R2 values ranged from 0.64 to 0.68 with NMAE ranging from 0.47 to 0.50. Beta coefficients for all three PROMIS scores were significant at p<.01.

Table 5.

Model equation, R2, NMAE for regressions predicting Oswestry Disability Index from PROMIS Physical Function, Pain Interference and Pain Intensity for PROMIS I W2 and CERC samples individually and combined.

Sample Equation R2 NMAE
PROMIS I W2 (n=212) −7.114 – 0.961 x PF + 1.134 x PI + 1.193 x Avg Pain 0.66 0.49
CERC (n=1524) 20.910 – 0.765 x PF + 0.541 x PI + 1.165 x Avg Pain 0.64 0.50
CERC + PROMIS I W2 (n=1736) 17.572 – 0.789 x PF + 0.623 x PI + 1.192 x Avg Pain 0.68 0.47

NOTE: NMAE = Normalized Mean Absolute Error; PROMIS I W2 = PROMIS 1 Wave 2 Pain and Depression study; CERC= Center for Excellence in Research for Complementary and Integrative Health. PF = PROMIS Physical Function T score; PI = PROMIS Pain Interference T score; Avg Pain = PROMIS Pain Intensity score. R2 and NMAE are adjusted using linear equating to address regression to the mean.

Predicting RMDQ from PROMIS scales

Table 6 shows results of models predicting RMDQ from the three PROMIS scores. Beta coefficients for PROMIS physical function and pain interference scores across all models were significant at p<.05. However, the PROMIS pain intensity item was not a significant predictor in these models. Although, R2 values for these RMDQ models (range = 0.46 – 0.58) are consistently lower relative to models predicting ODI (range = 0.64 – 0.68), they provide enough precision to be useful. As with ODI models, there is little variation in precision of the prediction of RMDQ score according to sample, but the pattern of results differs from that seen with the ODI: The results for the PROMIS 1 W2 sample yield the lowest precision, and the model using data from the combined samples does not improve the R2 over the model that just includes the ACT sample.

Table 6.

Model equation, R2, and NMAE for regressions predicting Roland-Morris Disability Questionnaire from PROMIS Physical Function, Pain Interference and Average Pain Intensity for PROMIS I W2 and ACT samples individually and combined.

Sample Equation R2 NMAE
PROMIS I W2 (n=179) 6.433 – 0.357 x PF + 0.289 x PI + 0.234 x Avg Pain 0.46 0.63
ACT (n=384) 5.451 – 0.356 x PF + 0.316 x PI + 0.140 x Avg Pain 0.58 0.56
ACT + PROMIS I W2 (n=563) 5.689 – 0.352 x PF + 0.307 x PI + 0.157 x Avg Pain 0.58 0.55

NOTE: NMAE = Normalized Mean Absolute Error; PROMIS I W2 = PROMIS 1 Wave 2 Pain and Depression study; ACT= Assessment of Chiropractic Treatment for Low Back Pain and Smoking Cessation in Military Active Duty Personnel. PF = PROMIS Physical Function T score; PI = PROMIS Pain Interference T score; Avg Pain = PROMIS Pain Intensity score. R2 and NMAE are adjusted using linear equating to address regression to the mean.

Discussion

These analyses provide preliminary equations for translation of PROMIS physical function, pain interference, and pain intensity scores to and from two condition-specific instruments, the ODI and the RMDQ, using data from three large samples of patients with CLBP. Because relationships of PROMIS scores with the ODI and RMDQ were not strong enough to support latent variable equating, we used OLS regression with linear equating. The level of disability in the CERC and ACT datasets was relatively low compared to the PROMIS 1 W2 sample, so the combined results provide an increased range of disability. We focused on the three PROMIS scores that are most relevant to CLBP disability and recovery and are part of the ISS, but it is notable that the relationship of physical function and pain interference with ODI and RMDQ are stronger than that with pain intensity. This is especially true for the RMDQ.

Models reported here yielded useful coefficients to estimate PROMIS scores, particularly physical function and pain interference, from the ODI or RMDQ. Even so, the ODI and RMDQ total scores never accounted for more than 56% of the variance in PROMIS scores. This may be due in part to predicting a single PROMIS score (e.g., physical function) from multidimensional composites. We conducted exploratory analyses using individual items within the ODI and RMDQ as predictors of the PROMIS scores to determine whether use of items might improve the prediction. For ODI, we found that item level analyses increased the variance explained in the prediction models by 7% to 12%. In contrast, use of RMDQ items instead of total score increased the variance explained in the prediction models by between 1% and 2%. This discrepancy in impact is likely due to greater heterogeneity of ODI compared to RMDQ items.

In predicting PROMIS scores, ODI models are stronger than RMDQ models across all three PROMIS scores, and notably the addition of the more disabled PROMIS 1 W2 sample had more of an impact on precision of prediction equations for ODI than for RMDQ. In fact, the extended range of disability did not consistently improve predictions for RMDQ.

Models predicting ODI and RMDQ measure scores from PROMIS performed better than models predicting PROMIS from legacy measures, and this was particularly true for models predicting ODI. This better performance may be due to predicting a single score from three predictors as opposed to one.

Limitations

These analyses are based on relationships found in these datasets; the stronger relationships of PROMIS scores with ODI relative to RMDQ may be due to basic differences between the CERC and ACT samples. However, these differences are maintained in the PROMIS 1 W2 sample—thus, the superiority of ODI models are not explained by possible anomalies in the CERC and ACT samples. Another limitation is that correlations among observed scores precluded use of IRT equating. However, our best performing prediction equations achieved R2 values of approximately 0.65 and higher, which correspond closely with accuracy reported by Schalet et al38 in an IRT-based linking of the PROMIS Global Health physical summary scores with those from the VR-12. The mental health summary score linkages reported by Schalet et al were less accurate, corresponding closely with R2 values of approximately 0.45, a level of accuracy that was reached or exceeded in many of the prediction models reported in this paper. The samples represent a limited range of disability, and results should be replicated in samples with a broader range of disability. Although it is unclear whether these prediction equations will be applicable to higher levels of disability than is represented here, a comparison of actual and predicted mean scores for quantiles of each dependent variable found prediction accuracy to be fairly stable across the range of scores in our data. Finally, although the addition of the PROMIS 1 W2 sample provides obvious benefits, the PROMIS scores from that sample are based on CAT administration of the PROMIS domains as opposed to the 4-item short forms in the PROMIS-29.

Conclusion

Researchers can use these results to translate PROMIS physical function, pain interference, and pain intensity scores to and from the ODI and RMDQ – the two condition-specific instruments most frequently used to evaluate interventions for CLBP. The empirical linkages can be used to facilitate comparisons across CLBP interventions and broaden interpretation of study results. Further studies are needed to evaluate the robustness of the prediction equations presented here.

Acknowledgement of Financial Support:

This work is funded by National Center for Complementary and Integrative Health Grant # 1R01AT010402-01A1

Abbreviations:

ACT

Assessment of Chiropractic Treatment for Low Back Pain and Smoking Cessation in Military Active Duty Personnel

CAT

Computer Adaptive Test

CERC

Center for Excellence in Research for Complementary and Integrative Health

CLBP

Chronic Low Back Pain

CNP

Chronic Neck Pain

NIH

National Institutes of Health

NMAE

Normalized Mean Absolute Error

ODI

Owestry Disability Index

OLS

Ordinary Least Squares

PROM

Patient-Reported Outcome Measures

PROMIS

Patient-Reported Outcomes Measurement Information System

PROMIS-29

29-Item PROMIS Profile Measure

RMDQ

Roland-Morris Disability Questionnaire

SIP

Sickness Impact Profile

Footnotes

This material is being presented at the 6th PROMIS International Conference 2020, October 25-27, 2020

The authors have no conflicts of interest to disclose.

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

References

  • 1.Chou R, Atlas SJ, Stanos SP, Rosenquist RW. Nonsurgical interventional therapies for low back pain: a review of the evidence for an American Pain Society clinical practice guideline. Spine (Phila Pa 1976) 2009;34(10):1078–93. [DOI] [PubMed] [Google Scholar]
  • 2.Cramer H, Lauche R, Haller H, Dobos G. A systematic review and meta-analysis of yoga for low back pain. Clin J Pain 2013;29(5):450–60. [DOI] [PubMed] [Google Scholar]
  • 3.Farabaugh RJ, Dehen MD, Hawk C. Management of chronic spine-related conditions: consensus recommendations of a multidisciplinary panel. J Manipulative Physiol Ther 2010;33(7):484–92. [DOI] [PubMed] [Google Scholar]
  • 4.Furlan AD, Giraldo M, Baskwill A, Irvin E, Imamura M. Massage for low-back pain. Cochrane Database Syst Rev 2015(9):Cd001929. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Henschke N, Ostelo RW, van Tulder MW, Vlaeyen JW, Morley S, Assendelft WJ et al. Behavioural treatment for chronic low-back pain. Cochrane Database Syst Rev 2010;2010(7):Cd002014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Ivanova JI, Birnbaum HG, Schiller M, Kantor E, Johnstone BM, Swindle RW. Real-world practice patterns, health-care utilization, and costs in patients with low back pain: the long road to guideline-concordant care. Spine (Phila Pa 1976) 2011;11(7):622–32. [DOI] [PubMed] [Google Scholar]
  • 7.Kamper SJ, Apeldoorn AT, Chiarotto A, Smeets RJ, Ostelo RW, Guzman J et al. Multidisciplinary biopsychosocial rehabilitation for chronic low back pain. Cochrane Database Syst Rev 2014(9):Cd000963. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Skelly AC, Chou R, Dettori JR, Turner JA, Friedly JL, Rundell SD et al. Noninvasive Nonpharmacological Treatment for Chronic Pain: A Systematic Review. Rockville, MD: AHRQ; 2018. June Report No.: 18-EHC013-EF. [PubMed] [Google Scholar]
  • 9.Skolasky RL, Wegener ST, Aaron RV, Ephraim P, Brennan G, Greene T et al. The OPTIMIZE study: protocol of a pragmatic sequential multiple assessment randomized trial of nonpharmacologic treatment for chronic, nonspecific low back pain. BMC musculoskeletal disorders 2020;21(1):293-. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Vickers AJ, Cronin AM, Maschino AC, Lewith G, MacPherson H, Foster NE et al. Acupuncture for chronic pain: individual patient data meta-analysis. Archives of internal medicine 2012;172(19):1444–53. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Fairbank JC, Pynsent PB. The Oswestry Disability Index. Spine (Phila Pa 1976) 2000;25(22):2940–52; discussion 52. [DOI] [PubMed] [Google Scholar]
  • 12.Roland M, Morris R. A study of the natural history of back pain. Part I: development of a reliable and sensitive measure of disability in low-back pain. Spine (Phila Pa 1976) 1983;8(2):141–4. [DOI] [PubMed] [Google Scholar]
  • 13.Burbridge C, Randall JA, Abraham L, Bush EN. Measuring the impact of chronic low back pain on everyday functioning: content validity of the Roland Morris disability questionnaire. J Patient Rep Outcomes 2020;4(1):70. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Cella D, Riley W, Stone A, Rothrock N, Reeve B, Yount S et al. The Patient-Reported Outcomes Measurement Information System (PROMIS) developed and tested its first wave of adult self-reported health outcome item banks: 2005-2008. J Clin Epidemiol 2010;63(11):1179–94. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Reeve BB, Hays RD, Bjorner JB, Cook KF, Crane PK, Teresi JA et al. Psychometric evaluation and calibration of health-related quality of life item banks: plans for the Patient-Reported Outcomes Measurement Information System (PROMIS). Med Care 2007;45(5 Suppl 1):S22–31. [DOI] [PubMed] [Google Scholar]
  • 16.Cook KF, Jensen SE, Schalet BD, Beaumont JL, Amtmann D, Czajkowski S et al. PROMIS measures of pain, fatigue, negative affect, physical function, and social function demonstrated clinical validity across a range of chronic conditions. J Clin Epidemiol 2016;73:89–102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Cella D, Choi SW, Condon DM, Schalet B, Hays RD, Rothrock NE et al. PROMIS(®) Adult Health Profiles: Efficient Short-Form Measures of Seven Health Domains. Value Health 2019;22(5):537–44. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Chiarotto A, Roorda LD, Crins MH, Boers M, Ostelo RW, Terwee CB. PROMIS Physical Function Short Forms Display Item- and Scale-Level Characteristics at Least as Good as the Roland Morris Disability Questionnaire in Patients With Chronic Low Back Pain. Arch Phys Med Rehabil 2020;101(2):297–308. [DOI] [PubMed] [Google Scholar]
  • 19.Deyo RA, Dworkin SF, Amtmann D, Andersson G, Borenstein D, Carragee E et al. Report of the NIH Task Force on research standards for chronic low back pain. J Pain 2014;15(6):569–85. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Herman P, Edelen M, Rodriguez A, Hilton L, Hays R. A protocol for chronic pain outcome measurement enhancement by linking PROMIS-29 scale to legacy measures and improving chronic pain stratification. BMC Musculoskeletal Disorders In Press. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Stewart LA, Tierney JF. To IPD or not to IPD? Advantages and disadvantages of systematic reviews using individual patient data. Eval Health Prof 2002;25(1):76–97. [DOI] [PubMed] [Google Scholar]
  • 22.Stewart LA, Tierney JF, Clarke M. Reviews of individual patient data. In: Higgins JP, Gren S, editors. Cochrane Handbook for Systematic Reviews of Interventions. 5.1.0 ed.: The Cochrane Collaboration; 2011. [Google Scholar]
  • 23.Herman PM, Kommareddi M, Sorbero ME, Rutter CM, Hays RD, Hilton LG et al. Characteristics of Chiropractic Patients Being Treated for Chronic Low Back and Neck Pain. J Manipulative Physiol Ther 2018;41(6):445–55. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.U.S. National Library of Medicine. RAND Center of Excellence for the Study of Appropriateness of Care in CAM (CERC) 2017. April 16, 2019 [cited 2020 September 25]. Available from: URL: https://clinicaltrials.gov/ct2/show/NCT03162952.
  • 25.Goertz CM, Long CR, Vining RD, Pohlman KA, Walter J, Coulter I. Effect of Usual Medical Care Plus Chiropractic Care vs Usual Medical Care Alone on Pain and Disability Among US Service Members With Low Back Pain: A Comparative Effectiveness Clinical Trial. JAMA Netw Open 2018;1(1):e180105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Goertz CM, Long CR, Vining RD, Pohlman KA, Kane B, Corber L et al. Assessment of chiropractic treatment for active duty, U.S. military personnel with low back pain: study protocol for a randomized controlled trial. Trials 2016;17:70. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.U.S. National Library of Medicine. Assessment of Chiropractic Treatment for Low Back Pain and Smoking Cessation in Military Active Duty Personnel 2012. December 19, 2018 [cited 2020 September 25]. Available from: URL: https://clinicaltrials.gov/ct2/show/NCT01692275.
  • 28.Dagmar A PROMIS 1 Wave 2 Pain. V1 ed.: Harvard Dataverse; 2016. [Google Scholar]
  • 29.Bruce B, Fries JF, Ambrosini D, Lingala B, Gandek B, Rose M et al. Better assessment of physical function: item improvement is neglected but essential. Arthritis Res Ther 2009;11(6):R191. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Amtmann D, Cook KF, Jensen MP, Chen W-H, Choi S, Revicki D et al. Development of a PROMIS item bank to measure pain interference. Pain 2010;150(l):173–82. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Hays RD, Bjorner JB, Revicki DA, Spritzer KL, Cella D. Development of physical and mental health summary scores from the patient-reported outcomes measurement information system (PROMIS) global items. Quality of life research : an international journal of quality of life aspects of treatment, care and rehabilitation 2009;18(7):873–80. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Norquist JM, Watson DJ, Yu Q, Paolini JF, McQuarrie K, Santanello NC. Validation of a questionnaire to assess niacin-induced cutaneous flushing. Curr Med Res Opin 2007;23(7):1549–60. [DOI] [PubMed] [Google Scholar]
  • 33.Bergner M, Bobbitt RA, Carter WB, Gilson BS. The Sickness Impact Profile: development and final revision of a health status measure. Med Care 1981;19(8):787–805. [DOI] [PubMed] [Google Scholar]
  • 34.Dorans NJ. Equating, Concordance, and Expectation. Applied Psychological Measurement 2004;28(4):227–46. [Google Scholar]
  • 35.Choi SW, Schalet B, Cook KF, Cella D. Establishing a common metric for depressive symptoms: linking the BDI-II, CES-D, and PHQ-9 to PROMIS depression. Psychol Assess 2014;26(2):513–27. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Thompson NR, Lapin BR, Katzan IL. Mapping PROMIS Global Health Items to EuroQol (EQ-5D) Utility Scores Using Linear and Equipercentile Equating. Pharmacoeconomics 2017;35(11): 1167–76. [DOI] [PubMed] [Google Scholar]
  • 37.Revicki DA, Kawata AK, Harnam N, Chen WH, Hays RD, Cella D. Predicting EuroQol (EQ-5D) scores from the patient-reported outcomes measurement information system (PROMIS) global items and domain item banks in a United States sample. Qual Life Res 2009;18(6):783–91. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Schalet BD, Rothrock NE, Hays RD et al. Linking Physical and Mental Health Summary Scores from the Veterans RAND 12-Item Health Survey (VR-12) to the PROMIS® Global Health Scale. J Gen Intern Med 2015; 30: 1524–1530. 10.1007/s11606-015-3453-9. [DOI] [PMC free article] [PubMed] [Google Scholar]

RESOURCES