Abstract
Background
The Modified Low Back Pain Disability Questionnaire (MLBPDQ) is used for evaluating the functional disability in patients with low back pain (LBP). However, the measurement characteristics of the MLBPDQ among Arab patients are unknown. In this study, we aimed to translate and cross-culturally adapt the MLBPDQ into Arabic and evaluate its psychometric properties.
Methods
An Arabic version of the MLBPDQ was developed through forward translation, translation synthesis, and backward translation. Sixty-eight patients (55 males and 13 females) with a mean age 37.01 ± 7.57 years were recruited to assess its psychometric properties. Reliability was evaluated using internal consistency (Cronbach’s α), test retest reliability (utilizing intraclass correlation coefficient [ICC]), standard error of measurement (SEM), minimal detectable change at 95% confidence level (MDC95%), and 95% limits of agreement (LOA). The construct validity was investigated by correlating the new translation with four other measures of LBP (using Spearman’s rho). Finally, receiver operating characteristic curve was constructed to compute the sensitivity, using the area under the curve (AUC), and the minimum important change (MIC). An alpha level of 0.05 was set for statistical tests and all the psychometric values were tested against a priori hypotheses.
Results
The culturally aligned MLBPDQ showed good internal consistency (Cronbach’s α = 0.85). The ICC, SEM, MDC95%, and LOA between baseline and two days later were 0.98, 1.60, 4.43, and -4.23 to 7.70, respectively, while the values between baseline and 14 days later were 0.94, 2.77, 7.67, and -6.59 to 13.53, respectively. The scale also demonstrated moderate to excellent correlation (rho = 0.54–0.86) with the other four questionnaires. The AUC value of the Arabic-MLBPDQ was 0.68, and the MIC was 3 points.
Conclusion
The Arabic version of the MLBPDQ demonstrates adequate psychometric properties and can be used to assess disability level in patients with LBP in Arabic-speaking communities.
Introduction
Among other disabling conditions, low back pain (LBP) has been identified as the highest contributor to disability globally [1]. Though recovery from LBP and return-to-work may occur in a month for as many as 75% [2], return to normalcy in those whose pain persists for 6–10 weeks after onset may take up to a year [3]. The lost productive time and missed workdays due to LBP may cost employers $19.8 billion per year [4]. Therefore, measures of disability associated with LBP are essential. Such measures would help evaluate and monitor patients in diagnostic and treatment stages, which might assist in reducing painful, costly episodes of LBP.
Self-report measures of LBP associated disability are widely used by clinicians because they are easy to administer and inexpensive [5]. One of the most commonly used self-report disability questionnaires for people with LBP is the Oswestry LBP Disability Questionnaire (ODQ), which was described by Fairbank and colleagues [6]. Following the original English version of the ODQ, many variants of it have been produced in English [7–12], and there are about 31 adaptations in other languages [13], including Arabic [14–16].
While the ODQ has been adapted into many cultures, approximately 60% (19 out of 31) of the validation studies have reported missing responses, ranging from 11% to 90%, to the question about how LBP interfered with patients’ sex life [13,15–32]. Speculations about why respondents were unable or unwilling to answer this question include that they could be too young [32] or never had sex [15,31,32]. Others might be too embarrassed or find the question unacceptable due to cultural or religious influence [13,15,16,19,23,31,32]. Although a previous study showed that Arab people are willing to discuss their sexual life and how it is affected by LBP [33], it has been reported that up to 69% of respondents did not answer the sex-life questions of the Arabic ODQ [15,16]. An attempt was made to resolve this problem by removing the question and then validating the questionnaire [15].
Fritz and Irrgang [12] published a modified version of the ODQ (MLBPDQ). They replaced the sex-life question with a question about the effect that LBP has on employment and/or homemaking. The MLBPDQ was found to be a valid, reliable, and sensitive tool [12,34,35]. It was also adapted into three different languages [36–38]. To our knowledge, a validated Arabic version of the MLBPDQ has never been published. Therefore, our aims were to (1) translate and cross-culturally adapt the MLBPDQ into Arabic and establish (2) the reliability, (3) validity, and (4) sensitivity of the Arabic version.
Materials and methods
This study was in two stages. Stage one, translation and cross-cultural adaptation of the MLBPDQ into Arabic, and Stage two, testing the psychometric properties of the Arabic-MLBPDQ. The Research Ethics Committee of the University of Tabuk, Tabuk, Saudi Arabia (SA) approved the study, and all respondents gave their signed written informed consent.
The Modified Disability Questionnaire
Like the original version, the MLBPDQ includes 10 items covering LBP and assessments of the effects of pain on function. In addition to pain assessment, the functions covered are personal care, lifting, walking, sitting, standing, sleeping, social life, traveling, and employment/homemaking (replacing the sex life item). Each item consists of six statements that range from 0 (no disability) to 5 (maximal disability). The patient chooses the statement that most closely represents his/her status. To obtain a disability score, the sum of the scores is divided by the total possible score (i.e., 50). To obtain the percentage of a patient’s disability, the resulting total is multiplied by 100: 0% (no disability) and 100% (the most severe disability).
Translation and cross-cultural adaptation
The MLBPDQ was translated through a process of forward translation, translation synthesis, and backward translation [39] (Fig 1, removed at the time of retraction). First, two translators proficient in English who were native Arabic speakers translated the English version of the MLBPDQ into Arabic. The first translator was a physician who was aware of the MLBPDQ concept. The second translator, a computer engineer, had no medical background and was unaware of the concept. Second, the translators synthesized the two versions into one. Third, two other translators whose native language was English and who were proficient in Arabic translated the Arabic version of the MLBPDQ back into English. Neither translator had medical background, nor access to the original version of the questionnaire.
Fig 1. Processes for cross-cultural adaptation and validation of the Modified Low Back Pain Disability Questionnaire.

After that, a four-member committee of experts produced a prefinal Arabic version of the MLBPDQ for field-testing. The committee consisted of two healthcare professionals, a linguistic professional, and the principal investigator (HSA). One of the healthcare professionals was proficient in methodology, and the principal investigator relayed questions or queries raised in committee meetings to the forward and back translators. The committee reviewed and analyzed any discrepancy or inconsistency in previous stages of the translation process. They also judged the document and made any changes necessary to ensure clarity and suitability for the general Arab public. The reviewers made four main suggestions. The first suggestion was to convert the distance unit from miles to kilometers in Section 4 (walking). The second suggestion was to restructure the last selection in Section 4 to “I am in bed most of the time and cannot go to the toilet without help of others”. The third suggestion was to add “to practice social activity” to selection 4 in Section 8 (Social Life). The fourth suggestion was to add the word “commuting” to the title of Section 9. The rest of the modifications suggested by the review committee are presented in S1 Table (removed at the time of retraction).
The prefinal version was completed by 30 patients to evaluate the questionnaire’s comprehensibility and provide final input on its language. Overall, no major difficulties were faced by respondents, and they could read and understand all the 10 sections. Finally, the Arabic-MLBPDQ was produced and ready for psychometric testing (see S1 Appendix, removed at the time of retraction).
Psychometric testing
Using convenience sampling, patients from local hospitals in Tabuk, SA who met the inclusion/exclusion criteria were recruited. The inclusion criteria were patients presenting with acute or chronic LBP, aged 18–65 years, and fluent in Arabic. Excluded were patients who were pregnant and those with a history of psychiatric disorders, malignancies, or neurological pathologies. Terwee and colleagues [40] believed 50 participants could be used to adequately measure the floor and ceiling effects, reliability, agreement, minimum important change (MIC), and construct validity of a questionnaire; therefore, considering losses to withdrawal, follow-up, or protocol violation, we set to recruit ≥ 60 patients.
Because most of the change in patients’ condition was observed immediately following the injury [5], it is vital to perform assessments during the first two weeks of enrollment. Therefore, follow-up assessments occurred two and 14 days after baseline. Fig 1 (removed at the time of retraction) illustrates the three sessions of assessment.
In Session 1, the baseline assessment, respondents completed a demographic survey that indicated whether they met the exclusion/inclusion criteria. Those who qualified completed the following questionnaires in Arabic: the MLBPDQ, the Fear Avoidance Beliefs Questionnaire (FABQ) [41], the Quebec Back Pain Disability Scale (Quebec) [42], the Roland-Morris Disability Questionnaire (RM) [43], and the Visual Analog Scale (VAS) [44]. Table 1 summarizes the psychometric properties of the questionnaires.
Table 1. Psychometric properties of Arabic questionnaires included in the study.
| Questionnaire | FABQ | Quebec | RM |
|---|---|---|---|
| Study | Alanazi et al. 2017 [41] | Alnahhal and May 2012 [42] | Maki et al. 2014 [43] |
| Internal consistency | - | α = 0.92 | α = 0.729 |
| Test-retest reliability | ICC = 0.76 | κ = 0.86–0.98 | ICC = 0.90 |
| Construct validity | r = 0.234–0.283a | r = 0.69a | r = 0.259c |
| r = -0.115–0.12b | |||
| r = 0.66d | |||
| r = 0.092–0.208c | |||
| Sensitivity | ES = 0.25 | - | - |
FABQ, Fear-Avoidance Beliefs Questionnaire; Quebec, Quebec Back Pain Disability Scale; RM, Roland-Morris Disability Questionnaire; α, Cronbach’s alpha; ICC, intraclass correlation coefficient; κ, weighted kappa; r, Spearman correlation coefficient; ES, effect size.
aCorrelated with the Modified Low Back Pain Disability Questionnaire.
bCorrelated with the Quebec Back Pain Disability Scale.
cCorrelated with the Visual Analog Scale.
dCorrelated with the Numeric Pain Rating Scale.
In Session 2, which occurred 48 hours later, respondents answered a 7-level global change scale to detect any big alterations in LBP characteristics or symptoms since baseline. The scale asked respondents to rate the extent that their LBP had changed over the past two days. The scale had seven response options: completely gone, much better, better, a little better, about the same, a little worse, and much worse. Respondents who answered “about the same” or “a little better” or “a little worse” were classified as stable [45] and completed the Arabic-MLBPDQ again.
In Session 3, held 14 days following the baseline assessment, respondents completed the Arabic-MLBPDQ for a third time and completed the four other scales in addition to the global change scale.
Data analyses
Data analyses included the assessment of the Arabic-MLBPDQ for floor and ceiling effects, reliability, construct validity, and sensitivity. All the obtained psychometric values were tested against a priori hypotheses. IBM SPSS Statistics for Windows version 25.0 (Armonk, NY) was utilized to perform the statistical tests with alpha level at 0.05.
Floor and ceiling effects
Floor and/or ceiling effects exist if more than 15% of respondents obtained the maximum or minimum possible score [40]. Floor and ceiling effects were defined by computing the number of respondents who scored the lowest status (90–100) or the highest status (0–10), respectively, on the Arabic-MLBPDQ [13].
Reliability
Internal consistency of the Arabic-MLBPDQ was evaluated by calculating Cronbach’s α at baseline. Test-retest reliability was determined by testing and then retesting and calculating the intraclass correlation coefficients (ICC) in a one-way random effects model with multiple measures. Cronbach’s α and ICC values were interpreted as follows: < 0.50, poor; 0.50–0.75, moderate; 0.75–0.90, good; and > 0.90, excellent [46,47]. Furthermore, measurement error was examined by calculating the standard error of measurement (SEM). The minimal true change in score for one person beyond measurement error was estimated by calculating the minimal detectable change at 95% confidence level (MDC95%) [40]. The following formulas were used to calculate the SEM and MDC95%, respectively: (where SD is the standard deviation) [48]; [46]. Finally, the 95% limits of agreement (LOA) between the scores of the Arabic-MLBPDQ on baseline and the following administrations were visually assessed by constructing a Bland-Altman plot [49]. The records of only those patients classified as stable in Sessions 2 and 3 were used to evaluate the reliability. Our hypotheses regarding the values of Cronbach’s α, ICC, SEM, MDC95%, and LOA for the Arabic-MLBPDQ are stated in Table 2.
Table 2. A Priori hypotheses for testing the psychometric properties of the Arabic-Modified Lower Back Pain Disability Questionnaire.
| Reliability | ||
|---|---|---|
| Internal consistency | Cronbach’s α = 0.70–0.95 [40] | |
| Test-retest | ICC > 0.70 [40] | |
| SEM | ≤ 2.15–6.5 [12,20–22,27,32,34,50] | |
| MDC 95% | ≤ 6–13.67 [12,13,20–22,27,32,34,36,50–52] | |
| 95% LOA | –12.7 to 13.7 [16,30,32] | |
| Construct validity | ||
| Instrument | Construct measured | Direction and magnitude of the relationships |
| FABQ | Fear-avoidance beliefs | Little or no correlation (r = 0.19) [13] |
| Quebec | Functional disability | Moderate to excellent positive relationship (r = 0.79–0.90) [14,15,29] |
| RM | Functional disability | Moderate to excellent positive relationship (r = 0.50–0.84) [14,23,25,27,29,30,32,36,50,53–59] |
| VAS | Pain intensity | Fair to excellent positive relationship (r = 0.33–0.84) [13,14,16,19–23,25,27,28,31,32,50,51,53–56,58–60] |
| Sensitivity to change | ||
| AUC | > 0.70 [40] | |
| MIC | 4–11 [12,34,52,61–64] | |
ICC, intraclass correlation coefficient; SEM, standard error of measurement; MDC95%, minimal detectable change at 95% confidence level; LOA, limits of agreement; FABQ, Fear-Avoidance Beliefs Questionnaire; r, Pearson correlation coefficient; Quebec, Quebec Back Pain Disability Scale; RM, Roland-Morris Disability Questionnaire; VAS, Visual Analog Scale; AUC, area under the curve; MIC, minimum important change.
Validity
Construct validity was evaluated by correlating the Arabic-MLBPDQ with the Arabic versions of the FABQ, the Quebec, the RM, and the VAS and calculating a Spearman rank correlation coefficient (Spearman’s rho). Spearman’s rho values were interpreted as follows: < 0.25, little or no relationship; 0.25–0.50, fair; 0.50–0.75, moderate; and ≥ 0.75, excellent [48]. Table 2 presents a priori hypotheses to test the construct validity of the Arabic-MLBPDQ. The hypotheses were formulated based on the findings of previous validation studies of the ODQ and MLBPDQ. According to Terwee et al. [40], 75% or more of the hypotheses need to be confirmed to support the construct validity of the instrument being assessed.
Sensitivity
Sensitivity to change, or responsiveness, of the Arabic-MLBPDQ was examined by constructing a receiver operating characteristic (ROC) curve from the change scores between the two-week follow-up and the baseline. The area under the curve (AUC) was used to quantify the ability of the Arabic-MLBPDQ to segregate patients who were improved from those who remained stable based on the 7-level global change scale. AUC values range from 0.5, indicating no diagnostic accuracy, to 1, indicating perfect diagnostic accuracy [65]. We hypothesized that an AUC value of 0.70 or more [40] would be obtained for the Arabic-MLBPDQ (Table 2). The MIC of the Arabic-MLBPDQ was then estimated using the ROC curve. The MIC was determined by locating the point on the curve nearest to the left-hand corner of the graph. This point is associated with the maximum sensitivity and specificity of the questionnaire and represents a cutoff value to separate patients who have experienced improvements in their condition from those who have not [66]. Our predefined hypothesis regarding the MIC value is stated in Table 2, which was formulated based on previous MIC values obtained for the ODQ and MLBPDQ among patients with nonspecific LBP utilizing the same approach described above.
Results
Sixty-eight men and women with LBP were enrolled to assess the psychometric properties of the translated questionnaire. The reliability was assessed in respondents who were classified as stable (61 respondents at two days and 53 respondents at 14 days), while the answers of all 68 respondents at baseline and 14-day later were used to calculate validity and sensitivity (Fig 1, removed at the time of retraction). Thus, all groups met the 50-participant requirement prescribed by Terwee and colleagues [40].
Respondents’ demographic characteristics are presented in Table 3. Categorical variables are provided in frequencies and percentages. Continuous variables are summarized by group using means and standard deviations. Table 4 illustrates the test values at baseline and retest values after two days’ and after two weeks’ follow-up for the Arabic-MLBPDQ and the four other questionnaires.
Table 3. Demographic characteristics of the respondents (N = 68).
| Characteristic | N | % | |
|---|---|---|---|
| Sex | Male | 55 | 80.9 |
| Female | 13 | 19.1 | |
| Marital status | Single | 10 | 14.7 |
| Married | 57 | 83.8 | |
| Divorced | 1 | 1.5 | |
| Education level | High school | 16 | 23.5 |
| Diploma | 12 | 17.6 | |
| University | 40 | 58.9 | |
| Employment status | Employed | 59 | 86.8 |
| Unemployed | 5 | 7.4 | |
| Student | 4 | 5.8 | |
| Duration of low back pain | 3 weeks to 3 months | 37 | 54.4 |
| >3 months | 31 | 45.6 | |
| Age mean ± SD (years) | 37.01 ± 7.57 | ||
| Weight mean ± SD (kg) | 74.7 ± 9.72 | ||
| Height mean ± SD (cm) | 169.58 ± 7.94 | ||
SD, standard deviation.
Table 4. Test values at baseline and retest values after 2 and 14 days for Arabic questionnaires.
| Questionnaire | Baseline (N = 68) | At 2 days (N = 61) | At 14 days (N = 68) | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Min–Max | Mean | SD | Min–Max | Mean | SD | Min–Max | Mean | SD | |
| MODQ | 12–56 | 31.8 | 11.3 | 8–58 | 31.0 | 11.7 | 6–54 | 27.9 | 11.4 |
| FABQ | 13–41 | 25.5 | 7.2 | - | - | - | 9–34 | 22.7 | 6.7 |
| Quebec | 15–52 | 31.8 | 11.1 | - | - | - | 5–48 | 27.5 | 10.4 |
| RM | 3–14 | 7.3 | 3.1 | - | - | - | 1–13 | 6.2 | 2.6 |
| VAS | 2–8 | 4.8 | 1.6 | - | - | - | 2–6 | 3.5 | 1.5 |
All questionnaires were in Arabic.
Min, minimum; Max, maximum; SD, standard deviation; MLBPDQ, Modified Low Back Pain Disability Questionnaire; FABQ, Fear-Avoidance Beliefs Questionnaire; Quebec, Quebec Back Pain Disability Scale; RM, Roland-Morris Disability Questionnaire; VAS, Visual Analog Scale.
Floor and ceiling effects
No floor or ceiling effects were detected. Two respondents obtained the highest status scores of 8% and 6% at two days and 14 days, respectively. No respondents obtained the lowest status score.
Reliability
Table 5 summarizes the reliability properties of the Arabic-MLBPDQ. A Cronbach’s α of 0.85 was obtained indicating good internal consistency. Removal of item 5 or 6 slightly increased the Cronbach’s α to 0.86. Regarding test-retest reliability, with 61 respondents, the Arabic-MLBPDQ showed excellent reliability (ICC = 0.98; 95% CI: 0.97–0.99) between baseline (M = 32.8 ± 11.3) and two days later (M = 31.0 ± 11.7). Similarly, after 14 days, with 53 respondents, the questionnaire continued to show excellent reliability between the baseline (M = 33.3 ± 11.4) and 14 days later (M = 29.8 ± 11.7; ICC = 0.94; 95% CI: 0.91–0.97). The SEM and MDC95% were 1.60 and 4.43, respectively, for two days, and 2.77 and 7.67, respectively, for 14 days. Furthermore, the Bland-Altman plot for two and 14 days test-retest showed a good distribution of scores with no systematic bias. The mean difference and 95% LOA were calculated as 1.74 (-4.23 to 7.70) after two days, and 3.47 (-6.59 to 13.53) after 14 days (Figs 2 and 3, removed at the time of retraction). These reliability values of the Arabic-MLBPDQ confirm our predefined hypotheses presented in Table 2.
Table 5. Psychometric properties of the Arabic Modified Low Back Pain Disability Questionnaire.
| Psychometric property | Baseline (N = 68) | At 2 days (N = 61) | At 14 days (N = 68) | |
|---|---|---|---|---|
| Reliability | Internal consistency | Cronbach’s α = 0.85 | - | - |
| Test-retest | - | ICC = 0.98 (95% CI = 0.97–0.99) | ICC = 0.94 (95% CI: 0.91–0.97)a | |
| SEM | - | 1.60 | 2.77a | |
| MDC95% | - | 4.43 | 7.67a | |
| Mean difference (95% LOA) | - | 1.74 (-4.23 to 7.70) | 3.47 (-6.59 to 13.53)a | |
| Construct validity | FABQ | rho = 0.60* | - | rho = 0.70* |
| Quebec | rho = 0.77* | - | rho = 0.86* | |
| RM | rho = 0.54* | - | rho = 0.63* | |
| VAS | rho = 0.62* | - | rho = 0.62* | |
| Sensitivity | AUC | - | - | 0.68** (95% CI = 0.52–0.84) |
| MIC | - | - | 3 (73.3% sensitivity– 50.0% specificity) | |
All questionnaires were in Arabic.
ICC, intraclass correlation coefficient; CI, confidence interval; SEM, standard error of measurement; MDC95%, minimal detectable change at 95% confidence level; LOA, limits of agreement; FABQ, Fear-Avoidance Beliefs Questionnaire; Quebec, Quebec Back Pain Disability Scale; RM, Roland-Morris Disability Questionnaire; VAS, Visual Analog Scale; rho, Spearman rank correlation coefficient; AUC, area under the curve; MIC, minimum important change.
aAssessed in respondents classified as stable (N = 59).
*Two-tailed correlation is significant at α = 0.01.
**Significant at α = 0.05.
Fig 2. The 95% limits of agreement of the Arabic MLBPDQ scores between baseline and two days.

MLBPDQ: Modified Low Back Pain Disability Questionnaire; SD: standard deviation.
Fig 3. The 95% limits of agreement of the Arabic MLBPDQ scores between baseline and 14 days.

MLBPDQ: Modified Low Back Pain Disability Questionnaire; SD: standard deviation.
Validity
As shown in Table 5, the construct validity testing using Spearman’s rho at baseline and after 14 days showed significant moderate correlations between the Arabic-MLBPDQ and the FABQ, the RM, and the VAS, and excellent positive correlation with the Quebec. These results confirm our predefined hypotheses, except for the FABQ (i.e., confirming 75% of the hypotheses).
Sensitivity
The sensitivity of the Arabic-MLBPDQ was tested with 68 patients. An AUC value of 0.68 (standard error 0.08; 95% CI, 0.52–0.84) was obtained after constructing the ROC cure (Fig 4). This value was less than we hypothesized, but significant at 0.05 alpha level. The MIC identified form the ROC curve was 3 points, corresponding to 73.3% sensitivity and 50.0% specificity. This MIC is less than the value stated in our predefined hypothesis.
Fig 4. Receiver operating characteristic curve for the Arabic Modified Low Back Pain Disability Questionnaire.
Discussion
In this study, we evaluated the reliability, validity, and sensitivity of the MLBPDQ after translation and cross-cultural adaptation to Arabic. The results showed that this version has excellent reliability, moderate-to-excellent validity, and adequate sensitivity. Because improving patients’ health and healthcare provision are the overriding goals of creating culturally aligned, accurately translated, and rigorously validated health assessments, the Arabic-MLBPDQ can be expected to aid assessment of LBP and associated disability by clinicians in Arabic-speaking communities.
Chang and colleagues [67] cautioned against adapting a direct translated instrument because of language differences, especially those highlighted by idiomatic expressions and colloquial phrases. When adapting an instrument, therefore, the overall goal should be making the instrument widely accepted in the target culture and not including questions that would be outside the respondents’ experiences. In this study, in addition to some grammatical corrections and sentence restructuring, the expert committee recommended four noteworthy modifications.
First, the reviewers unanimously suggested converting the distance unit from miles to kilometers (Section 4 (walking); options 2, 3, and 4). This is because Arabic countries typically use metric units rather than imperial units to measure distance. Although the English MLBPDQ is annotated with conversion of miles to kilometers for selection 2, the annotation does not convert all the options in that section. This might make it difficult for some patients to comprehend those selections. Moreover, the converted distance in the three options was rounded to 1.5, 1, and 0.5 km, respectively, to make it easier for patients to understand. The word “approximately” was also added at the end of each option.
Second, the reviewers suggested changing “I am in bed most of the time and have to crawl to the toilet” to “I am in bed most of the time and cannot go to the toilet without the help of others.” This is because it is very uncommon in Arab cultures for a patient to be in this stage of disability without a relative or caregiver around to help them with their daily living activities. Concurrently, the intended meaning of being bedbound and unable to walk to the toilet independently was retained.
Third, in selection 4 of Section 8 (Social Life), the reviewers suggested adding “to practice social activity” at the end of the sentence. This was to approximate the meaning of “going out” in the English MLBPDQ. Fourth, the reviewers agreed upon adding the word “commuting” to the title of Section 9 to be read as “traveling/commuting.” The reason was that the word “traveling” in Arabic literally means traveling from one city/country to another, which could confuse patients. We believe that these modifications made the Arabic-MLBPDQ more aligned to Arab cultures.
No floor or ceiling effects were detected for the Arabic-MLBPDQ at the three assessment sessions. This indicates a good distribution of scores for the Arabic-MLBPDQ, good content validity, and another indication of adequate reliability [40]. Homogeneity of items is an important feature of a questionnaire, especially if all items are measuring the same construct [40]. In the present study, the obtained internal consistency value of 0.85 indicates good homogeneity of all the 10 items of the adapted questionnaire. It was not too low (i.e., lack of association between the items), nor too high (i.e., redundancy of some items) [40]. In comparison with previous reports, the internal consistency value of the Arabic-MLBPDQ is higher than the value of the Persian-MLBPDQ [38] (see Table 6), and comparable with the values of some validation studies of the ODQ (0.83–0.87) [13,17,19,20,23–25,55,57,59].
Table 6. Psychometric properties of the published Modified Low Back Pain Disability Questionnaire and Arabic Oswestry Disability Questionnaire.
| First Author, Yr. | Adapted To | Internal consistency | Reliability | Construct Validity | Sensitivity | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Cronbach’s α | Days | ICC | SEM | MDC | LOA | Measure | r | Days | AUC | MIC | ||
| Modified Low Back Pain Disability Questionnaire | ||||||||||||
| Current Study | Arabic (SA) | 0.85 | 2 | 0.98 | 1.60 | 4.43 | -4.23 to 7.70 | VAS | 0.62 | 14 | 0.68 | 3 |
| 14 | 0.94 | 2.77 | 7.67 | -6.59 to 13.53 | Quebec | 0.77–0.86 | ||||||
| RM | 0.54–0.63 | |||||||||||
| FABQ | 0.60–0.70 | |||||||||||
| Fritz, 2001 [12] | Original | - | 28 | 0.90 | 5.40 | 12.68 | - | - | - | 28 | 0.94 | 6 |
| Denteneer, 2018 [36] | Dutch | - | 63 | 0.89 | 3.19 | 8.80 | - | RM | 0.69 | 63 | 0.64 | - |
| SF-36 | -0.59 to -0.29 | |||||||||||
| Sakulsriprasert, 2006 [37] | Thai | - | 20–30 min | 0.98 | - | - | - | - | - | - | - | - |
| Baradaran, 2016 [38] | Persian (Iran) | 0.69 | Not reported | 0.68 | - | - | - | SF-36 | -0.55 to -0.18 | - | - | - |
| Arabic Oswestry Disability Questionnaire | ||||||||||||
| Algarni, 2014 [14] | Arabic (SA) | 0.89 | 2 | 0.99 | - | - | - | VAS | 0.71 | - | - | - |
| Quebec | 0.79 | |||||||||||
| RM | 0.66 | |||||||||||
| Guermazi, 2004 [15] | Arabic (Tunisia) | 0.76 (items 3, 4, 6, 7 & 8). 0.70 (items 1, 2 & 5). |
3 | 0.98 | - | - | - | VAS-pain | 0.58 | - | - | - |
| VAS-handicap | 0.70 | |||||||||||
| Waddel Index | 0.73 | |||||||||||
| Quebec | 0.87 | |||||||||||
| Ramzy, 2008 [16] | Arabic (UAE) | 0.99 | 2 | 0.99 | - | - | -2.40 to 3.76 | VAS | 0.81–0.90 | - | - | - |
| Squat test | -0.77 to -0.70 | |||||||||||
ICC, intraclass correlation coefficient; SEM, standard error of measurement; MDC, minimal detectable change; LOA, 95% limits of agreement; r, correlation coefficient; AUC, area under the curve; MIC, minimum important change; SA, Saudi Arabia; VAS, Visual Analog Scale; Quebec, Quebec Back Pain Disability Scale; RM, Roland-Morris Disability Questionnaire; FABQ, Fear-Avoidance Beliefs Questionnaire; SF-36, short form health survey questionnaire; UAE, United Arab Emirates.
The test-retest reliability of the Arabic-MLBPDQ was excellent. The noted ICC values were close to previously reported reliability coefficients of the MLBPDQ. For example, the English MLBPDQ showed an excellent ICC value of 0.90 at four-weeks’ follow-up [12]. The reliability of the Dutch MLBPDQ, although with a longer follow-up period (nine weeks), was also excellent (0.89) [36]. The Thai version demonstrated a total ICC value of 0.98, but with 20 to 30 minutes of inter-administration time [37]. Regarding the original ODQ, the Arabic-SA [14] and the Arabic-Tunisian [15] versions had excellent reliability, with ICC values of 0.999 and 0.98, respectively (two to four days’ follow-up). These values are comparable with the reliability coefficients reported in the current study (Table 6).
It is important to note that ICC value alone does not provide enough information about measurement error of an instrument [68]. Therefore, we calculated the SEM for the Arabic-MLBPDQ, which is an estimate of measurement error. The less SEM, the more reliability of that instrument [48]. The SEM is also used to calculate the MDC, which reflects the smallest change in score for one person beyond measurement error [40,48]. For instance, the MDC95% value of 7.67 calculated for 14 days indicates that, for a specific patient, a change of more than 8 points is most likely due to true change in the functional disability status of that patient rather than measurement error. This threshold is relativity less than the values reported in most of the previous validation studies of the ODQ (ranging from 9 to 13) [13,21,22,27,32,50,51], and the MLBPDQ (8.8) [36]. The SEM and MDC95% of the Arabic-MLBPDQ reported in this study suggest the absolute reliability of the questionnaire.
Another measure of reliability assessed in this study are the LOA, which represent the degree of agreement of scores obtained on two different occasions [48]. The 14-day LOA analysis of the Arabic-MLBPDQ indicates that a deterioration more than 14 points and improvement more than seven points is considered a true change in a patient’s disability status at a 95% confidence level [30]. When comparing the LOA of the Arabic-MLBPDQ with other versions validated previously, the upper limit is extremely similar to the values calculated for the Chinese (13.7) [32] and the Danish (12.4–13.6) [30] versions of the ODQ; however, the lower limit is less (-12.5 and -9.2 to -12.7 for the Chinese and the Danish, respectively). The Arabic ODQ-United Arab Emirates (UAE) [16] showed narrower limits of agreement of -2.4 to 3.76 at 95% confidence level for two days retest (Table 6).
It has been recommended that a priori hypotheses need to be stated when evaluating the construct validity of an instrument [40]. This is to avoid potential risk of bias when interpreting the correlations with other instruments. In this study, the construct validity of the Arabic-MLBPDQ was supported by confirming three out of four (75%) of the predefined hypotheses. The Arabic-MLBPDQ showed significant excellent correlation with the Arabic Quebec, which is similar to the reported correlation between the Arabic-SA ODQ (r = 0.792) [14] and the Arabic-Tunisian ODQ (r = 0.86) [15] with the Arabic Quebec. Furthermore, the moderate correlation values calculated in this report with the RM were comparable with the correlations reported between the two questionnaires in Dutch (r = 0.69) [36], and the Arabic-SA ODQ with the Arabic RM (r = 0.656) [14] (Table 6), but slightly less than the values calculated in previous validation studies of the ODQ in other languages [25,27,29,53,55,56,59]. Similarly, a moderate degree of association was detected between the Arabic-MLBPDQ and VAS. This value is similar to the values obtained in other reports [13,32,50,54,55], and slightly higher than the one obtained between the Arabic-UAE ODQ and VAS [16]. In term of association between the Arabic-MLBPDQ and FABQ, it was stronger than the values reported for the Hausa version of the ODQ (r = 0.19) [13]. This association value provides further information about the direct proportionality of fear-avoidance beliefs with self-reported disability due to LBP [69–75].
The sensitivity to change of the Arabic-MLBPDQ as indicated by the AUC value is similar to the sensitivity of the Dutch version (AUC = 0.64) [36]. However, the English version of the MLBPDQ achieved excellent sensitivity of AUC = 0.94 [12] (Table 6). A possible explanation for the higher sensitivity value of the English version of the MLBPDQ is that re-administration time was after four weeks. On the other hand, the Arabic-MLBPDQ was re-administered two weeks after baseline. This might have slightly decreased the likelihood of detecting changes in patients’ condition; however, we believe that the sensitivity value described in this study highlights the usefulness of the Arabic-MLBPDQ.
Another measure of responsiveness evaluated in this study is the MIC. The MIC, also called the minimal clinically important difference and the minimal clinically important change [48], is interpreted as the smallest change in score in the construct measured that is considered useful by the patient. Consequently, this change would lead to an adjustment of the patient’s management in the absence of excessive side effects and extra costs [76]. It is suggested that the MIC should be greater than the MDC for an instrument be able to differentiate minimum important change from measurement error [40]. The obtained MIC of 3 points for the Arabic-MLBPDQ is less than the MDC of 7.67 points. Similar relationship between the MIC and MDC was also calculated for the English version of the MLBPDQ in three previous studies (6 vs. 12.6) [12], (9 vs. 12.8) [34], and (5 vs. 13.1) [35]. This was also the case in several responsiveness studies of the ODQ [52,62,77]. Some studies attributed that to the anchor used for calculation, the global change scale, which could be very subjective and influenced by recall bias [62,77]. Therefore, and since the MDC value of the Arabic-MLBPDQ exceeds the MIC, and it is relatively well above the SEM, we suggest considering a change of more than 8 points (i.e., the MDC95%) after two-week of treatment as a true change in patient status [35], as described earlier in the discussion.
A potential limitation of this study is that the patient sample group was drawn from a single Arab country, Saudi Arabia. However, we believe this will have a minimal effect on the generalizability of the results, because the translation and adaptation of the MLBPDQ was completed using Modern Standard Arabic, the language used in books, newspapers, magazines, media, formal speech, and communications and the most common form of Arabic taught in primary education in all Arab countries [78,79]. Further, the Arabic-MLBPDQ was tested among literate patients only. We recommend evaluating the psychometric properties among nonliterates as well, similar to the work done by Adamu and colleagues [13]. Another limitation of our study was not including the forward and backward translators on the expert committee. The principle investigator was a part of the committee and could deliver any questions or queries raised by the members, and the committee raised no questions to the translators during the meeting, but we believe that the translators’ presence could have made the discussion more productive. An additional limitation was our using a two-day interval to measure the test-retest reliability of the Arabic-MLBPDQ. Although a two-day interval is not uncommon in the previous validation studies of the ODQ [14,16,19,22,27,29,32,51,53,54,58,59], and the reliability coefficients obtained after two days and after 14 days are comparable, the risk of memory effect cannot be excluded with such a short interval. Finally, the MIC value computed in this study for the Arabic-MLBPDQ should be interpreted with caution because it is within the MDC. We recommend further research to be conducted in this area.
In conclusion, our study showed that the Arabic-MLBPDQ is a psychometrically valid, reliable, and, to some degree, sensitive tool to assess disability level in patients with LBP. We suggest that clinicians and researchers utilize this Arabic version of the MLBPDQ in their practice to monitor Arabic-speaking patients with LBP.
Supporting information
(XLSX)
Acknowledgments
We acknowledge the respondents who volunteered for this study. We also acknowledge Dr. Mishal M. Aldaihan from King Saud University for his help in data analyses and interpretation of the results.
Data Availability
Supporting Information files previously appended to this article and Figs 1–3 were removed at the time of the article’s retraction; the full underlying data for this study are not publicly available. See the accompanying retraction notice for more information.
Funding Statement
This study was supported by grants from the Deanship of Scientific Research, University of Tabuk, Tabuk, Saudi Arabia (Grant no. S-1438-0076) awarded to HSA - website: https://www.ut.edu.sa/web/deanship-of-scientific-research/home The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1.Hoy D, March L, Brooks P, et al. The global burden of low back pain: Estimates from the global burden of disease 2010 study. Ann Rheum Dis. 2014;73(6):968–974. doi: 10.1136/annrheumdis-2013-204428 [DOI] [PubMed] [Google Scholar]
- 2.Morris J, Watson PJ. Investigating decisions to absent from work with low back pain: A study combining patient and GP factors. Eur J Pain. 2011;15(3):278–285. doi: 10.1016/j.ejpain.2010.07.002 [DOI] [PubMed] [Google Scholar]
- 3.Koes BW, van Tulder MW, Thomas S. Diagnosis and treatment of low back pain. BMJ. 2006;332(7555):1430–1434. doi: 10.1136/bmj.332.7555.1430 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Stewart WF, Ricci JA, Chee E, Morganstein D, Lipton R. Lost productive time and cost due to common pain conditions in the US workforce. JAMA. 2003;290(18):2443–2454. doi: 10.1001/jama.290.18.2443 [DOI] [PubMed] [Google Scholar]
- 5.Deyo RA, Battie M, Beurskens AJ, et al. Outcome measures for low back pain research. A proposal for standardized use. Spine (Phila Pa 1976). 1998;23(18):2003–2013. [DOI] [PubMed] [Google Scholar]
- 6.Fairbank JC, Couper J, Davies JB, O'Brien JP. The Oswestry Low Back Pain Disability Questionnaire. Physiotherapy. 1980;66(8):271–273. [PubMed] [Google Scholar]
- 7.Baker DJ, Pynsent PB, Fairbank JC. The Oswestry Disability Index revisited: its reliability, repeatability, and validity, and a comparison with the St. Thomas Disability Index. In Roland M, Jenner J eds. Back pain: New Approaches to Rehabilitation and Education. Manchester, England: Manchester University Press, 1989:174–186. [Google Scholar]
- 8.Pynsent PB, Fairbank JC. Computer interview system for patients with back pain. J Biomed Eng. 1989;11(1):25–29. doi: 10.1016/0141-5425(89)90161-1 [DOI] [PubMed] [Google Scholar]
- 9.Meade TW, Dyer S, Browne W, Frank AO. Randomised comparison of chiropractic and hospital outpatient management for low back pain: Results from extended follow up. BMJ. 1995;311(7001):349–351. doi: 10.1136/bmj.311.7001.349 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Boden SD. Outcome assessment after spinal fusion: Why and how? Orthop Clin North Am. 1998;29(4):717–728. doi: 10.1016/s0030-5898(05)70043-2 [DOI] [PubMed] [Google Scholar]
- 11.Hupli M, Sainio P, Hurri H, Alaranta H. Comparison of trunk strength measurements between two different isokinetic devices used at clinical settings. J Spinal Disord. 1997;10(5):391–397. [PubMed] [Google Scholar]
- 12.Fritz JM, Irrgang JJ. A comparison of a modified Oswestry Low Back Pain Disability Questionnaire and the Quebec Back Pain Disability Scale. Phys Ther. 2001;81(2):776–788. doi: 10.1093/ptj/81.2.776 [DOI] [PubMed] [Google Scholar]
- 13.Adamu AS, Ibrahim AA, Rufa’i YA, Akindele MO, Kaka B, Mukhtar NB. Cross-cultural adaptation and validation of the Hausa version of the Oswestry Disability Index 2.1 a for patients with low back pain. Spine (Phila Pa 1976). 2019;44(18):E1092–E1102. [DOI] [PubMed] [Google Scholar]
- 14.Algarni AS, Ghorbel S, Jones JG, Guermazi M. Validation of an Arabic version of the Oswestry Index in Saudi Arabia. Ann Phys Rehabil Med. 2014;57(9–10):653–663. doi: 10.1016/j.rehab.2014.06.006 [DOI] [PubMed] [Google Scholar]
- 15.Guermazi M, Mezghani M, Ghroubi S, et al. The Oswestry Index for low back pain translated into Arabic and validated in an Arab population. Ann Readapt Med Phys. 2005;48(1):1–10. doi: 10.1016/j.annrmp.2004.06.055 [DOI] [PubMed] [Google Scholar]
- 16.Ramzy R. Validation of the Arabic version of the Oswestry Disability Index developed in Tunisia for low back pain patients in the UAE. M.Sc. Thesis, Stellenbosch University. 2008. Available from: http://scholar.sun.ac.za/handle/10019.1/2402.
- 17.Fujiwara A, Kobayashi N, Saiki K, Kitagawa T, Tamai K, Saotome K. Association of the Japanese Orthopaedic Association score with the Oswestry Disability Index, Roland-Morris Disability Questionnaire, and short-form 36. Spine (Phila Pa 1976). 2003;28(14):1601–1607. [PubMed] [Google Scholar]
- 18.Mousavi SJ, Parnianpour M, Mehdian H, Montazeri A, Mobini B. The Oswestry Disability Index, the Roland-Morris Disability Questionnaire, and the Quebec Back Pain Disability Scale: Translation and validation studies of the Iranian versions. Spine (Phila Pa 1976). 2006;31(14):E454–E459. [DOI] [PubMed] [Google Scholar]
- 19.Kim DY, Lee SH, Lee HY, et al. Validation of the Korean version of the Oswestry Disability Index. Spine (Phila Pa 1976). 2005;30(5):E123–E127. [DOI] [PubMed] [Google Scholar]
- 20.Domazet I, Nemir J, Barl P, Đurić KS, Pašalić I, Barić H, et al. Validation of the Croatian version of the Oswestry Disability Index. Eur. Spine J. 2018;27(11):2814–2822. doi: 10.1007/s00586-018-5757-z [DOI] [PubMed] [Google Scholar]
- 21.Valasek T, Varga PP, Szövérfi Z, Kümin M, Fairbank J, Lazary A. Reliability and validity study on the Hungarian versions of the Oswestry Disability Index and the Quebec Back Pain Disability Scale. Eur. Spine J. 2013;22(5):1010–1018. doi: 10.1007/s00586-012-2645-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Miekisiak G, Kollataj M, Dobrogowski J, Kloc W, Libionka W, Banach M, et al. Validation and cross-cultural adaptation of the Polish version of the Oswestry Disability Index. Spine (Phila Pa 1976). 2013;38(4):E237–E243. [DOI] [PubMed] [Google Scholar]
- 23.Payares K, Lugo LH, Morales V, Londono A. Validation in Colombia of the Oswestry Disability Questionnaire in patients with low back pain. Spine (Phila Pa 1976). 2011;36(26):E1730–E1735. [DOI] [PubMed] [Google Scholar]
- 24.Pekkanen L, Kautiainen H, Ylinen J, Salo P, Häkkinen A. Reliability and validity study of the Finnish version 2.0 of the Oswestry Disability Index. Spine (Phila Pa 1976). 2011;36(4):332–338. [DOI] [PubMed] [Google Scholar]
- 25.Monticone M, Baiardi P, Ferrari S, Foti C, Mugnai R, Pillastrini P, et al. Development of the Italian version of the Oswestry Disability Index (ODI-I): A cross-cultural adaptation, reliability, and validity study. Spine (Phila Pa 1976). 2009;34(19):2090–2095. [DOI] [PubMed] [Google Scholar]
- 26.Osthus H, Cziske R, Jacobi E. Cross-cultural adaptation of a German version of the Oswestry Disability Index and evaluation of its measurement properties. Spine (Phila Pa 1976). 2006;31(14):E448–E453. [DOI] [PubMed] [Google Scholar]
- 27.Mannion AF, Junge A, Fairbank JC, Dvorak J, Grob D. Development of a German version of the Oswestry disability index. Part 1: Cross-cultural adaptation, reliability, and validity. Eur Spine J. 2006;15(1):55–65. doi: 10.1007/s00586-004-0815-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Yu EM, Nosova EV, Falkenstein Y, Prasad P, Leasure JM, Kondrashov DG. Validation of a Russian Language Oswestry Disability Index Questionnaire. Global Spine J. 2016;6(7):636–639. doi: 10.1055/s-0035-1570085 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Denis I, Fortin L. Development of a French-Canadian version of the Oswestry Disability Index: cross-cultural adaptation and validation. Spine (Phila Pa 1976) 2012;37(7):E439–E444. [DOI] [PubMed] [Google Scholar]
- 30.Lauridsen HH, Hartvigsen J, Manniche C, Korsholm L, Grunnet-Nilsson N. Danish version of the Oswestry Disability Index for patients with low back pain. Part 1: Cross-cultural adaptation, reliability and validity in two different populations. Eur. Spine J. 2006;15(11):1705–1716. doi: 10.1007/s00586-006-0117-9 [DOI] [PubMed] [Google Scholar]
- 31.Liu H, Tao H, Luo Z. Validation of the simplified Chinese version of the Oswestry Disability Index. Spine (Phila Pa 1976). 2009;34(11):1211–1216. [DOI] [PubMed] [Google Scholar]
- 32.Lue Y, Hsieh C, Huang M, Lin G, Lu Y. Development of a Chinese version of the Oswestry Disability Index version 2.1. Spine (Phila Pa 1976). 2008;33(21):2354–2360. [DOI] [PubMed] [Google Scholar]
- 33.Bahouq H, Fadoua A, Hanan R, Ihsane H, Najia H. Profile of sexuality in Moroccan chronic low back pain patients. BMC Musculoskelet Disord. 2013;14(1):63. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Childs MJD, Piva SR. Psychometric properties of the functional rating index in patients with low back pain. Eur Spine J. 2005;14(10):1008–1012. doi: 10.1007/s00586-005-0900-z [DOI] [PubMed] [Google Scholar]
- 35.Cleland JA, Whitman JM, Houser JL, Wainner RS, Childs JD. Psychometric properties of selected tests in patients with lumbar spinal stenosis. Spine J. 2012;12(10):921–931. doi: 10.1016/j.spinee.2012.05.004 [DOI] [PubMed] [Google Scholar]
- 36.Denteneer L, Van Daele U, Truijen S, et al. The modified low back pain disability questionnaire: reliability, validity, and responsiveness of a Dutch language version. Spine (Phila Pa 1976). 2018;43(5):E292–E298. [DOI] [PubMed] [Google Scholar]
- 37.Sakulsriprasert P, Vachalathiti R, Vongsirinavarat M, Kantasorn J. Cross-cultural adaptation of modified Oswestry Low Back Pain Disability Questionnaire to Thai and its reliability. J Med Assoc Thai. 2006;89(10):1694–1701. [PubMed] [Google Scholar]
- 38.Baradaran A, Ebrahimzadeh MH, Birjandinejad A, Kachooei AR. Cross-cultural adaptation, validation, and reliability testing of the Modified Oswestry Disability Questionnaire in Persian population with low back pain. Asian Spine J. 2016;10(2):215–219. doi: 10.4184/asj.2016.10.2.215 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Beaton DE, Bombardier C, Guillemin F, Ferraz MB. Guidelines for the process of cross-cultural adaptation of self-report measures. Spine (Phila Pa 1976). 2000;25(24):3186–3191. [DOI] [PubMed] [Google Scholar]
- 40.Terwee CB, Bot SD, de Boer MR, et al. Quality criteria were proposed for measurement properties of health status questionnaires. J Clin Epidemiol. 2007;60(1):34–42. doi: 10.1016/j.jclinepi.2006.03.012 [DOI] [PubMed] [Google Scholar]
- 41.Alanazi F, Gleeson P, Olson S, Roddey T. Translation and validation of the Arabic version of the Fear-Avoidance Beliefs Questionnaire in patients with low back pain. Spine (Phila Pa 1976). 2017;42(7):E411–E416. [DOI] [PubMed] [Google Scholar]
- 42.Alnahhal A, May S. Validation of the Arabic version of the Quebec Back Pain Disability Scale. Spine (Phila Pa 1976). 2012;37(26):E1645–E1650. [DOI] [PubMed] [Google Scholar]
- 43.Maki D, Rajab E, Watson PJ, Critchley DJ. Cross-cultural translation, adaptation, and psychometric testing of the Roland-Morris Disability Questionnaire into Modern Standard Arabic. Spine (Phila Pa 1976). 2014;39(25):E1537–E1544. [DOI] [PubMed] [Google Scholar]
- 44.Waddell G. Clinical assessment of lumbar impairment. Clin Orthop Relat Res. 1987;(221):110–120. [PubMed] [Google Scholar]
- 45.Davidson M, Keating JL. A comparison of five low back disability questionnaires: Reliability and responsiveness. Phys Ther. 2002;82(1):8–24. doi: 10.1093/ptj/82.1.8 [DOI] [PubMed] [Google Scholar]
- 46.Koo TK, Li MY. A guideline of selecting and reporting intraclass correlation coefficients for reliability research. J Chiropr Med. 2016;15(2):155–163. doi: 10.1016/j.jcm.2016.02.012 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Cronbach LJ. Coefficient alpha and the internal structure of tests. Psychometrika. 1951;16(3):297–334. [Google Scholar]
- 48.Portney LG, Watkins MP. Foundations of clinical research: Applications to practice. Vol. 2. Prentice Hall Upper Saddle River, NJ; 2000. [Google Scholar]
- 49.Bland JM, Altman D. Statistical methods for assessing agreement between two methods of clinical measurement. The lancet. 1986;327(8476):307–310. [PubMed] [Google Scholar]
- 50.Mohan V, Prashanth G, Meravanigi G, Rajagopalan N, Yerramshetty J. Adaptation of the Oswestry Disability Index to Kannada language and evaluation of its validity and reliability. Spine (Phila Pa 1976). 2016;41(11):E674–E680. [DOI] [PubMed] [Google Scholar]
- 51.Grotle M, Brox JI, Vollestad NK. Cross-cultural adaptation of the Norwegian versions of the Roland-Morris Disability Questionnaire and the Oswestry Disability Index. J Rehabil Med. 2003;35(5):241–247. doi: 10.1080/16501970306094 [DOI] [PubMed] [Google Scholar]
- 52.Monticone M, Baiardi P, Ferrari S, Foti C, Mugnai R, Pillastrini P, et al. Development of the Italian version of the Oswestry Disability Index (ODI-I): A cross-cultural adaptation, reliability, and validity study. Spine (Phila Pa 1976). 2009;34(19):2090–2095. [DOI] [PubMed] [Google Scholar]
- 53.Vincent JI, Macdermid JC, Grewal R, Sekar VP, Balachandran D. Translation of Oswestry Disability Index into Tamil with cross cultural adaptation and evaluation of reliability and validity. Open Orthop J. 2014;8:11–19. doi: 10.2174/1874325001408010011 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Joshi VD, Raiturker PPP, Kulkarni AA. Validity and reliability of English and Marathi Oswestry Disability Index (version 2.1 a) in Indian population. Spine (Phila Pa 1976). 2013;38(11):E662–E668. [DOI] [PubMed] [Google Scholar]
- 55.Vigatto R, Alexandre NM, Correa Filho HR. Development of a Brazilian Portuguese version of the Oswestry Disability Index: Cross-cultural adaptation, reliability, and validity. Spine (Phila Pa 1976) 2007;32(4):481–486. [DOI] [PubMed] [Google Scholar]
- 56.Yakut E, Düger T, Öksüz Ç, Yörükan S, Üreten K, Turan D, et al. Validation of the Turkish version of the Oswestry Disability Index for patients with low back pain. Spine (Phila Pa 1976). 2004;29(5):581–585. [DOI] [PubMed] [Google Scholar]
- 57.Boscainos PJ, Sapkas G, Stilianessi E, Prouskas K, Papadakis SA. Greek versions of the Oswestry and Roland-Morris disability questionnaires. Clin Orthop Relat Res. 2003;411:40–53. [DOI] [PubMed] [Google Scholar]
- 58.Shah S, Balaganapathy M. Reliability and validity study of the Gujarati version of the Oswestry Disability Index 2.1 a. J Back Musculoskelet Rehabil. 2017;30(5):1103–1109. doi: 10.3233/BMR-169728 [DOI] [PubMed] [Google Scholar]
- 59.Vogler D, Paillex R, Norberg M, de Goumoens P, Cabri J. Cross-cultural validation of the Oswestry disability index in French. Ann Readapt Med Phys 2008;51(5):379–385. doi: 10.1016/j.annrmp.2008.03.006 [DOI] [PubMed] [Google Scholar]
- 60.Gamus D, Glasser S, Langner E, Beth-Hakimian A, Caspi I, Carmel N, et al. Psychometric properties of the Hebrew version of the Oswestry Disability Index. J Back Musculoskelet Rehabil. 2017;30(1):135–143. [DOI] [PubMed] [Google Scholar]
- 61.Beurskens A, De Vet H, Köke A. Responsiveness of functional status in low back pain: a comparison of different instruments. Pain. 1996;65(1):71–76. doi: 10.1016/0304-3959(95)00149-2 [DOI] [PubMed] [Google Scholar]
- 62.Ma C, Wu S, Xiao L, Xue Y. Responsiveness of the Chinese version of the Oswestry disability index in patients with chronic low back pain. Eur Spine J. 2011;20(3):475–481. doi: 10.1007/s00586-010-1624-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Coelho RA, Siqueira FB, Ferreira PH, Ferreira ML. Responsiveness of the Brazilian–Portuguese version of the Oswestry Disability Index in subjects with low back pain. Eur Spine J. 2008;17(8):1101–1106. doi: 10.1007/s00586-008-0690-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Lauridsen HH, Hartvigsen J, Manniche C, Korsholm L, Grunnet-Nilsson N. Responsiveness and minimal clinically important difference for pain and disability instruments in low back pain patients. BMC Musculoskelet Disord. 2006;7(1):82. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Deyo RA, Centor RM. Assessing the responsiveness of functional scales to clinical change: an analogy to diagnostic test performance. J Chronic Dis 1986;39(11):897–906. doi: 10.1016/0021-9681(86)90038-x [DOI] [PubMed] [Google Scholar]
- 66.Habibzadeh F, Habibzadeh P, Yadollahie M. On determining the most appropriate test cut-off value: the case of tests with continuous results. Biochem Med (Zagreb). 2016;26(3):297–307. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Chang AM, Chau JP, Holroyd E. Translation of questionnaires and issues of equivalence. J Adv Nurs. 1999;29(2):316–322. doi: 10.1046/j.1365-2648.1999.00891.x [DOI] [PubMed] [Google Scholar]
- 68.Harris KD, Heer DM, Roy TC, Santos DM, Whitman JM, Wainner RS. Reliability of a measurement of neck flexor muscle endurance. Phys Ther 2005;85(12):1349–1355. [PubMed] [Google Scholar]
- 69.Buragadda S, Aleisa ES, Melam GR. Fear avoidance beliefs and disability among women with low back pain. Neuropsychiatry. 2018;8(1):80–86. [Google Scholar]
- 70.Chung EJ, Hur YG, Lee BH. A study of the relationship among fear-avoidance beliefs, pain and disability index in patients with low back pain. J Exerc Rehabil 2013;9(6):532–535. doi: 10.12965/jer.130079 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Grotle M, Vøllestad NK, Veierød MB, Brox JI. Fear-avoidance beliefs and distress in relation to disability in acute and chronic low back pain. Pain 2004;112(3):343–352. doi: 10.1016/j.pain.2004.09.020 [DOI] [PubMed] [Google Scholar]
- 72.Fritz JM, George SZ, Delitto A. The role of fear-avoidance beliefs in acute low back pain: relationships with current and future disability and work status. Pain 2001;94(1):7–15. doi: 10.1016/s0304-3959(01)00333-5 [DOI] [PubMed] [Google Scholar]
- 73.Cai C, Pua Y, Lim KC. Correlates of self-reported disability in patients with low back pain: the role of fear-avoidance beliefs. Ann Acad Med Singapore. 2007;36(12):1013. [PubMed] [Google Scholar]
- 74.George SZ, Fritz JM, McNeil DW. Fear-avoidance beliefs as measured by the Fear-Avoidance Beliefs Questionnaire: Change in Fear-Avoidance Beliefs Questionnaire is predictive of change in self-report of disability and pain intensity for patients with acute low back pain. Clin J Pain 2006;22(2):197–203. doi: 10.1097/01.ajp.0000148627.92498.54 [DOI] [PubMed] [Google Scholar]
- 75.Sions JM, Hicks GE. Fear-avoidance beliefs are associated with disability in older American adults with low back pain. Phys Ther 2011;91(4):525–534. doi: 10.2522/ptj.20100131 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Jaeschke R, Singer J, Guyatt GH. Measurement of health status. Ascertaining the minimal clinically important difference. Control Clin Trials. 1989;10(4):407–415. doi: 10.1016/0197-2456(89)90005-6 [DOI] [PubMed] [Google Scholar]
- 77.Park KB, Shin J, Lee J, Lee YJ, Kim M, Lee J, et al. Minimum clinically important difference and substantial clinical benefit in pain, functional, and quality of life scales in failed back surgery syndrome patients. Spine (Phila Pa 1976). 2017;42(8):E474–E481. [DOI] [PubMed] [Google Scholar]
- 78.Shaalan K. Rule-based approach in Arabic natural language processing. The International Journal on Information and Communication Technologies (IJICT) 2010;3(3):11–19. [Google Scholar]
- 79.MSA White Paper. 2019 [cited 25 December 2019]. In: Msarabic.com [Internet]. Dubai, United Arab Emirates: MSA Whitepaper. [about 2 screens] Available from: http://www.msarabic.com/index.php/en.
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
(XLSX)
Data Availability Statement
Supporting Information files previously appended to this article and Figs 1–3 were removed at the time of the article’s retraction; the full underlying data for this study are not publicly available. See the accompanying retraction notice for more information.

