Abstract
Background
Outcomes of therapeutic studies in diffuse cutaneous systemic sclerosis (dcSSc) have mainly been measured for specific organs, particularly the skin and lungs. A new composite response index in dcSSc (CRISS) has been developed for clinical trials. The goal of this study was to determine whether, in an observational dcSSc cohort, immunosuppression was associated with global disease improvement measured with the CRISS.
Methods
We conducted a retrospective cohort study in a multi-centered SSc registry comparing 47 patients newly exposed to immunosuppression for ≥ 1 year to 254 unexposed patients. Inverse probability of treatment weighting (IPTW) was performed to create comparable exposed and unexposed groups by balancing for age, sex, disease duration, modified Rodnan skin score (mRSS), forced vital capacity, patient and physician global assessments, and Health Assessment Questionnaire score. A CRISS score ≥ 0.6 at 1 year was defined as improvement.
Results
Exposed patients had shorter disease duration (5.5 versus 11.7 years, p < 0.01), more interstitial lung disease (67.4% versus 40.3%, p < 0.01), and worse physician global severity scores (4.2 versus 2.5 points, p < 0.01) compared to unexposed patients. Improvement in CRISS scores was more common in exposed patients after IPTW (odds ratio 1.85, 95% confidence interval 1.11, 3.09). Of the individual CRISS variables, only mean patient global assessment scores were significantly better among exposed than unexposed patients (− 0.4 versus 0 points, p = 0.03) while other variables including mRSS were similar.
Conclusion
Using a composite response measure, immunosuppression was associated with better outcomes at 1 year in a dcSSc cohort. These results provide real-world data that align with clinical trials to support our current use of immunosuppression.
Keywords: Systemic sclerosis, Scleroderma, Immunosuppression, Outcomes, CRISS
Background
Systemic sclerosis (SSc) is an autoimmune fibrosing disorder that affects the skin and internal organs. Diffuse cutaneous SSc (dcSSc) is characterized by skin thickening extending proximally to the elbows and knees and/or the trunk. It is associated with increased organ involvement, decreased patient function, and worse prognosis [1–5].
The mainstay of treatment in dcSSc is immunosuppression, including methotrexate (MTX), cyclophosphamide (CYC), azathioprine (AZA), and mycophenolate (MPA). Immunosuppression is posited to play a role in reducing early disease activity with the goal of preventing progression and irreversible damage [6]. Current guidelines suggest that immunosuppression should be considered in dcSSc patients [7], especially for the treatment of active lung, skin, or musculoskeletal manifestations [7, 8]. Clinical trials and other studies have shown benefits of immunosuppression on improving or at least stabilizing interstitial lung disease (ILD) with CYC [9] and MPA [10]. There is also some evidence for AZA as maintenance therapy in ILD [11]. MTX, which failed to meet clinical endpoints in one trial, was shown to benefit skin disease when the data was re-analyzed using Bayesian methods [12, 13].
While immunosuppressive therapies have been studied with regard to outcomes in specific organs, the benefit in overall disease has not been well evaluated because of the lack of adequate global measures. The American College of Rheumatology (ACR) approved a provisional composite response index in dcSSc (CRISS) [14] with the goal of improving outcome assessment in clinical trials by better capturing the multifaceted aspects of dcSSc. This measure includes disease areas that are susceptible to improve, including skin involvement based on the modified Rodnan skin score (mRSS), lung involvement based on the forced vital capacity (FVC), patient and physician global assessments of disease, and function based on the Health Assessment Questionnaire Disability Index (HAQ).
The CRISS was developed with a very specific goal, to assess global disease changes in dcSSc patients of < 5 years disease duration included in clinical trials. Although the CRISS was derived from observational dcSSc cohort data, whether it can measure response in this setting remains unknown. However, the main drawback of observational studies is the inherent confounding between treated and untreated groups. In recent years, advanced statistical methods such as propensity weighting, which can produce quasi-randomization, have been developed to overcome this limitation [15]. These methods have been used to ascertain treatment effects for otherwise difficult to answer questions in other SSc and rheumatology cohorts [16, 17].
The goal of this study was to determine whether immunosuppression was associated with global disease improvement in dcSSc in a real-world setting. We hypothesized that, in an observational dcSSc cohort, immunosuppression would be associated with improvement in overall disease measured with the CRISS after applying propensity weighting.
Methods
Study population
The Canadian Scleroderma Research Group (CSRG) registry recruits and follows patients from 15 centers in Canada and Mexico. These centers see local and regional referrals. All patients must have a diagnosis of SSc (confirmed by an experienced rheumatologist), be ≥ 18 years of age, provide informed consent, and be fluent in English, French, or Spanish. Over 98% of the cohort meets the 2013 ACR/EULAR classification criteria for SSc [18]. Patients with dcSSc defined by skin thickening proximal to the elbows or knees and/or trunk at any time enrolled between January 2005 and July 2017 were included in this study. Ethics committee approval for this study was obtained at the Jewish General Hospital, Montreal, Canada (approval number 2019-1597) and at all participating CSRG study sites.
Exposure
Medication exposure was recorded yearly by study physicians and coded as current, past, or never. Patients exposed to immunosuppression (MTX, CYC, AZA, and MPA) at or prior to their initial entry visit into the CSRG (T0) were excluded. The index visit (T1) was defined as the first CSRG visit when exposure to immunosuppression was recorded. Patients were defined as exposed if they were also exposed at the subsequent annual visit (T2) following the index visit. One-year outcome using the CRISS was calculated comparing outcomes at T2 compared to T1. This way, we ensured that exposed patients had received between a minimum of 1 continuous year of treatment and up to a maximum of 2 years. Unexposed patients were those who had never been exposed to immunosuppression at or before CSRG entry and had at least two consecutive follow-up visits for which no treatment was recorded. Unexposed patients were matched to exposed patients based on the time since recruitment into the registry.
Outcomes
The CRISS was developed to assess the likelihood of improvement after 1 year of observation [14]. It consists of two steps. Step 1 identifies patients with significant worsening or new end-organ damage. These patients are automatically defined as not improved and assigned a CRISS score of 0. The criteria for significant worsening or end-organ damage are as follows: new onset scleroderma renal crisis (SRC), new left heart failure with leftventricular ejection fraction ≤ 45% on transthoracic echocardiogram requiring treatment, new pulmonary arterial hypertension (confirmed on right heart catheterization) requiring treatment, ≥ 15% decline in FVC%, new interstitial lung disease (ILD) and FVC% below 80% predicted. Step 2 of the CRISS calculates an estimated improvement after 1 year using the CRISS equation. This takes into account the changes in: mRSS, percent predicted forced vital capacity (FVC%), patient and physician global assessment of disease severity, and HAQ-DI. A final CRISS score after both steps of ≥ 0.6 is considered as improved disease.
For individual CRISS variables, a categorical improvement after 1 year was defined by a favorable change in the absolute difference between measures at T1 and T2 as follows: mRSS change by ≥ 5 points [19], FVC % predicted by ≥ 5% [20], HAQ by ≥ 0.14 points [19], and patient and physician global assessments by ≥ 20% (≥ 2 points) based on previously used cutoffs [21].
Definition of variables
Disease duration was defined from the onset of the first non-Raynaud’s phenomenon symptom to the index visit (T1). Smoking status was classified as either never smoker or past and/or current smoker. Skin involvement was assessed using the modified Rodnan skin score (mRSS), which ranges from 0 (no involvement) to 3 (severe thickening) in 17 areas (score range 0–51). FVC% was extracted from pulmonary function tests. The presence of ILD was determined using a published clinical decision rule [22]. Using this rule, ILD was considered present if a high-resolution computed tomography (HRCT) scan of the lung was interpreted by an experienced radiologist as showing ILD or, in the case where no HRCT was available, if either a chest X-ray was reported as showing either increased interstitial markings (not thought to be due to congestive heart failure) or fibrosis, and/or if a study physician reported the presence of typical “velcro-like crackles” on physical exam. Function was assessed using the HAQ-DI questionnaire which is scored from 0 (no disability) to 3 (severe disability). Patient and physician global assessment scores were rated 0–10 (no disease to very severe disease) on numeric rating scales. For patient assessment scores, patients were asked “in the past week, how was your overall health?”. The physician global severity question asked “How would you rate the patient’s overall health for the past week?”. Other covariates recorded at the index visit included physician reports of inflammatory myositis, arthritis, digital ulcers, prior scleroderma renal crisis, and the gastrointestinal-14 (GI-14) score, a summative score of 14 patient-reported symptoms [23]. The GI-14 correlates well with the UCLA Scleroderma Clinical Trial Consortium GI Tract Instrument [23]. Pulmonary hypertension was defined as an estimated systolic pulmonary artery pressure (sPAP) ≥ 45 mmHg measured using the Doppler flow measurement of the tricuspid regurgitant jet on cardiac echocardiography (used as a non-invasive screening tool for pulmonary hypertension) [24]. Antinuclear antibody was detected by immunofluorescence, and other autoantibodies were detected by line immunoassay (Euroimmun, Lübeck, Germany).
Statistical analysis
Descriptive statistics were used to summarize baseline demographic and clinical characteristics of the exposed and unexposed patients. Continuous variables are presented as mean ± standard deviation (SD), and categorical variables are presented as counts and percentages. Student’s t test and Wilcoxon-Mann-Whitney U test were used to compare continuous variables. Chi-square test and Fisher exact test were used for categorical variables.
Due to the inherent differences between exposed and unexposed patients in an observational study, inverse probability of treatment weighting (IPTW) was used to balance the study groups [25]. Propensity scores were computed using conditional logistic regression by including age, sex, disease duration, and CRISS variables. The weights were calculated as the inverse of the propensity score and applied to each stratum of exposure and covariate. To evaluate residual differences in baseline covariates between the two groups, we calculated standardized differences for each variable. A standardized difference ≤ 0.1 represents meaningful balance [26]. The effect of the exposure on CRISS scores was assessed using weighted linear regression models which generated odds ratios (OR) with 95% confidence intervals (CI) adjusted for age, sex, and disease duration.
Missing data were imputed multiply and longitudinally 50 times at each observation using R version 3.2.0 for Windows (http://r-project.org). All other statistical analyses were performed with SAS v.9.4 (SAS Institute, USA).
Results
There were 433 dcSSc patients without prior exposure to immunosuppression recruited in the CSRG cohort. Of these, 47 were exposed and 254 were unexposed patients for at least 2 consecutive follow-up visits (Fig. 1). None of the patients excluded were due to known deaths. Of the exposed group, 20 (42.6%), 15 (31.9%), 11 (23.4%), and 1 (2.1%) patient(s) were treated with MTX, MPA, CYC, and AZA, respectively. Exposed patients were younger (50.1 ± 10.4 versus 55.1 ± 12.3 years, p = 0.01), had shorter disease duration (5.5 ± 7.4 versus 11.7 ± 9.3 years, p < 0.01), had a higher prevalence of ILD (67.4% versus 40.3%, p < 0.01), and had worse physician global assessments of disease severity (4.2 ± 2.3 versus 2.5 ± 1.9 out of 10 points, p < 0.01) compared to unexposed patients (Table 1). Fewer exposed patients had anti-centromere antibodies (12.5% versus 27.6%, p = 0.04) or a history of inflammatory arthritis (27.7% versus 40.6%, p = 0.01).
Table 1.
Exposed (n = 47) | Unexposed (n = 254) | p | |
---|---|---|---|
Patient characteristics | |||
Median date of cohort entry, mm/yyyy | 12/2008 | 02/2007 | |
Age, mean ± SD | 50.1 ± 10.4 | 55.1 ± 12.3 | 0.01 |
Female, n (%) | 37 (81.5%) | 207 (78.7%) | 0.66 |
Ethnicity, n (%) | 0.39 | ||
Caucasian | 36 (76.6%) | 204 (81.6%) | |
Aboriginal | 1 (2.1%) | 14 (5.6%) | |
Black | 1 (2.1%) | 3 (1.2%) | |
Asian | 1 (2.1%) | 5 (2.0%) | |
Latin America | 2 (4.3%) | 4 (1.6%) | |
Other | 6 (12.8%) | 20 (8.0%) | |
Past and/or current smoker, n (%) | 27 (62.8%) | 148 (60.2%) | 0.50 |
Disease duration (years), mean ± SD | 5.5 ± 7.4 | 11.7 ± 9.3 | < 0.01 |
Interstitial lung disease, n (%) | 31 (67.4%) | 102 (40.3%) | < 0.01 |
Pulmonary hypertension, n (%)^ | 4 (8.7%) | 17 (7.5%) | 0.76 |
Inflammatory myositis, n (%) | 7 (14.9%) | 36 (14.2%) | 0.89 |
Inflammatory arthritis, n (%) | 13 (27.7%) | 103 (40.6%) | 0.01 |
Digital ulcers, n (%) | 29 (61.7%) | 164 (64.6%) | 0.71 |
Prior SRC, n (%) | 5 (10.6%) | 14 (5.5%) | 0.32 |
GI-14 score, mean ± SD | 2.6 ± 2.5 | 2.6 ± 2.5 | 0.93 |
Auto antibodies, n (%)* | |||
Anti-centromere | 5 (12.5%) | 64 (27.6%) | 0.04 |
Anti-topoisomerase I | 11 (27.5%) | 40 (17.2%) | 0.13 |
Anti-RNA polymerase III | 11 (27.5%) | 46 (19.8%) | 0.27 |
Antinuclear antibody (titer ≥ 1/160) | 38 (88.4%) | 219 (92.4%) | 0.37 |
CRISS variables | |||
mRSS, mean ± SD | 15.0 ± 9.4 | 13.1 ± 9.4 | 0.15 |
FVC%, mean ± SD | 88.5 ± 18.7 | 91.4 ± 17.6 | 0.36 |
HAQ, mean ± SD | 0.8 ± 0.7 | 0.7 ± 0.7 | 0.65 |
PTGA disease severity, mean ± SD | 3.5 ± 2.6 | 3.2 ± 2.2 | 0.58 |
PGA disease severity, mean ± SD | 4.2 ± 2.3 | 2.5 ± 1.9 | < 0.01 |
SRC scleroderma renal crisis, GI-14 score Canadian Scleroderma Research Group 14 point score for gastrointestinal involvement, CRISS composite response index in diffuse systemic sclerosis, mRSS modified Rodnan skin score, FVC% percent of predicted forced vital capacity, HAQ Health Assessment Questionnaire, PTGA patient global assessment, PGA physician global assessment
^Only 46 exposed and 227 unexposed patients had known pulmonary hypertension status
*Only 40 exposed and 232 unexposed patients had documented auto antibodies respectively
At 1 year, more patients in the exposed group had overall improvement, defined as a CRISS score ≥ 0.6, compared to the unexposed group (23.4% versus 11.8%, p = 0.03) (Table 2). This is in spite of the fact that more exposed patients had a CRISS = 0 because of significant organ worsening or end-organ damage than unexposed patients (10.6% versus 3.1%, p = 0.02). The only component of the CRISS that was significantly more improved in the exposed group was the mean patient global assessment score (− 0.4 ± 1.8 versus 0 ± 1.6, p = 0.03). The mean changes in other individual CRISS variables (mRSS, FVC%, HAQ-DI, and physician global assessment of disease severity) were small and not different between exposed and unexposed patients. Among subjects with a calculable CRISS (step 2) score, i.e., those who were not automatically assigned a CRISS = 0, the number of subjects with a categorical improvement in each disease measure was examined. The only CRISS variable that was different between exposed and unexposed groups was again patient global assessment scores where 42.9% of the exposed group had improved patient global compared to 26.4% of the unexposed group, p = 0.03 (Table 2). There were no deaths observed after 1 year.
Table 2.
Exposed, n = 47 | Unexposed, n = 254 | p | |
---|---|---|---|
Change in CRISS variables (mean ± SD) | |||
mRSS | − 1.1 ± 6.4 | − 0.3 ± 6.3 | 0.28 |
FVC %predicted | − 1.3 ± 11.1 | − 0.4 ± 7.9 | 0.66 |
HAQ | − 0.1 ± 0.5 | 0 ± 0.3 | 0.56 |
PTGA disease severity | − 0.4 ± 1.8 | 0 ± 1.6 | 0.03 |
PGA disease severity | 0.1 ± 2.1 | − 0.1 ± 2.0 | 0.79 |
CRISS outcomen(%) | |||
Automatically not improved CRISS (step 1)* | 5 (10.6%) | 8 (3.1%) | 0.02 |
Improved CRISS at 1 year | 11 (23.4%) | 30 (11.8%) | 0.03 |
Improvement among subjects with calculated CRISS (step 2)n(%)# | n = 42 | n = 246 | |
Improved mRSS ≥ 5 | 13 (31.0%) | 50 (20.3%) | 0.12 |
Improved FVC %predicted ≥ 5 | 11 (26.2%) | 56 (22.8%) | 0.63 |
Improved HAQ ≥ 0.14 | 13 (31.0%) | 53 (21.5%) | 0.18 |
Improved PTGA disease severity ≥ 2 | 18 (42.9%) | 65 (26.4%) | 0.03 |
Improved PGA disease severity ≥ 2 | 11 (26.2%) | 85 (34.6%) | 0.28 |
mRSS modified Rodnan skin score, FVC% percent of predicted forced vital capacity, HAQ Health Assessment Questionnaire, PTGA patient global assessment, PGA physician global assessment, CRISS composite response index in diffuse systemic sclerosis
*Of those considered not improved, these are patients with significant new or worsening organ involvement who were assigned an automatic CRISS score of 0 without the need for formulaic calculation of CRISS score
#Cutoffs represent the absolute difference in measures after 1 year towards an improved score
Primary outcome in statistically weighted cohort
Statistical balance between exposed and unexposed groups was achieved with IPTW with standardized differences < 0.1 for all potentially confounding variables in the weighted populations (Table 3). In multivariate analyses, we found that prior to weighting, treatment already tended towards a higher likelihood of improvement with an OR (95% CI) of 2.00 (0.82, 4.92), p = 0.13 (Table 4). After weighting, exposed patients were significantly more likely to have improved disease measured with the CRISS than unexposed patients (OR (95% CI) 1.85 (1.11, 3.09), p = 0.02), adjusting for age, sex, and disease duration.
Table 3.
Standardized difference* between groups before IPTW | Standardized difference* between groups after IPTW | |
---|---|---|
Age | 0.44 | 0.04 |
Female sex | 0.07 | 0.02 |
Disease duration | 0.73 | 0.08 |
mRSS | 0.20 | 0.06 |
FVC %predicted | 0.15 | 0.05 |
HAQ | 0.07 | 0.02 |
PTGA disease severity | 0.13 | 0.05 |
PGA disease severity | 0.80 | 0.02 |
IPTW inverse probability of treatment weighting, mRSS modified Rodnan skin score, FVC% percent of predicted forced vital capacity, HAQ Health Assessment Questionnaire, PTGA patient global assessment, PGA physician global assessment
*Standardized difference ≤ 0.1 represents meaningful balance
Table 4.
Before IPTW | After IPTW | |||
---|---|---|---|---|
OR (95% CI) | p values | OR (95% CI) | p values | |
Immunosuppression use | 2.00 (0.82, 4.92) | 0.13 | 1.85 (1.11, 3.09) | 0.02 |
Female | 1.07 (0.41, 2.79) | 0.90 | 1.50 (0.73, 3.05) | 0.26 |
Age | 1.00 (0.97, 1.04) | 0.86 | 1.03 (1.00, 1.05) | 0.04 |
Disease duration | 1.01 (0.96, 1.05) | 0.80 | 1.00 (0.97, 1.02) | 0.81 |
IPTW inverse probability of treatment weighting, OR (95% CI) odds ratio with 95% confidence interval
Post hoc analysis
A post hoc analysis was performed to examine the impact of patient global assessment scores on the CRISS, since there may be a placebo effect. Exposed patients may have been influenced to report more global improvement due to the unblinded nature of the study and was the only variable that was different between CRISS improvers and non-improvers. We tested the robustness of our findings using a “worst-case” scenario. All subjects with an improved patient global at 1 year were assigned a change in patient global = 0 (i.e., no change), while those with worsening patient global scores were unchanged. In this analysis, the association between treatment and improved CRISS after IPTW persisted, with an OR (95% CI) of 1.86 (1.07, 3.24), p = 0.03 (Table 5).
Table 5.
Before IPTW | After IPTW | |||
---|---|---|---|---|
OR (95% CI) | p values | OR (95% CI) | p values | |
Immunosuppression use | 2.08 (0.78, 5.57) | 0.14 | 1.86 (1.07, 3.24) | 0.03 |
Female | 0.89 (0.32, 2.47) | 0.83 | 1.25 (0.58, 2.63) | 0.56 |
Age | 1.00 (0.97, 1.04) | 0.84 | 1.03 (1.00, 1.06) | 0.04 |
Disease duration | 1.02 (0.97, 1.06) | 0.47 | 1.01 (0.98, 1.03) | 0.57 |
IPTW inverse probability of treatment weighting, OR (95% CI) odds ratio with 95% confidence interval
A second post hoc analysis was performed to examine whether patients with improved patient global scores might have more improvement in the other CRISS features. We found that among the exposed patients, those who had improved patient global scores by at least 2 points had significantly better mean change in FVC% (2.2 ± 7.6% improvement versus a − 2.8 ± 6.4% decline, p = 0.04). However, no correlation was found between improved patient global and other mean disease scores when the overall group of both exposed and unexposed patients was assessed. These are shown in supplementary Table 1.
Discussion
This is the first study to investigate global outcomes in an observational cohort of dcSSc patients using the CRISS. As expected, patients exposed to immunosuppression had worse disease compared to unexposed patients, reflecting confounding by indication. However, after weighting observations by the probability of treatment, we were able to balance the groups and thus correct for baseline differences in the exposed and unexposed groups. We found that immunosuppressive treatment was a significant predictor of improvement at 1 year. This was despite the fact that the exposed group was sicker and had more patients with substantially worsened disease after 1 year. Indeed, even if there was residual confounding, our findings of improved CRISS in the exposed group may be conservative estimates of the truth.
It is important to note that, except for the patient global assessment scores, none of the other individual CRISS variables showed statistically significant differences in favor of exposure. The main concern with regard to the validity of this finding remains the possibility of a placebo effect. Since treatment was not blinded, exposure to immunosuppression may have biased patients to report improvement. We therefore performed a post hoc analysis that would address this bias and found that this was not the case. The fact that patient global assessment scores did not significantly impact our findings is in keeping with the original CRISS cohort where patient global assessment was the least predictive of improved CRISS [14]. The difference in patient global scores might otherwise be due to improvement in domains that we did not measure such as pain or fatigue or that the patient’s appreciation of their own disease is more sensitive than other measures. In a second post hoc analysis, we found that FVC% change was more favorable among exposed patients who also reported an improved patient global score than those who did not have an improved patient global. Although this finding is limited by the small number of patients in this subgroup, it suggests that bias from non-blinded exposure to immunosuppression was not the only reason that exposed patients reported global disease improvement.
Our results lend weight to the idea that the mean change in any individual disease measure such as mRSS and FVC may not be representative of the heterogeneous disease changes in a patient. There were significantly more CRISS improvers in spite of little to no difference in the mean change of individual CRISS variables between the exposure groups. It may be that the combination of changes within an individual is more significant than the pooled mean of a specific outcome measure. This supports the concept that a combined response index may be more valuable than organ-specific outcomes [12, 14] and this deserves further examination. Such measures are frequently used in other rheumatologic conditions such as rheumatoid arthritis and lupus for which core outcome measures are composite indices [27, 28].
With regard to immunosuppression, previous studies have shown the effects to be modest at best. One example is the effect of methotrexate on skin involvement which failed to meet trial endpoints until the data was re-analyzed using Bayesian methods [12, 13]. In other cases such as lung disease, immunosuppression achieved only FVC stability [9–11]. Because exposed patients had more severe baseline disease and more significant disease worsening (automatic assigned CRISS score = 0), the lack of worsening of individual CRISS measures among the exposed may be significant in and of itself. However, it is impossible to infer causality or to separate the effects of treatment from the natural course of disease stability. Supporting the idea that a global disease measure may be more appropriate than individual measures to assess response to immunosuppression is a recent observational study by the ESOS group which found no difference in skin changes between exposed and unexposed dcSSc patients, although survival was better among the exposed [29]. Low-dose cyclophosphamide has also shown some benefit when evaluated using another global measure, the Medsger disease severity score [30]. Further support is seen in aggressive treatment with high-dose immunosuppression [31] and in the context of stem cell transplant [32] which has shown improvement in various disease endpoints including global disease measures such as the global rank composite score (GRCS) [33] and the European Scleroderma Research Group Activity Index (EScSG-AI) [34].
Limitations
Assessing the validity of the CRISS was not our goal. The reported sensitivity and specificity of this score for improvement was 98.2% and 93.1%, respectively [14]. In addition, it has been shown to have good face and content validity and good sensitivity to change [35, 36]. Another limitation is that our sample population encompassed a wider spectrum of patients than those for which the CRISS was developed. In particular, the average disease duration was long in both groups, i.e., 5.5 ± 7.4 and 11.7 ± 9.3 years among the exposed and unexposed groups, respectively, whereas the mean disease duration of the CRISS cohort was 2.36 ± 1.5 years [14]. Although the CRISS was created using observational patient cohorts with < 5 years disease duration, the items and weights were developed by asking experts to rate a patient’s improvement given a list of disease variable changes after 1 year. There is no compelling reason to suspect that these conclusions should differ in patients with longer disease duration but showing the same pattern of change.
We were also limited by the low number of exposed patients in the study (n = 47) because patients with prior immunosuppression were excluded (see Fig. 1). This selection also contributed to the longer disease durations, suggesting that our patients may have had a less progressive disease course. Furthermore, certain aspects of dcSSc such as skin involvement tend to stabilize or regress after 5 years. This may account for the lower baseline mRSS scores and explain why only minimal differences were seen in objective disease measures after 1 year between the exposure groups. Despite adjusting for disease duration, exposed patients had earlier disease and thus a higher likelihood of spontaneous regression that is impossible to differentiate from treatment effects. Nevertheless, the use of propensity weighting adjusted for major imbalances between the 2 groups and allowed the assessment of treatment in an observational cohort such as ours. Medication exposure in the registry was assessed at yearly intervals without details regarding dose and start/stop dates. While we do not know the exact duration of treatment, we selected patients who were exposed for at least 2 consecutive yearly visits in order to reduce potential misclassification bias. Finally, while IPTW balancing was statistically valid, we recognize that there may be residual confounding inherent with the observational study design.
Conclusion
After applying the CRISS in a real-world setting beyond the intended clinical trial population, our findings reinforce the hypothesis that immunosuppression may improve overall disease. Further, our findings that organ-specific outcomes did not differ between CRISS improvers and non-improvers may reflect the concept that a composite score within an individual is a more sensitive measure than the aggregate mean of organ-specific measures across individuals.
Supplementary information
Acknowledgements
Canadian Scleroderma Research group (CSRG) authors:
All CSRG members meet criteria for authorship. Members of the CSRG:
Janet E. Pope1, Murray Baron2, Marie Hudson2, Geneviève Gyger2, Maggie J. Larché3, Nader A. Khalidi3, Ariel Masetto4, Evelyn Sutton5, David Robinson6, Tatiana S. Rodriguez-Reyna7, Nancy Maltez8, Doug Smith8, Carter Thorne9, Alena Ikic10, Paul R. Fortin10, Marvin J. Fritzler11
1Western University, Division of Rheumatology, St. Joseph’s Health Care, London, Canada
2McGill University, Division of Rheumatology, Jewish General Hospital, Montreal, Quebec, Canada
3McMaster University, Division of Rheumatology, St Joseph’s Healthcare, Hamilton, Ontario, Canada
4Université de Sherbrooke, Department of Rheumatology, Sherbrooke, Quebec, Canada
5Dalhousie University, Division of Rheumatology, Nova Scotia Rehabilitation Centre, Halifax, Nova Scotia,
6University of Manitoba, Internal Medicine and Rheumatology, Health Sciences Centre, Winnipeg, Manitoba, Canada.
7Instituto Nacional de Ciencias Médicas y Nutrición Salvador Zubiran, Department of Immunology and Rheumatology, Mexico City, Mexico.
8University of Ottawa, Division of Rheumatology, Department of Medicine, Ottawa, Ontario, Canada
9Southlake Regional Health Centre, Division of Rheumatology, Newmarket, Ontario, Canada
10Université Laval, Division of Rhumatologie, CHU de Québec, Québec City, Québec, Canada
11University of Calgary, Department of Medicine, Calgary, Ottawa, Ontario, Canada ethics
This study has been approved by the locally appointed ethics committee (Jewish General Hospital) and at all participating CSRG study sites.
Declarations
This manuscript has not been submitted and is not simultaneously being submitted elsewhere.
Abbreviations
- ACR
American College of Rheumatology
- AZA
Azathioprine
- CRISS
Composite response index in dcSSc
- CSRG
Canadian Scleroderma Research Group
- CYC
Cyclophosphamide
- dcSSc
Diffuse cutaneous systemic sclerosis
- FVC
Forced vital capacity
- GI-14
Gastrointestinal-14 score
- HAQ
Health Assessment Questionnaire Disability Index
- HRCT
High-resolution computed tomography
- ILD
Interstitial lung disease
- IPTW
Inverse probability of treatment weighting
- MPA
Mycophenolate
- mRSS
Modified Rodnan skin score
- MTX
Methotrexate
- sPAP
Systolic pulmonary artery pressure
- SSc
Systemic sclerosis
Authors’ contributions
BZ, MH, and MB made substantial contributions to manuscript drafting. BZ, MH, and MW made substantial contributions to the analysis and interpretation of data. MB made substantial contributions to the study conception, design, and interpretation of data. The CSRG made substantial contributions to the acquisition of data. All authors read and approved the final manuscript.
Funding
The CSRG has been supported by the Canadian Institutes of Health Research; the Scleroderma Society of Canada and its Chapters of Ontario, Saskatchewan and Québec; Cure Scleroderma Foundation; INOVA Diagnostics Inc. (San Diego, CA); Dr. FookeLaboratorien GmbH (Neuss, Germany); Euroimmun (Lubeck, Germany); Mikrogen GmbH (Neuried, Germany); Fonds de la recherche en santé du Québec; the Canadian Arthritis Network; and the Lady Davis Institute of the Jewish General Hospital, Montreal, QC.
Availability of data and materials
The data that support the findings of this study are available from the CSRG but restrictions apply to the availability of these data, which were used under license for the current study, and so are not publicly available. Data are however available from the authors upon reasonable request and with permission of the CSRG.
Consent for publication
No individual person’s data in any form (including any individual details, images or videos) is contained in the manuscript. All participants in the CSRG have agreed that de-identified and aggregate data be made available for scientific publications.
Competing interests
MB has received funding from Corbus as an investigator in an unrelated study using CRISS as an outcome measure. The other authors have no disclosures.
Footnotes
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Contributor Information
Murray Baron, Email: mbaron@jgh.mcgill.ca.
for the Canadian Scleroderma Research Group:
Janet E. Pope, Murray Baron, Marie Hudson, Geneviève Gyger, Maggie J. Larché, Nader A. Khalidi, Ariel Masetto, Evelyn Sutton, David Robinson, Tatiana S. Rodriguez-Reyna, Nancy Maltez, Doug Smith, Carter Thorne, Alena Ikic, Paul R. Fortin, and Marvin J. Fritzler
Supplementary information
Supplementary information accompanies this paper at 10.1186/s13075-020-02220-0.
References
- 1.Clements PJ, Hurwitz EL, Wong WK, Seibold JR, Mayes M, White B, et al. Skin thickness score as a predictor and correlate of outcome in systemic sclerosis: high-dose versus low-dose penicillamine trial. Arthritis Rheum. 2000;43(11):2445–2454. doi: 10.1002/1529-0131(200011)43:11<2445::AID-ANR11>3.0.CO;2-Q. [DOI] [PubMed] [Google Scholar]
- 2.Geirsson AJ, Wollheim FA, Akesson A. Disease severity of 100 patients with systemic sclerosis over a period of 14 years: using a modified Medsger scale. Ann Rheum Dis. 2001;60(12):1117–1122. doi: 10.1136/ard.60.12.1117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Hachulla E, Carpentier P, Gressin V, Diot E, Allanore Y, Sibilia J, et al. Risk factors for death and the 3-year survival of patients with systemic sclerosis: the French ItinerAIR-Sclerodermie study. Rheumatology (Oxford) 2009;48(3):304–308. doi: 10.1093/rheumatology/ken488. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Scussel-Lonzetti L, Joyal F, Raynauld JP, Roussin A, Rich E, Goulet JR, et al. Predicting mortality in systemic sclerosis: analysis of a cohort of 309 French Canadian patients with emphasis on features at diagnosis as predictive factors for survival. Medicine (Baltimore) 2002;81(2):154–167. doi: 10.1097/00005792-200203000-00005. [DOI] [PubMed] [Google Scholar]
- 5.LeRoy EC, Black C, Fleischmajer R, Jablonska S, Krieg T, Medsger TA, Jr, et al. Scleroderma (systemic sclerosis): classification, subsets and pathogenesis. J Rheumatol. 1988;15(2):202–205. [PubMed] [Google Scholar]
- 6.Cappelli S, Bellando-Randone S, Guiducci S, Matucci-Cerinic M. Is immunosuppressive therapy the anchor treatment to achieve remission in systemic sclerosis? Rheumatology (Oxford) 2014;53(6):975–987. doi: 10.1093/rheumatology/ket312. [DOI] [PubMed] [Google Scholar]
- 7.Denton CP, Hughes M, Gak N, Vila J, Buch MH, Chakravarty K, et al. BSR and BHPR guideline for the treatment of systemic sclerosis. Rheumatology (Oxford) 2016;55(10):1906–10. doi: 10.1093/rheumatology/kew224. [DOI] [PubMed] [Google Scholar]
- 8.Kowal-Bielecka O, Fransen J, Avouac J, Becker M, Kulak A, Allanore Y, et al. Update of EULAR recommendations for the treatment of systemic sclerosis. Ann Rheum Dis. 2017;76(8):1327–1339. doi: 10.1136/annrheumdis-2016-209909. [DOI] [PubMed] [Google Scholar]
- 9.Tashkin DP, Elashoff R, Clements PJ, Goldin J, Roth MD, Furst DE, et al. Cyclophosphamide versus placebo in scleroderma lung disease. N Engl J Med. 2006;354(25):2655–2666. doi: 10.1056/NEJMoa055120. [DOI] [PubMed] [Google Scholar]
- 10.Tashkin DP, Roth MD, Clements PJ, Furst DE, Khanna D, Kleerup EC, et al. Mycophenolate mofetil versus oral cyclophosphamide in scleroderma-related interstitial lung disease (SLS II): a randomised controlled, double-blind, parallel group trial. Lancet Respir Med. 2016;4(9):708–719. doi: 10.1016/S2213-2600(16)30152-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Hoyles RK, Ellis RW, Wellsbury J, Lees B, Newlands P, Goh NS, et al. A multicenter, prospective, randomized, double-blind, placebo-controlled trial of corticosteroids and intravenous cyclophosphamide followed by oral azathioprine for the treatment of pulmonary fibrosis in scleroderma. Arthritis Rheum. 2006;54(12):3962–3970. doi: 10.1002/art.22204. [DOI] [PubMed] [Google Scholar]
- 12.Johnson SR, Feldman BM, Pope JE, Tomlinson GA. Shifting our thinking about uncommon disease trials: the case of methotrexate in scleroderma. J Rheumatol. 2009;36(2):323–329. doi: 10.3899/jrheum.071169. [DOI] [PubMed] [Google Scholar]
- 13.Pope JE, Bellamy N, Seibold JR, Baron M, Ellman M, Carette S, et al. A randomized, controlled trial of methotrexate versus placebo in early diffuse scleroderma. Arthritis Rheum. 2001;44(6):1351–1358. doi: 10.1002/1529-0131(200106)44:6<1351::AID-ART227>3.0.CO;2-I. [DOI] [PubMed] [Google Scholar]
- 14.Khanna D, Berrocal VJ, Giannini EH, Seibold JR, Merkel PA, Mayes MD, et al. The American College of Rheumatology provisional composite response index for clinical trials in early diffuse cutaneous systemic sclerosis. Arthritis Rheumatol. 2016;68(2):299–311. doi: 10.1002/art.39501. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Johnson SR, Tomlinson GA, Hawker GA, Granton JT, Feldman BM. Propensity score methods for bias reduction in observational studies of treatment effect. Rheum Dis Clin N Am. 2018;44(2):203–213. doi: 10.1016/j.rdc.2018.01.002. [DOI] [PubMed] [Google Scholar]
- 16.Kang EH, Jin Y, Tong AY, Desai RJ, Kim SC. Risk of serious infection among initiators of TNF inhibitors plus methotrexate versus triple therapy for rheumatoid arthritis: a cohort study. Arthritis Care Res (Hoboken). 2020. 10.1002/acr.24038. [DOI] [PubMed]
- 17.Johnson SR, Granton JT, Tomlinson GA, Grosbein HA, Le T, Lee P, et al. Warfarin in systemic sclerosis-associated and idiopathic pulmonary arterial hypertension. A Bayesian approach to evaluating treatment for uncommon disease. J Rheumatol. 2012;39(2):276–285. doi: 10.3899/jrheum.110765. [DOI] [PubMed] [Google Scholar]
- 18.Alhajeri H, Hudson M, Fritzler M, Pope J, Tatibouet S, Markland J, et al. 2013 American College of Rheumatology/European league against rheumatism classification criteria for systemic sclerosis outperform the 1980 criteria: data from the Canadian Scleroderma Research Group. Arthritis Care Res (Hoboken). 2015;67(4):582–587. doi: 10.1002/acr.22451. [DOI] [PubMed] [Google Scholar]
- 19.Khanna D, Furst DE, Hays RD, Park GS, Wong WK, Seibold JR, et al. Minimally important difference in diffuse systemic sclerosis: results from the D-penicillamine study. Ann Rheum Dis. 2006;65(10):1325–1329. doi: 10.1136/ard.2005.050187. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Kafaja S, Clements PJ, Wilhalme H, Tseng CH, Furst DE, Kim GH, et al. Reliability and minimal clinically important differences of forced vital capacity: results from the Scleroderma Lung Studies (SLS-I and SLS-II) Am J Respir Crit Care Med. 2017;197(5):644–652. doi: 10.1164/rccm.201709-1845OC. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Sultan N, Pope JE, Clements PJ, Scleroderma Trials Study G. The health assessment questionnaire (HAQ) is strongly predictive of good outcome in early diffuse scleroderma: results from an analysis of two randomized controlled trials in early diffuse scleroderma. Rheumatology (Oxford) 2004;43(4):472–478. doi: 10.1093/rheumatology/keh070. [DOI] [PubMed] [Google Scholar]
- 22.Steele R, Hudson M, Lo E, Baron M, Canadian Scleroderma Research G. Clinical decision rule to predict the presence of interstitial lung disease in systemic sclerosis. Arthritis Care Res (Hoboken). 2012;64(4):519–524. doi: 10.1002/acr.21583. [DOI] [PubMed] [Google Scholar]
- 23.Baron M, Hudson M, Steele R, Lo E. Validation of the UCLA Scleroderma Clinica Trial Gastrointestinal Tract Instrument version 2.0 for systemic sclerosis. J Rheumatol. 2011;38(9):1925–1930. doi: 10.3899/jrheum.110060. [DOI] [PubMed] [Google Scholar]
- 24.Hsu VM, Moreyra AE, Wilson AC, Shinnar M, Shindler DM, Wilson JE, et al. Assessment of pulmonary arterial hypertension in patients with systemic sclerosis: comparison of noninvasive tests with results of right-heart catheterization. J Rheumatol. 2008;35(3):458–465. [PubMed] [Google Scholar]
- 25.He H, Hu J, He J. Overview of propensity score methods. In: He H, Wu P, Chen D-G, editors. Statistical causal inferences and their applications in public health research. Cham: Springer International Publishing; 2016. pp. 29–48. [Google Scholar]
- 26.Austin PC. Propensity-score matching in the cardiovascular surgery literature from 2004 to 2006: a systematic review and suggestions for improvement. J Thor Cardiovasc Surg. 2007;134(5):1128–35.e3. doi: 10.1016/j.jtcvs.2007.07.021. [DOI] [PubMed] [Google Scholar]
- 27.Anderson J, Caplan L, Yazdany J, Robbins ML, Neogi T, Michaud K, et al. Rheumatoid arthritis disease activity measures: American College of Rheumatology recommendations for use in clinical practice. Arthritis Care Res (Hoboken) 2012;64(5):640–647. doi: 10.1002/acr.21649. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Mikdashi J, Nived O. Measuring disease activity in adults with systemic lupus erythematosus: the challenges of administrative burden and responsiveness to patient concerns in clinical research. Arthritis Res Ther. 2015;17(1):183. doi: 10.1186/s13075-015-0702-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Herrick AL, Pan X, Peytrignet S, Lunt M, Hesselstrand R, Mouthon L, et al. Treatment outcome in early diffuse cutaneous systemic sclerosis: the European Scleroderma Observational Study (ESOS) Ann Rheum Dis. 2017;76(7):1207–1218. doi: 10.1136/annrheumdis-2016-210503. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Valentini G, Paone C, La Montagna G, Chiarolanza I, Menegozzo M, Colutta E, et al. Low-dose intravenous cyclophosphamide in systemic sclerosis: an open prospective efficacy study in patients with early diffuse disease. Scand J Rheumatol. 2006;35(1):35–38. doi: 10.1080/03009740510026896. [DOI] [PubMed] [Google Scholar]
- 31.McSweeney PA, Nash RA, Sullivan KM, Storek J, Crofford LJ, Dansey R, et al. High-dose immunosuppressive therapy for severe systemic sclerosis: initial outcomes. Blood. 2002;100(5):1602–1610. doi: 10.1182/blood.V100.5.1602.h81702001602_1602_1610. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.van Laar JM, Farge D, Sont JK, Naraghi K, Marjanovic Z, Larghero J, et al. Autologous hematopoietic stem cell transplantation vs intravenous pulse cyclophosphamide in diffuse cutaneous systemic sclerosis: a randomized clinical trial. JAMA. 2014;311(24):2490–2498. doi: 10.1001/jama.2014.6368. [DOI] [PubMed] [Google Scholar]
- 33.Sullivan KM, Goldmuntz EA, Keyes-Elstein L, McSweeney PA, Pinckney A, Welch B, et al. Myeloablative autologous stem-cell transplantation for severe scleroderma. N Engl J Med. 2018;378(1):35–47. doi: 10.1056/NEJMoa1703327. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Siegert E, Riemekasten G. Are we too lenient with immunosuppression in severe cases of systemic sclerosis? Rheumatology. 2016;55(10):1914–1916. doi: 10.1093/rheumatology/kew242. [DOI] [PubMed] [Google Scholar]
- 35.Melsens K, De Keyser F, Decuman S, Piette Y, Vandecasteele E, Smith V. Disease activity indices in systemic sclerosis: a systematic literature review. Clin Exp Rheumatol. 2016;34 Suppl 100(5):186–192. [PubMed] [Google Scholar]
- 36.Tay T, Ferdowsi N, Baron M, Stevens W, Hudson M, Proudman SM, et al. Measures of disease status in systemic sclerosis: a systematic review. Semin Arthritis Rheum. 2017;46(4):473–487. doi: 10.1016/j.semarthrit.2016.07.010. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The data that support the findings of this study are available from the CSRG but restrictions apply to the availability of these data, which were used under license for the current study, and so are not publicly available. Data are however available from the authors upon reasonable request and with permission of the CSRG.