Abstract
Objectives
Patients with rheumatoid arthritis (RA) and interstitial lung disease (ILD) have increased mortality compared to the general population and factors capable of predicting RA-ILD long-term clinical outcomes are lacking. In oncology, radiomics allows the quantification of tumour phenotype by analysing the characteristics of medical images. Using specific software, it is possible to segment organs on high-resolution computed tomography (HRCT) images and extract many features that may uncover disease characteristics that are not detected by the naked eye. We aimed to investigate whether features from whole lung radiomic analysis of HRCT may alone predict mortality in RA-ILD patients.
Methods
High-resolution computed tomographies of RA patients from January 2012 to March 2022 were analyzed. The time between the first available HRCT and the last follow-up visit or ILD-related death was recorded. We performed a volumetric analysis in 3D Slicer, automatically segmenting the whole lungs and trachea via the Lung CT Analyzer. A LASSO-Cox model was carried out by considering ILD-related death as the outcome variable and extracting radiomic features as exposure variables.
Results
We retrieved the HRCTs of 30 RA-ILD patients. The median survival time (interquartile range) was 48 months (36–120 months). Thirteen out of 30 (43.33%) patients died during the observation period. Whole line segmentation was fast and reliable. The model included either the median grey level intensity within the whole lung segmentation [high-resolution (HR) 9.35, 95% CI 1.56–55.86] as a positive predictor of death and the 10th percentile of the number of included voxels (HR 0.20, 95% CI 0.05–0.84), the voxel-based pre-processing information (HR 0.23, 95% CI 0.06–0.82) and the flatness (HR 0.42, 95% CI 0.18–0.98), negatively correlating to mortality. The correlation of grey level values to their respective voxels (HR 1.52 95% CI 0.82–2.83) was also retained as a confounder.
Conclusion
Radiomic analysis may predict RA-ILD patients’ mortality and may promote HRCT as a digital biomarker regardless of the clinical characteristics of the disease.
Keywords: radiomics, rheumatoid arthritis-associated interstitial lung disease, high-resolution computed tomography, biomarker, LASSO
Introduction
Patients with rheumatoid arthritis (RA) have decreased survival compared to the general population (1, 2). RA-associated interstitial lung disease (RA-ILD) is a common extra-articular manifestation of RA. The median survival of RA-ILD patients is about 3–7 years, which is markedly reduced compared to RA patients without ILD and the general population (3–5). Detection of RA-ILD varies widely by different research methods. Consequently, the reported prevalence of RA-ILD reflects such high variance, with studies based on chest computed tomography (CT) scans indicating RA-ILD presence in 10–30% of patients. Despite the recent advance with potential diagnostic biomarkers such as the MUC5B promotor variant (6) or sound analysis of vesicular murmur (7), there is a scarcity of factors capable of predicting RA-ILD long-term clinical outcomes. In oncology, radiomics allows for comprehensive tumour phenotype quantification by examining medical images’ characteristics. In brief, using specific software, it is possible to segment organs on CT images to extract a massive number of features that have the potential to uncover disease characteristics that fail to be seen by the naked eye. The hypothesis of radiomics is that the distinctive imaging features between disease forms may help make a prognosis and predict the therapeutic response for various conditions, thus providing valuable information for personalised therapy and patient management (8). We aimed to investigate whether features from whole lung radiomic analysis of high-resolution computed tomography (HRCT) might alone predict mortality in RA-ILD patients.
Materials and methods
We retrieved the consecutive HRCTs of patients affected with RA according to 2010 EULAR/ACR criteria (9), and RA-ILD followed at the Rheumatology departments of two Italian tertiary centres from January 2012 to March 2022. To be included in such retrospective analysis, HRCTs had to have been carried out at one of the Radiology departments of the same centres, and the DICOM files had to have been stored in the respective picture archiving and communication systems (PACS). We recorded the time interval between the first available HRCT and the last follow-up visit or eventual death for physician-reported ILD-related causes on death certificates for each patient. We also recorded clinical and demographic characteristics together with rheumatoid factor (RF) and anti-cyclic citrullinated peptide antibodies (ACPA) status. We excluded patients with known overlapping connective tissue disease or secondary Sjogren’s syndrome. In particular, we considered only HRCT examinations with 0.625–1.25-mm slice thickness and full-inspiration scans from the lung apices to below the costo-phrenic angles. Both centres used the following parameters: tube voltage set at 120 kV; tube current: fixed mAs depending on the patient body weight; pitch: 0.8 or more, adjusted on the seriousness of patient dyspnoea, collimation: 0.6 mm; images were reconstructed from raw data using a slice thickness of 0.6 mm with an index of 0.4 mm using an edge-enhancing algorithm. All images were viewed at a window setting optimised for assessment of the lung parenchyma (width 1,500 HU; level −700 HU).
The study was approved and reviewed by the local Ethical Committee (Biopure registry, IRB Approval n.5940, Azienda Ospedaliera Universitaria di Bari). All patients gave their written informed consent.
Segmentation and radiomic features extraction
We performed a volumetric analysis and visualisation in 3D Slicer1, automatically segmenting the whole lungs and trachea via the Lung CT Analyzer project2. As previously described elsewhere (10), such an extension allows fully reproducible automated whole lungs and trachea segmentation using 13 manually marked points. Three points were placed in axial and coronal views inside the right and the left lung, and one point in the trachea (further details in Supplementary material). For each HRCT, segmentation was repeated twice by a rheumatologist and a radiologist in order to check for radiomic feature stability (see below). Two expert radiologists then reviewed the latter segmentations to confirm that all lung parenchyma had been appropriately isolated.
The PyRadiomics platform was developed for cancer research with the US National Cancer Institute grant 5U24CA194354 (8). It can extract radiomic data from medical imaging loading, pre-process the image and segmentation maps, calculate features using the different feature classes, and return results as continuous variables in CSV format. In a Python 3.9 environment, the PyRadiomics API (version 3.01) was implemented to extract 120 features for segmented regions (see Supplementary material for further technical details). Such features, including skewness and kurtosis, together with more specific texture-related ones, comply with feature definitions as described by the imaging biomarker standardization initiative (IBSI) (8).
Collinearity was checked, and all redundant features were removed (i.e., with a correlation higher than 0.8 as an absolute value, see Supplementary material). We carried out feature z-score standardisation before the next step.
Statistics
Intraclass correlation coefficient (ICC) was used to test radiomic feature stability by comparing the abovementioned segmentations. The survival function was plotted with the Kaplan–Meier estimate together with the at-risk table. Cox survival analysis and least absolute shrinkage and selection operator (LASSO) regression univariable analyses were performed to assess the association of each radiomic feature with death. Values of p < 0.20 were selected as candidate variables. The method of LASSO was used to determine predictors as such procedures demonstrated superior accuracy than stepwise elimination (11). As a proof-of-concept study, demographics and disease characteristics were not included in the regression model to demonstrate the feasibility and investigate the association of features alone with RA patients’ death. Cox-Snell residuals were plotted against the Nelson–Aalen cumulative hazard rate function to test the model’s reliability. Stata 17 (StataCorp, TX, USA), together with Python 3.9 herein invoked with Pystata API, numpy 1.22, pandas 1.4.3, and scikit-learn 1.1.2 libraries, were used on a terminal powered by an Apple™ Silicon M1Max with 64 GB RAM.
Results
We retrieved HRCTs of 30 RA-ILD patients, 11 males (36.67%) with median age [interquartile range (IQR)] of 72 (65–78) years at the instrumental examination. They had established RA with median disease duration (IQR) of 132 (65–278) months. ACPA positivity was found in 25 out of 30 patients (83.33%), whereas RF-positive individuals were found in 18 out of 29 (62.97%). Usual interstitial pneumonia (UIP) was the most frequent finding (18/30, 60%) at HRCT, followed by unclassifiable patterns (8/30, 26, 67%). Non-specific interstitial pneumonia (NSIP) in two patients (6.67%), whereas organising pneumonia (OP) and lymphocytic interstitial pneumonia (LIP) were found in one patient, respectively (3.33%, for both). Thirteen out of 30 patients (43.33%) were on methotrexate at HRCT examination, whereas 19 out of them (63.33%) were treated with biologic agents and two with baricitinib (6.67%). Complete patient characteristics and ILD patterns are shown in Table 1. The mean follow-up time (± standard deviation) was 37.99 ± 29.50 months median survival time (MST—IQR) was 48 months (36–120 months). Thirteen out of 30 (43.33%) patients died during the observation period, and the cause of death was attributed to ILD as judged by the physician. Kaplan–Meier survival function was plotted in Figure 1A.
TABLE 1.
Patient characteristics.
Available observations | ||
Male, n (%) | 30 | 11 (36.67) |
Age at HRCT, years, median (IQR) | 30 | 72 (65–78) |
RA disease duration, month, median (IQR) | 30 | 132 (65–278) |
RF positivity, n (%) | 29 | 18 (62.07) |
ACPA positivity n (%) | 30 | 25 (83.33) |
ILD pattern at HRCT, (n%) | 30 | |
UIP | 18 (60) | |
NSIP | 2 (2.67) | |
LIP | 1 (3.33) | |
OP | 1 (3.33) | |
Unclassifiable | 8 (26.67) | |
Therapy at HRCT, n (%) | 30 | |
MTX | 17 (56.67) | |
Tocilizumab | 7 (23.33) | |
Abatacept | 6 (0.20) | |
Rituximab | 5 (16.66) | |
Anakinra | 1 (3.33) | |
Baricitinib | 2 (6.67) |
ACPA, anti-cyclic citrullinated peptide antibodies; HRCT, high-resolution computed tomography; LIP, lymphocytic interstitial pneumonia; ILD, interstitial lung disease; IQR, interquartile range; MTX, methotrexate; NSIP, non-specific interstitial pneumonia; OP, organising pneumonia; RA, rheumatoid arthritis; RF, rheumatoid factor; UIP, usual interstitial pneumonia.
FIGURE 1.
(A) Survival function displayed using Kaplan–Meier estimate, with at-risk table. Deaths have been reported in parentheses. (B) Cox-Snell residuals plotted against the Nelson–Aalen cumulative hazard rate function to test the reliability of the model. Hazard function followed the 45-degree line very closely except for huge values of time.
The death cause was acute ILD exacerbations in 38.46% of cases (5/13) and pneumonia in 23.07% (3/13). Finally, another 38.46% (5/13) of death certificates and health records reported generic “RA-ILD” as the cause of death.
The automatic segmentation allowed for the comprehensive and precise isolation of RA-ILD lung parenchyma and upper airways coherent with radiologist judgement in all cases at the first attempt (Figure 2). The segmentation procedure took a mean 3.13 ± 2.11 min on average. Such a procedure showed excellent feature stability with ICC = 1, indicating perfect reliability between operators. After checking for collinearity (Supplementary Figure 4), only 22 features were retained (see Supplementary material for the full list).
FIGURE 2.
Automated whole lungs and trachea segmentation using 3D Slicer (http://www.slicer.org) via the Lung CT Analyzer project (https://github.com/rbumm/SlicerLungCTAnalyzer/).
After the LASSO-Cox procedure, the model included the features:
-
•
The median grey level intensity within the whole lung segmentation mask [coded as original_firstorder_median, high-resolution (HR) 9.35, 95% CI 1.56–55.86].
-
•
The 10th percentile of the number of voxels included in the whole lung segmentation mask (coded as original_firstorder_10Percentile HR 0.20, 95% CI 0.05–0.84).
-
•
3D Slicer voxel-based pre-processing information (coded as diagnostics_Imageoriginal_Mean, HR 0.23, 95% CI 0.06–0.82).
-
•
Flatness—the relationship between the largest and smallest principal components in the whole lung mask (coded as original_shape_Flatness, HR 0.42, 95% CI 0.18–0.98).
-
•
Correlation of grey level values to their respective voxels in the Grey level co-occurrence matrix (coded as original_glcm_Correlation, HR 1.52 95% CI 0.82–2.83), retained as a confounder.
We observed that the hazard function followed the 45-degree line very closely except for huge values of time (Figure 1B).
Discussion
Patients with RA-ILD have an increased disease burden, reduced quality of life and physical function, severe respiratory symptoms, and worse RA disease. ILD also leads to substantial healthcare costs and interactions; 72% have an all-cause inpatient admission, and 76% have an all-cause emergency department visit. Finally, as mentioned above, they also experience excess mortality compared to the general population and patients with RA without ILD (1, 2, 12). In this scenario, it is conceivable that improving patient management and follow-up is of utmost importance. Many efforts have been made to improve early ILD diagnosis. Investigating the role of potential perturbations to pulmonary mucosa in airways seems to be an approach with concrete potential. In fact, the MUC5B promotor variant in RA patients has been associated with a threefold increased risk for ILD (6).
Furthermore, an algorithm (Vector) to detect the presence of velcro crackles in pulmonary sounds showed promising results for screening RA patients suspected of ILD and who should be directed to HRCT for the diagnosis (12). But, only a few tools seem to have the potential for driving treatment strategy (13–15), mostly involving the shifts in levels of selected serum proteins, such as CXCL11/I-TAC and matrix metalloproteinase-13, which may not be available in routine clinical practice. Radiomics has been helpful in phenotyping cancers (8, 16, 17). It can quantify a large panel of phenotypic characteristics, such as shape and texture, potentially reflecting biologic properties like intra- and inter-tumour heterogeneities and related distinct treatment responses (8). An advantage of radiomics is the standardised procedure for extraction, relying on open-source libraries but dependent upon the segmentation of the region of interest of medical images, which can be obtained by an expert radiologist or machine learning algorithms.
In this study, we applied radiomics to baseline HRCT of the lungs of RA patients with ILD to search for predictors of mortality. To this end, we used an automated segmentation tool allowing whole lung and trachea segmentation. In literature, such a procedure appeared precise and accounted for high user reliability (10, 18). The same was true when we used this tool on ILD parenchyma in all our patients, despite two different CT machines and slice thickness, enabling us to extract 120 radiomic features for our analysis systematically. We found a model including n.5 radiomic features associated with RA_ILD patients’ death, with good fitting except for large values of the time. Adjusting for original_glcm_Correlation, the radiomic feature original_firstorder_median was positively correlated, whereas original_firstorder_10Percentile, diagnostics_Imageoriginal_Mean and original_shape_Flatness radiomic features were negatively correlated to ILD-related mortality.
Our results confirm that lung imaging has prognostic potential. In this regard, they are somewhat consistent with the report from Oh et al. using quantitative HRCT (QCT) scores to predict mortality in RA-ILD (19). They analyzed a retrospective cohort of 144 RA-ILD patients diagnosed at a single centre between 1999 and 2015. All patients had HRCT performed at RA-ILD diagnosis, and baseline clinical data included autoimmunity status, inflammatory markers, pulmonary function tests, and medication use. To assess the baseline, HRCTs were assessed by an automated quantification system (AQS) that divided each lung image into small regions of interest and scored each region of interest for the presence of reticulation, architectural distortion, ground-glass opacification, and honeycombing. By combining these scores, the AQS generated a quantitative lung fibrosis score (QLF) capable of predicting RA-ILD outcomes but also independently associated with mortality after adjustment for age, baseline erythrocyte sedimentation rate (ESR), and pulmonary function tests (20). Although similar in its fundamentals, the radiomics approach offers several advantages. Radiomic features may be used with genomic data leading to the so-called radiogenomics, which showed potential for both diagnosis and prognosis in oncology research (8). This appears particularly interesting in RA-ILD given the recent findings about MUC5B- related ILD risk (6). Furthermore, the whole lung radiomics does not miss information from apparent healthy parenchyma, contributing to the image and tissue characteristics analysis that a naked-eye approach would never consider. Finally, de novo radiomic feature extraction may be easily standardised and does not rely on algorithm training.
We must acknowledge some weaknesses of our study. First, as the proof-of-concept method, we analyzed a small-sized retrospective cohort. As already mentioned, we did not apply non-linear machine learning methods, which might provide better modelling of radiomic features than linear methods. This preliminary study demonstrated that several radiomics features are predictors of RA-ILD patients’ mortality in the absence of demographics and disease-related characteristics. It is also conceivable that using a large, annotated dataset of radiomics features from HRCT-segmented whole-lung parenchyma, it could be possible to discriminate RA-ILD from interstitial lung involvement of different connective tissue diseases. More extensive studies with internal and external validation on independent cohorts are needed to confirm that a radiomics approach could be adopted in routine clinical practice. Further research is required to enable HRCT to provide digital biomarkers of RA-ILD outcomes.
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Ethics statement
The studies involving human participants were reviewed and approved by the Comitato Etico Interregionale. The patients/participants provided their written informed consent to participate in this study.
Author contributions
VV, AM, and MS designed the study and wrote the manuscript. VV and AS did the data analysis. ML and GC gathered the patient data. FI supervised each step of the whole study. All authors contributed to the article and approved the submitted version.
Footnotes
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmed.2022.1069486/full#supplementary-material
References
- 1.Yoshida K, Lin T-C, Wei M, Malspeis S, Chu SH, Camargo CA, et al. The roles of post-diagnosis accumulation of morbidities and lifestyle changes on excess total and cause-specific mortality risk in rheumatoid arthritis. Arthritis Care Res (Hoboken). (2021) 73:188–98. 10.1002/acr.24120 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Huang S, Kronzer VL, Dellaripa PF, Deane KD, Bolster MB, Nagaraja V, et al. Rheumatoid arthritis-associated interstitial lung disease: current update on prevalence, risk factors, and pharmacologic treatment. Curr Treatm Opt Rheumatol. (2020) 6:337–53. 10.1007/s40674-020-00160-z [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Bongartz T, Nannini C, Medina-Velasquez YF, Achenbach SJ, Crowson CS, Ryu JH, et al. Incidence and mortality of interstitial lung disease in rheumatoid arthritis: a population based study. Arthritis Rheum. (2010) 62:1583–91. 10.1002/art.27405 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Manfredi A, Sebastiani M, Cerri S, Vacchi C, Tonelli R, Della Casa G, et al. Acute exacerbation of interstitial lung diseases secondary to systemic rheumatic diseases: a prospective study and review of the literature. J Thorac Dis. (2019) 11:1621–8. 10.21037/jtd.2019.03.28 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Manfredi A, Cassone G, Luppi F, Atienza-Mateo B, Cavazza A, Sverzellati N, et al. Rheumatoid arthritis related interstitial lung disease. Expert Rev Clin Immunol. (2021) 17:485–97. 10.1080/1744666X.2021.1905524 [DOI] [PubMed] [Google Scholar]
- 6.Juge P-A, Lee JS, Ebstein E, Furukawa H, Dobrinskikh E, Gazal S, et al. MUC5B promoter variant and rheumatoid arthritis with interstitial lung disease. N Engl J Med. (2018) 379:2209–19. 10.1056/NEJMoa1801562 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Manfredi A, Cassone G, Cerri S, Venerito V, Fedele AL, Trevisani M, et al. Diagnostic accuracy of a velcro sound detector (VECTOR) for interstitial lung disease in rheumatoid arthritis patients: the InSPIRAtE validation study (INterStitial pneumonia in rheumatoid ArThritis with an electronic device). BMC Pulm Med. (2019) 19:111. 10.1186/s12890-019-0875-x [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.van Griethuysen JJM, Fedorov A, Parmar C, Hosny A, Aucoin N, Narayan V, et al. Computational radiomics system to decode the radiographic phenotype. Cancer Res. (2017) 77:e104–7. 10.1158/0008-5472.CAN-17-0339 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Aletaha D, Neogi T, Silman AJ, Funovits J, Felson DT, Bingham CO, et al. 2010 Rheumatoid arthritis classification criteria: an American college of rheumatology/European league against rheumatism collaborative initiative. Arthritis Rheum. (2010) 62:2569–81. 10.1002/art.27584 [DOI] [PubMed] [Google Scholar]
- 10.Risoli C, Nicolò M, Colombi D, Moia M, Rapacioli F, Anselmi P, et al. Different lung parenchyma quantification using dissimilar segmentation software: a multi-center study for COVID-19 patients. Diagnostics (Basel). (2022) 12:1501. 10.3390/diagnostics12061501 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Thao LTP, Geskus R. A comparison of model selection methods for prediction in the presence of multiply imputed data. Biom J. (2019) 61:343–56. 10.1002/bimj.201700232 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Manfredi A, Cassone G, Furini F, Gremese E, Venerito V, Atzeni F, et al. Tocilizumab therapy in rheumatoid arthritis with interstitial lung disease: a multicentre retrospective study. Intern Med J. (2020) 50:1085–90. 10.1111/imj.14670 [DOI] [PubMed] [Google Scholar]
- 13.Ekici M, Baytar Y, Kardas RC, Sari A, Akdogan A, Durhan G, et al. Predictors of mortality in rheumatoid arthritis-associated lung disease: a retrospective study on ten years. Joint Bone Spine. (2021) 88:105133. 10.1016/j.jbspin.2021.105133 [DOI] [PubMed] [Google Scholar]
- 14.Chen J, Chen Y, Liu D, Lin Y, Zhu L, Song S, et al. Predictors of long-term prognosis in rheumatoid arthritis-related interstitial lung disease. Sci Rep. (2022) 12:9469. 10.1038/s41598-022-13474-w [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Alevizos MK, Danoff SK, Pappas DA, Lederer DJ, Johnson C, Hoffman EA, et al. Assessing predictors of rheumatoid arthritis-associated interstitial lung disease using quantitative lung densitometry. Rheumatology (Oxford). (2022) 61:2792–804. 10.1093/rheumatology/keab828 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Petrillo A, Fusco R, Di Bernardo E, Petrosino T, Barretta ML, Porto A, et al. Prediction of breast cancer histological outcome by radiomics and artificial intelligence analysis in contrast-enhanced mammography. Cancers (Basel). (2022) 14:2132. 10.3390/cancers14092132 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Arezzo F, Loizzi V, La Forgia D, Moschetta M, Tagliafico AS, Cataldo V, et al. Radiomics analysis in ovarian cancer: a narrative review. Appl Sci. (2021) 11:7833. 10.3390/app11177833 [DOI] [Google Scholar]
- 18.Fedorov A, Beichel R, Kalpathy-Cramer J, Finet J, Fillion-Robin J-C, Pujol S, et al. 3D slicer as an image computing platform for the quantitative imaging network. Magn Reson Imaging. (2012) 30:1323–41. 10.1016/j.mri.2012.05.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Oh JH, Kim GHJ, Cross G, Barnett J, Jacob J, Hong S, et al. Automated quantification system predicts survival in rheumatoid arthritis-associated interstitial lung disease. Rheumatology (Oxford). (2022) 61:4702–10. 10.1093/rheumatology/keac184 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.McDermott G, Sparks JA. Quantitative chest imaging and prediction of mortality in rheumatoid arthritis–associated interstitial lung disease. Rheumatology. (2022) 61:4583–4. 10.1093/rheumatology/keac329 [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.