Skip to main content
American Journal of Respiratory and Critical Care Medicine logoLink to American Journal of Respiratory and Critical Care Medicine
editorial
. 2022 Jun 15;206(7):812–814. doi: 10.1164/rccm.202206-1036ED

Deep Learning–based Classification of Fibrotic Lung Disease: Can Computer Vision See the Future?

Bina Choi 1,2, Samuel Y Ash 1,2
PMCID: PMC9799281  PMID: 35704686

Despite existing diagnostic criteria and guidelines for identifying and classifying fibrotic lung disease such as idiopathic pulmonary fibrosis (IPF) (1, 2), their diagnosis can be challenging. Current guidelines emphasize the role of high-resolution computed tomography (HRCT), and place particular importance on identifying the presence of underling usual interstitial pneumonia (UIP), which suggests a diagnosis of IPF (1). However, this approach is binary: it requires patients be classified based on the predominant pattern on HRCT, while in practice patients may have some evidence of UIP features but a different predominant disease pattern (1, 2). Clinically, current UIP diagnosis also relies on subjective readings of the HRCT that may vary from radiologist to radiologist (3). These issues are of particular concern because of the importance of UIP in identifying patients likely to have faster disease progression and worse prognosis (47). Thus, missed or inaccurate diagnosis has the potential to have significant clinical impact, and there has been great interest in developing tools to improve the detection and classification of UIP on HRCT (3).

Deep learning is a form of machine learning that utilizes multi-layered, or deep, neural networks to learn from complex data such as imaging. Deep learning based tools have proliferated in radiologic research over the past decade and have shown great promise in the analysis of HRCTs for diseases ranging from lung cancer to IPF (810). One of the strengths of deep learning algorithms is that they may capture patterns not seen or ignored by the human eye, potentially improving disease classification and quantification (11).

In this issue of the Journal, Walsh and colleagues (pp. 883–891) report the results of their study using the Systematic Objective Fibrotic Imaging Analysis Algorithm (SOFIA), a previously developed and validated deep convolutional neural network tool, to identify UIP in 516 patients from the Australian IPF registry (12, 13). For each participant’s HRCT, SOFIA generates 500 unique, 4-slice image montages. Then, for each montage, the device calculates the probabilities for each of 4 categories: definite UIP, probability UIP, indeterminate UIP, and not UIP. These probabilities (which sum to 1.0) are then averaged, and the device provides the patient level probability for each category (13). The investigators then used Cox proportional hazards and logistic regression models to determine the relationships between SOFIA UIP probabilities with transplant-free survival and with 12-month disease progression, respectively. Multiple forms of analysis were performed including using predicted probabilities as continuous measures as well as grouped into categories based on the Prospective Investigation of Pulmonary Embolism Diagnosis (PIOPED) approach. Univariate analyses, bivariable analyses that included both SOFIA based information and expert radiologist assessment, and multivariable models that were additionally adjusted for age, gender, radiologist-defined computed tomography disease severity, and other clinical measures of disease severity, were all performed as well.

In general, the authors found that not only did SOFIA UIP probabilities, both as a continuous variable and into grouped into PIOPED categories, predict transplant free survival and disease progression in univariate analyses, but also when evaluated using bivariable analyses that included radiologist defined disease extent. In fact, in those bivariate analyses only the SOFIA based measures were associated with adverse outcomes, not the radiologist defined disease extent. Similar findings were present in multivariable analyses adjusted for total disease extent and clinical variable associated with adverse outcomes in fibrotic lung disease like age and lung function. Importantly, the predictive utility of SOFIA UIP was maintained in subgroup analyses of patients with UIP on HRCT or histopathology versus and in those with other fibrotic patterns.

A particularly interesting finding of this study was the predictive probability of SOFIA in the subset of patients with indeterminate UIP on HRCT. Using the PIOPED based binning strategy, SOFIA reclassified over a quarter of HRCTs classified as indeterminate UIP by expert radiologists as intermediate, high, or pathognomonic probability of UIP, and this reclassification predicted transplant-free survival in the multivariable model, with a hazard ratio of 1.73 (95% confidence interval, 1.40–2.14) in the indeterminate UIP subgroup. The reclassification of a large proportion of indeterminate UIP HRCT cases is perhaps unsurprising when considering that the interobserver agreement between two radiologists for the ATS/ERS/JRS/ALAT (American Thoracic Society, European Respiratory Society, Japanese Respiratory Society, and Latin American Thoracic Society) criteria is only moderate, even among expert thoracic radiologists with over ten years of experience (3). Still, these results highlight the potential utility of a deep learning model to identify subtle fibrosis patterns and improve clinical diagnosis, especially in indeterminate cases that are more challenging to determine.

Despite this study’s many strengths, it has several limitations as well. The number of analyses and comparisons raises the question of multiple testing, and 34.6% of the cohort was receiving antifibrotic therapy, potentially affecting the analysis of transplant-free survival, as patients with more severe disease are more likely to be receiving therapy. Still, the biggest limitations of this work are not unique to this specific study but related to the field of artificial intelligence in medicine more generally. For example, when there are multiple possible algorithms for diagnosing or classifying a disease such as pulmonary fibrosis available, how do we know which to choose? And once one is chosen, who will fund its certification as software as a medical device with regulators, especially when it is unclear who would pay for its clinical use? Finally, who is responsible if the algorithm is wrong and misdiagnoses a patient as having a disease when they do not, or vice versa? Before a deep learning model like SOFIA is brought to the clinical setting, these questions and others need answering. In the meantime, work such as this by Walsh and colleagues demonstrates the potential power of deep learning in medicine and the need to answer these difficult questions so that patients can benefit from the insight artificial intelligence can provide.

Footnotes

Supported by the NIH (T32HL007633) (B.C.) and the NIH (K08HL145118) (S.Y.A.).

Originally Published in Press as DOI: 10.1164/rccm.202206-1036ED on June 15, 2022

Author disclosures are available with the text of this article at www.atsjournals.org.

References

  • 1. Raghu G, Remy-Jardin M, Myers JL, Richeldi L, Ryerson CJ, Lederer DJ, et al. American Thoracic Society, European Respiratory Society, Japanese Respiratory Society, and Latin American Thoracic Society Diagnosis of idiopathic pulmonary fibrosis. An Official ATS/ERS/JRS/ALAT Clinical Practice Guideline. Am J Respir Crit Care Med . 2018;198:e44–e68. doi: 10.1164/rccm.201807-1255ST. [DOI] [PubMed] [Google Scholar]
  • 2. Ryerson CJ, Corte TJ, Lee JS, Richeldi L, Walsh SLF, Myers JL, et al. A standardized diagnostic ontology for fibrotic interstitial lung disease. An International Working Group perspective. Am J Respir Crit Care Med . 2017;196:1249–1254. doi: 10.1164/rccm.201702-0400PP. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Walsh SL, Calandriello L, Sverzellati N, Wells AU, Hansell DM, UIP Observer Consort Interobserver agreement for the ATS/ERS/JRS/ALAT criteria for a UIP pattern on CT. Thorax . 2016;71:45–51. doi: 10.1136/thoraxjnl-2015-207252. [DOI] [PubMed] [Google Scholar]
  • 4. Flaherty KR, Travis WD, Colby TV, Toews GB, Kazerooni EA, Gross BH, et al. Histopathologic variability in usual and nonspecific interstitial pneumonias. Am J Respir Crit Care Med . 2001;164:1722–1727. doi: 10.1164/ajrccm.164.9.2103074. [DOI] [PubMed] [Google Scholar]
  • 5. Monaghan H, Wells AU, Colby TV, du Bois RM, Hansell DM, Nicholson AG. Prognostic implications of histologic patterns in multiple surgical lung biopsies from patients with idiopathic interstitial pneumonias. Chest . 2004;125:522–526. doi: 10.1378/chest.125.2.522. [DOI] [PubMed] [Google Scholar]
  • 6. Kim EJ, Elicker BM, Maldonado F, Webb WR, Ryu JH, Van Uden JH, et al. Usual interstitial pneumonia in rheumatoid arthritis-associated interstitial lung disease. Eur Respir J . 2010;35:1322–1328. doi: 10.1183/09031936.00092309. [DOI] [PubMed] [Google Scholar]
  • 7. Putman RK, Gudmundsson G, Axelsson GT, Hida T, Honda O, Araki T, et al. Imaging patterns are associated with interstitial lung abnormality progression and mortality. Am J Respir Crit Care Med . 2019;200:175–183. doi: 10.1164/rccm.201809-1652OC. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Handa T, Tanizawa K, Oguma T, Uozumi R, Watanabe K, Tanabe N, et al. Novel artificial intelligence-based technology for chest computed tomography analysis of idiopathic pulmonary fibrosis. Ann Am Thorac Soc . 2022;19:399–406. doi: 10.1513/AnnalsATS.202101-044OC. [DOI] [PubMed] [Google Scholar]
  • 9. Bratt A, Williams JM, Liu G, Panda A, Patel PP, Walkoff L, et al. Predicting usual interstitial pneumonia histopathology from chest CT imaging with deep learning. Chest . 2022 doi: 10.1016/j.chest.2022.03.044. [DOI] [PubMed] [Google Scholar]
  • 10. Lakhani P, Sundaram B. Deep learning at chest radiography: automated classification of pulmonary tuberculosis by using convolutional neural networks. Radiology . 2017;284:574–582. doi: 10.1148/radiol.2017162326. [DOI] [PubMed] [Google Scholar]
  • 11. Lee JG, Jun S, Cho YW, Lee H, Kim GB, Seo JB, et al. Deep learning in medical imaging: general overview. Korean J Radiol . 2017;18:570–584. doi: 10.3348/kjr.2017.18.4.570. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Walsh SL, Mackintosh JA, Calandriello L, Silva M, Sverzellati N, Larici AR, et al. Deep learning-based outcome prediction in progressive fibrotic lung disease using high-resolution computed tomography. Am J Respir Crit Care Med . 2022;206:883–891. doi: 10.1164/rccm.202112-2684OC. [DOI] [PubMed] [Google Scholar]
  • 13.Walsh SLF, Calandriello L, Silva M, Sverzellati N. Deep learning for classifying fibrotic lung disease on high-resolution computed tomography: a case-cohort study. Lancet Respir Med. 2018;6:837–845. doi: 10.1016/S2213-2600(18)30286-8. [DOI] [PubMed] [Google Scholar]

Articles from American Journal of Respiratory and Critical Care Medicine are provided here courtesy of American Thoracic Society

RESOURCES