Skip to main content
RSNA Journals logoLink to RSNA Journals
. 2024 Feb 6;310(2):e231718. doi: 10.1148/radiol.231718

Improving Prognostication in Pulmonary Hypertension Using AI-quantified Fibrosis and Radiologic Severity Scoring at Baseline CT

Krit Dwivedi 1, Michael Sharkey 1, Liam Delaney 1, Samer Alabed 1, Smitha Rajaram 1, Catherine Hill 1, Christopher Johns 1, Alexander Rothman 1, Michail Mamalakis 1, A A Roger Thompson 1, Jim Wild 1, Robin Condliffe 1, David G Kiely 1, Andrew J Swift 1,
PMCID: PMC10902594  PMID: 38319169

Abstract

Background

There is clinical need to better quantify lung disease severity in pulmonary hypertension (PH), particularly in idiopathic pulmonary arterial hypertension (IPAH) and PH associated with lung disease (PH-LD).

Purpose

To quantify fibrosis on CT pulmonary angiograms using an artificial intelligence (AI) model and to assess whether this approach can be used in combination with radiologic scoring to predict survival.

Materials and Methods

This retrospective multicenter study included adult patients with IPAH or PH-LD who underwent incidental CT imaging between February 2007 and January 2019. Patients were divided into training and test cohorts based on the institution of imaging. The test cohort included imaging examinations performed in 37 external hospitals. Fibrosis was quantified using an established AI model and radiologically scored by radiologists. Multivariable Cox regression adjusted for age, sex, World Health Organization functional class, pulmonary vascular resistance, and diffusing capacity of the lungs for carbon monoxide was performed. The performance of predictive models with or without AI-quantified fibrosis was assessed using the concordance index (C index).

Results

The training and test cohorts included 275 (median age, 68 years [IQR, 60–75 years]; 128 women) and 246 (median age, 65 years [IQR, 51–72 years]; 142 women) patients, respectively. Multivariable analysis showed that AI-quantified percentage of fibrosis was associated with an increased risk of patient mortality in the training cohort (hazard ratio, 1.01 [95% CI: 1.00, 1.02]; P = .04). This finding was validated in the external test cohort (C index, 0.76). The model combining AI-quantified fibrosis and radiologic scoring showed improved performance for predicting patient mortality compared with a model including radiologic scoring alone (C index, 0.67 vs 0.61; P < .001).

Conclusion

Percentage of lung fibrosis quantified on CT pulmonary angiograms by an AI model was associated with increased risk of mortality and showed improved performance for predicting patient survival when used in combination with radiologic severity scoring compared with radiologic scoring alone.

© RSNA, 2024

Supplemental material is available for this article.


graphic file with name radiol.231718.VA.jpg


Summary

Combining artificial intelligence (AI)–quantified lung fibrosis and radiologic severity scoring at baseline CT improves prognostication in pulmonary hypertension; AI detects minor disease not identified by radiologists.

Key Results

  • ■ In this retrospective study of 521 patients with pulmonary hypertension, artificial intelligence (AI)–quantified lung fibrosis at baseline CT was associated with an increased risk of mortality (hazard ratio range, 1.01–1.03; P value range, <.001 to .04).

  • ■ Combining AI and radiologic severity scoring improved predictions of patient survival compared with radiologic scoring alone (concordance index, 0.67 vs 0.61; P < .001).

Introduction

Lung disease severity is an important distinguishing factor between two pulmonary hypertension (PH) phenotypes—pulmonary arterial hypertension (group 1) and PH associated with lung disease (PH-LD) (group 3). In practice, distinguishing between these two groups is challenging in patients with radiologically scored mild or no fibrosis due to overlapping clinical characteristics, particularly in the most common form of pulmonary arterial hypertension, idiopathic pulmonary arterial hypertension (IPAH) (13). Only patients diagnosed with IPAH are eligible for novel targeted therapies that improve survival (4). Chest CT imaging is routinely performed for diagnosis but is not currently used for prognostication (5). Lung disease is typically semiquantitatively scored as none, mild, moderate, or severe, but this scoring has poor reproducibility even among specialist radiologists (6,7).

There is a clinical need to better quantify lung disease in these patients. A recent seminal study by Hoeper et al (3) characterized this problem and identified a new phenotype—IPAH with a lung phenotype—which lies between IPAH and PH-LD and overlaps between radiologically scored no and mild lung disease. In the study by Hoeper et al (3), 71% of patients with IPAH with a lung phenotype were scored as having no fibrosis, and 25% were scored as having mild fibrosis. Currently, patients with mild lung disease are treated but show poorer response to treatment and lower survival (3,8). Advances in artificial intelligence (AI) deep learning approaches have made quantitative analysis of imaging features possible (9). The motivation of this work was to use AI with CT pulmonary angiography (CTPA) imaging performed routinely in these patients to help solve this important clinical problem. The AI model developed for this study provides the respective percentage of lung volume involvement of common radiologic patterns and an output that can be interpreted by the reading radiologist. The transparent patch-based approach used (10) and the model implemented (11) have been described in detail. The purpose of this study was to assess the value of AI-quantified fibrosis in predicting survival, independent of and in combination with radiologic scoring.

Materials and Methods

Study Patients

Adult patients with incidental IPAH and PH-LD diagnosed between February 2007 and January 2019 were prospectively recorded in Sheffield Pulmonary Vascular Disease Unit databases as part of the ASPIRE (Assessing the Spectrum of Pulmonary Hypertension Identified at a Referral Centre) registry, as previously described (3,12). The ASPIRE registry was established as a large clinicoradiologic imaging data set in PH and has been used in multiple studies (3,12). There may be overlap in patients with other unrelated studies given the large registry, but the analysis in this study is unique. For all patients, diagnosis was made at the host tertiary center after multidisciplinary review, and this approach was uniform throughout the study period. The current study is a secondary analysis of this prospectively collected data set. Ethical approval was granted by the institutional review board of Sheffield Teaching Hospitals NHS Foundation Trust and approved by the National Research Ethics Service (16/YH/0352). The need for written informed consent was waived due to the retrospective nature of this study.

The inclusion criterion was baseline CT imaging performed at the time of diagnosis. The exclusion criteria were as follows: patients with no baseline imaging or imaging inappropriate for AI analysis (section thickness > 1.25 mm), incomplete lung imaging or segmentation, non-CTPA imaging, severe breathing artifact, or lobectomy. These eligibility criteria were chosen in an attempt to capture a spectrum of lung disease and PH etiology, regardless of the assigned diagnosis of IPAH or PH-LD, and to minimize bias. Mortality data were obtained from systems linked to the National Health Service Personal Demographics Service, which is updated when a death is registered in the United Kingdom. Patients undergoing lung transplantation were censored at the time of surgery, and mortality data were collected using a census date of May 2019.

CT Image Acquisition, Reporting, and Analysis

CTPA protocol scans were performed in multiple centers on scanners from all major manufacturers with varying technical parameters (Table S1). The AI model was applied to all scans; reports for all scans were written by experienced subspecialty chest radiologists, who radiologically scored fibrosis as none, mild, moderate, or severe as previously described (3,13). All analyses were performed at the patient level with one single corresponding incident CTPA examination. Lung segmentation was performed using an externally validated nnU-net (no new U-Net) model (14). The AI method uses a patch-based five-class DenseNet121 deep learning architecture that classifies parenchyma into regions of ground glass, ground glass with reticulation, emphysema, honeycombing, and normal lung as per the Fleischner Society glossary of terms (15). Fibrosis for this patient cohort was defined as a composite variable summing regions of ground glass with reticulation and honeycombing. The method is described in more detail in Appendix S1, and the model code is available at https://github.com/mrsharkleton/lung_texture_segment.

Statistical Analysis

Survival analysis.—Patients were divided into a training set and a test set based on the institution where imaging was performed. Imaging examinations performed at the host institution composed the training set; all external imaging examinations composed the test set. Multivariable Cox proportional hazards regression was used to determine the association between AI-identified CT parenchymal patterns and survival. Hazard ratios (HRs) correspond to the ratio of two hazard rates (of death) when a continuous variable is increased by one. P < .05 was considered to indicate a statistically significant difference. The multivariable Cox proportional hazards model was adjusted for age, sex, World Health Organization functional class, pulmonary vascular resistance (PVR) (a hemodynamic marker of PH severity), and the diffusing capacity of the lungs for carbon monoxide (Dlco) (a spirometry marker and the current gold standard of lung disease severity assessment in these patients). Age, PVR, and Dlco were stratified using thresholds of 50 years, 5 Wood units (WU), and 45%, respectively, as established in prior studies (3,1618). Model assumptions of linearity and proportional hazards were assessed using scaled Schoenfeld residuals and Martingale residuals. In the training set, locally estimated scatterplot smoothing analysis, with estimated probability of death at 1 year on the y-axis and percentage of fibrosis on the x-axis, was used to derive the threshold for fibrosis corresponding to 20% 1-year mortality. Patients in the test set were grouped by the derived thresholds. Survival estimates from the time of registry were calculated using Kaplan-Meier analyses, and survival curves were compared using the log-rank test.

Added value of AI over visual radiologic scoring alone.—The strength of Cox proportional hazards regression models was used to determine the added predictive value of AI quantification. Model A included visual fibrosis scoring only; model B included visual scoring and AI-quantified percentage of fibrosis. The model concordance index (C-index) was used to assess the prognostic strength of the model: A better model fit is indicated by a higher C index. Both models included the same patients, and a likelihood ratio test was used to assess whether there was a statistically significant difference. The analysis was repeated stratified by assigned diagnosis of IPAH or PH-LD. Prespecified subgroup analysis was performed in all patients scored as having no fibrosis in visual scoring by radiologists. Cox proportional hazards regression was used to determine the association between AI-quantified disease severity and survival.

Analysis was performed with R version 4.1.2 (R Foundation for Statistical Computing). The study sample size and number of events are appropriately powered to detect statistically significant differences. Categorical data are presented as numbers and percentages, and continuous data are presented as medians and IQRs. Two-sample Wilcoxon rank sum tests were used to compare continuous data, and Pearson χ2 tests or Fisher exact tests were used to compare categorical data, between the training and test cohorts.

Results

Patient and Cohort Characteristics

Of 5643 patients in the ASPIRE registry, 521 patients (median age, 67 years [IQR, 25–74 years]; 270 women [52%]) met the eligibility criteria and were included for analysis. Of 1143 patients with a diagnosis of IPAH or PH-LD, 243 were excluded because there was no baseline imaging, 202 because CT section thickness was greater than 1.25 mm, 121 for non-CTPA imaging, 30 for severe breathing artifact, 24 for incomplete or incorrect lung segmentation, and two for lobectomy (Fig 1). There were no failures in AI analysis (Fig 2), and the model gave the same result for scans when repeated.

Figure 1:

Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) diagram shows patient selection. ASPIRE = Assessing the Spectrum of Pulmonary Hypertension Identified at a Referral Centre, CTPA = CT pulmonary angiography, IPAH = idiopathic pulmonary arterial hypertension, PH-LD = pulmonary hypertension associated with lung disease.

Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) diagram shows patient selection. ASPIRE = Assessing the Spectrum of Pulmonary Hypertension Identified at a Referral Centre, CTPA = CT pulmonary angiography, IPAH = idiopathic pulmonary arterial hypertension, PH-LD = pulmonary hypertension associated with lung disease.

Figure 2:

Example outputs from the artificial intelligence (AI) model. The lung parenchyma is automatically segmented and then classified into five lung parenchymal patterns: normal (red), emphysema (teal), honeycombing (yellow), ground glass (green), and ground glass with additional reticulation (blue). This is done for every section of the CT pulmonary angiogram. Example input (A) axial, (B) sagittal, and (C) coronal CT sections (top row) from one patient are presented with the corresponding AI-quantified outputs (bottom row). The AI model output is provided as a Digital Imaging and Communications in Medicine, or DICOM, scan series that can be interrogated and reviewed alongside the original CT scan series.

Example outputs from the artificial intelligence (AI) model. The lung parenchyma is automatically segmented and then classified into five lung parenchymal patterns: normal (red), emphysema (teal), honeycombing (yellow), ground glass (green), and ground glass with additional reticulation (blue). This is done for every section of the CT pulmonary angiogram. Example input (A) axial, (B) sagittal, and (C) coronal CT sections (top row) from one patient are presented with the corresponding AI-quantified outputs (bottom row). The AI model output is provided as a Digital Imaging and Communications in Medicine, or DICOM, scan series that can be interrogated and reviewed alongside the original CT scan series.

The cohort had a wide range of lung disease severity as assessed using pulmonary function testing (median Dlco percent predicted, 31% [IQR, 22%–49%]; median forced vital capacity percent predicted, 90% [IQR, 77%–107%]; median forced expiratory volume in 1 second percent predicted, 75% [IQR, 55%–90%]) and hemodynamic PH disease severity (median PVR, 8.0 WU [IQR, 5.0–11.6 WU]). There were 275 patients in the training cohort and 246 in the external test cohort. Patients in the test cohort were younger (median age, 65 years [IQR, 51–72 years] vs 68 years [IQR, 60–75 years]; P < .001), were more likely to be female (58% vs 46%; P = .01), and had more severe PH (median PVR, 9.4 WU [IQR, 6.4–13.2 WU] vs 7.2 WU [IQR, 4.2–10.8 WU]; P < .001). A greater proportion of patients in the external cohort were diagnosed with IPAH (61% vs 40%; P < .001). There was no significant difference in Dlco percent predicted between groups (median, 32% [IQR, 23%–53%] in training set vs 30% [IQR, 22%–47%] in test set). These data are presented in Table 1, and data availability for each variable is shown in Table S2. The external test group therefore formed an ideal heterogeneous test cohort to assess model generalizability, with differences in demographic and hemodynamic characteristics. There were no demographic differences between the excluded patients and the study cohort (Table S3). CT visual scoring was available in 166 of 246 (67%) patients in the external test cohort. The test set included CTPA imaging examinations from 37 centers on 33 different scanners, as described in Table S1.

Table 1:

Patient Demographic and Clinical Characteristics for All Cohorts

graphic file with name radiol.231718.tbl1.jpg

Survival Analysis

The model assumptions of linearity and proportional hazards were validated and are described in Table S4 and Figure S1. Univariable and multivariable associations between each AI-identified pathologic parenchymal pattern and mortality in the training cohort are presented in Table 2. The median follow-up duration was 1.96 years (IQR, 0.91–4.35 years). The outcome of mortality occurred in 183 of 275 (66%) patients. Two of 275 (0.7%) patients underwent lung transplantation. All AI-quantified pathologic CT patterns were found to be associated with increased risk of patient mortality in univariable analysis—ground glass (HR, 1.02 [95% CI: 1.01, 1.03]; P = .004), ground glass with reticulation (HR, 1.04 [95% CI: 1.02, 1.05]; P < .001), honeycombing (HR, 1.05 [95% CI: 1.03, 1.07]; P < .001), emphysema (HR, 1.01 [95% CI: 1.01, 1.02]; P < .001), and the composite variable fibrosis (HR, 1.03 [95% CI: 1.02, 1.04]; P < .001). Only fibrosis was associated with increased risk of mortality in the multivariable analysis (HR, 1.01 [95% CI: 1.00, 1.02]; P = .04). When this prognostic model was applied to the external test set, it demonstrated good discrimination in predicting mortality (C index, 0.76).

Table 2:

Cox Proportional Hazards Regression Assessing the Association of AI–quantified CT Features and Radiologic Scoring with Mortality Outcomes in the Training Cohort (n = 275)

graphic file with name radiol.231718.tbl2.jpg

Locally estimated scatterplot smoothing, or LOESS, analysis (Fig S2) identified a percentage of fibrosis of 3.43% as the threshold corresponding to 20% 1-year mortality. Kaplan-Meier survival analysis demonstrated that patients with fibrosis above the identified threshold of 3.43% in the test cohort had poorer survival (log-rank test; P < .001) (Fig 3). The estimated survival rates for patients with and without fibrosis above the threshold were 69% versus 88% for 1-year survival, 36% versus 66% for 3-year survival, and 24% versus 56% for 5-year survival, respectively. This 3.43% threshold was a significant predictor of poor survival in IPAH and PH-LD cohorts when analyzed separately (P < .001 for both; Fig S3).

Figure 3:

Graph shows Kaplan-Meier survival curves for patients in the external test cohort (n = 246) stratified into groups by the threshold for significant fibrosis (3.43%) derived in the training cohort (n = 275). Log-rank test P value is shown. Shaded regions represent the CIs.

Graph shows Kaplan-Meier survival curves for patients in the external test cohort (n = 246) stratified into groups by the threshold for significant fibrosis (3.43%) derived in the training cohort (n = 275). Log-rank test P value is shown. Shaded regions represent the CIs.

Additional Prognostic Value of AI Metrics Over Radiologic Scoring Alone

Model B (AI quantification and visual scoring combined) had better predictive strength than Model A (visual scoring only) in the full cohort (C index, 0.67 vs 0.61; P < .001) and when patients were stratified by diagnosis (IPAH: C index, 0.63 vs 0.59, P < .001; PH-LD: C index, 0.65 vs 0.59, P < .001) (Table 3). A total of 300 of 421 patients (71%) were radiologically scored as having no fibrosis. The AI model reported minor levels of fibrosis (1.5%), ground glass (1.8%), ground glass with additional reticulation (0.6%), and honeycombing (0.7%) in these patients (Table 4). Fibrosis (HR, 1.03; P = .004) and honeycombing (HR, 1.08; P < .001) were associated with increased risk of mortality in this subgroup. Example outputs of mild, moderate, and severe fibrosis are shown in Figure 4.

Table 3:

Additional Predictive Value of AI Quantification in Combination with Visual Scoring

graphic file with name radiol.231718.tbl3.jpg

Table 4:

Significance and Extent of AI-quantified Disease in Subgroup of Patients Radiologically Scored as Having No Fibrosis

graphic file with name radiol.231718.tbl4.jpg

Figure 4:

Examples of degrees of fibrosis quantified and radiologically scored. CT pulmonary angiograms (top row) were radiologically scored as having mild (left), moderate (middle), and severe (right) disease; the corresponding output and fibrosis percentage as quantified by the artificial intelligence (AI) model are shown in the bottom row.

Examples of degrees of fibrosis quantified and radiologically scored. CT pulmonary angiograms (top row) were radiologically scored as having mild (left), moderate (middle), and severe (right) disease; the corresponding output and fibrosis percentage as quantified by the artificial intelligence (AI) model are shown in the bottom row.

Discussion

There is clinical need to better quantify lung disease in pulmonary hypertension (PH) due to an overlap in radiologically scored severity between different phenotypes. The first key finding of our study is that artificial intelligence (AI)–quantified fibrosis at CT was prognostic (hazard ratio, 1.01; P = .04) for survival in patients with PH and lung disease. This was independent of demographic characteristics, hemodynamic disease severity (pulmonary vascular resistance), and spirometry (diffusing capacity of the lungs for carbon monoxide), and was externally validated (concordance index [C index], 0.76) in images from 37 centers. The second key finding of our study is that the AI model was sensitive to minor parenchymal differences and, when used in combination with radiologic reporting, provided additional predictive prognostic value, with a C index of 0.67 for AI-quantified fibrosis and radiologic scoring versus 0.61 for radiologic scoring alone (P < .001).

This is the first study to our knowledge to clinically evaluate an AI quantitative CT model for lung disease assessment in PH. Existing work in this domain uses pulmonary function testing (Dlco) or radiologic scoring for lung disease assessment. Pulmonary function tests show variability and low reproducibility and can be insensitive to changes in disease progression (1922). Radiologic scoring is subjective and has inter- and intraobserver variability even among specialist radiologists (6,7). It is also relatively imprecise, being typically scored categorically as none, mild, moderate, or severe. A few continuous visual scoring scales exist, in which radiologists quantify lobar disease to the nearest 5%, but these scales have been limited to the research domain (22). Guidelines for interpretation of fibrosis focus on identification of parenchymal features and not grading or severity assessment (23). Carefully quantifying five different lung parenchymal features is tedious, time-consuming, and impractical in routine clinical reporting, thereby making it an ideal assistive task for AI.

Percentage of fibrosis was found to be an independent prognostic marker (HR, 1.01; P = .04) for mortality even after correcting for hemodynamic disease severity (PVR) and Dlco. The PVR threshold of 5 WU was chosen based on recent studies that demonstrated its utility in predicting mortality in both IPAH and PH-LD (16,18,24). The Dlco threshold of 45% was derived from previous studies that determined the strong prognostic value of this threshold (8,17,2527). Dlco is often the only biomarker used for lung disease severity across multiple studies and is part of the REVEAL 2.0 risk score for pulmonary arterial hypertension risk stratification (3,8,28,29). We derived 3.43% as a clinically applicable threshold for fibrosis severity, which corresponded to 20% 1-year mortality. This threshold was validated in an external test cohort, where between-group differences in survival were found between patients stratified by this threshold. This was also the case for IPAH and PH-LD analyzed separately. We postulate that the additional prognostic value of AI-quantified CT parenchymal patterns compared to Dlco may be because Dlco is a functional marker of gas exchange, whereas CT findings are an anatomic marker of architectural distortion.

Combining AI-quantified fibrosis and radiologic scoring improved prognostication. In patients radiologically scored as having no fibrosis, the AI model identified minor levels of lung disease that were associated with increased risk for mortality. This could be due to a combination of (a) lack of detectability and (b) variability in subjective radiologic opinion regarding whether minor parenchymal changes represent true fibrosis. In our experience, it is not uncommon for different radiologists to differ in their opinions among none, minor, and mild fibrosis. It has been established that there is interobserver variability in the detection of fibrosis, which is magnified at low levels of disease and worse for individual parenchymal patterns than for overall diagnosis (30,31). Our repeatable and interpretable quantitative AI model outputs would likely aid in these cases to improve consensus. The C index of 0.67 for this model in external testing demonstrates good discrimination, particularly considering that this C index value is for disease severity scoring alone. In clinical practice this value will be higher as it will also account for patient demographic characteristics, clinical features, and other imaging.

There is substantial overlap between PH phenotypes, with 71% of patients with IPAH with a lung phenotype being scored as having no fibrosis, and 7% of those with classic IPAH being scored as having mild or moderate fibrosis (3). Mild lung disease can be treated with pulmonary arterial hypertension therapy in the current guidance, but these patients have poorer survival and treatment response compared with those with no lung disease (3,8). However, the prognostic clinical significance of lung disease has not been precisely quantified and has relied on semiquantitative radiologic scoring. The combined approach in our study could enable better phenotyping and identification of patients likely to respond to treatment. Limitations of deep learning approaches include their lack of transparency and their susceptibility to unknown, difficult-to-quantify biases. Our combined approach was designed to put the radiologist at the center of the decision-making process. The model provides an interpretable image mask of the analyzed scan, which can be evaluated and assessed by the radiologist. This enables recognition of errors and verification of model outputs. The proposed AI model use case is as an adjunct to reporting, providing a continuous percentage of lung involvement for individual lung parenchymal patterns.

We acknowledge limitations inherent to the retrospective setting, CTPA imaging, and AI models. First, while all patients were assigned a diagnosis in the specialist tertiary center, there may be some bias due to patients not being appropriately identified in nonspecialist settings. Second, there are data availability and demographic differences between the training and external test cohorts, which could be a source of algorithmic bias. The main reason for exclusion was lack of thin-section CTPA imaging, which is performed less frequently in nonspecialist centers. Third, variations in breath holding and contrast agent administration timing during CT scan acquisition can cause errors in the AI classification. We excluded only patients with severe breathing artifact (30 patients) in this study, and the AI use case proposed is as an adjunct to clinical reporting. The output is interpretable and transparent to the reporting radiologist. Fourth, this AI approach was applied in a single large tertiary center, but we preferentially chose an external test cohort of patients with a range of demographic characteristics, lung disease, and hemodynamic disease severity to validate and generalize our findings. Fifth, our thresholds, although validated externally, might not be generalizable to all institutions and may need to be adjusted for different prevalence rates and patient demographic characteristics.

In conclusion, our study showed that the percentage of lung fibrosis quantified on CT pulmonary angiograms using an artificial intelligence (AI) model was associated with increased risk of mortality in patients with pulmonary hypertension (PH). Furthermore, a prognostic model combining AI-quantified fibrosis and radiologic severity scoring showed improved performance for predicting patient survival compared with a model including radiologic severity scoring alone. Further larger multiregistry prospective studies that investigate the use of such AI models in better characterizing and phenotyping patients with PH and lung disease are warranted.

This research was funded in whole or in part by the Wellcome Trust (grants 222930/Z/21/Z [K.D.], 203914/Z/16/ [4ward North PhD Programme], 223521/Z/21/Z [M.S.], 206632/Z/17/Z [A.R.; Wellcome Trust Clinical Research Career Development Fellowship], and 205188/Z/16/Z [A.J.S]). This study/research is funded by the National Institute for Health and Care Research (NIHR) Sheffield Biomedical Research Centre (NIHR203321). The views expressed are those of the authors and not necessarily those of the NIHR or the Department of Health and Social Care.

Disclosures of conflicts of interest: K.D. Funding for projects not directly related to this work from Janssen Pharmaceuticals and GE HealthCare, funding for educational events from Royal College of Radiologists, and part of AI committee and AI lead for RADIANT trainee group for Royal College of Radiologists. M.S. Salary funded by National Institute for Health and Care Research AI Award AI_AWARD01706. L.D. No relevant relationships. S.A. No relevant relationships. S.R. No relevant relationships. C.H. No relevant relationships. C.J. No relevant relationships. A.R. No relevant relationships. M.M. No relevant relationships. A.A.R.T. Grant funding to institution from British Heart Foundation and National Institute for Health and Care Research and honoraria for lectures and support for conference registration and travel from Janssen-Cilag. J.W. Investigator-led research grant from Medical Research Council. R.C. No relevant relationships. D.G.K. Funding from NIHR Sheffield Biomedical Research Centre; consulting fees from Altavant, GSK, Ferrer, MSD, Gossamer Bio, United Therapeutics, and Liquidia; payment or honoraria for lectures, presentations, speakers bureaus, manuscript writing, or educational events from Acceleron Pharma, Altavant, Ferrer, Gossamer Bio, Janssen, United Therapeutics, and MSD; support for attending meetings and/or travel from Ferrer, Janssen, United Therapeutics, and MSD; participation on a data and safety monitoring board or advisory board for Janssen; UK National Pulmonary Hypertension Audit lead; and member of the Specialised Respiratory Clinical Reference Group for National Health Service England. A.J.S. Funding from British Heart Foundation for cardiac MRI research and research grant and lecture fees from Janssen Pharmaceuticals.

Abbreviations:

AI
artificial intelligence
C index
concordance index
CTPA
CT pulmonary angiography
Dlco
diffusing capacity of the lungs for carbon monoxide
HR
hazard ratio
IPAH
idiopathic pulmonary arterial hypertension
PH
pulmonary hypertension
PH-LD
PH associated with lung disease
PVR
pulmonary vascular resistance
WU
Wood units

References

  • 1. Condliffe R , Kiely DG , Lewis RA . Mild parenchymal lung disease is still lung disease . Eur Respir J 2020. ; 56 ( 5 ): 2003727 . [DOI] [PubMed] [Google Scholar]
  • 2. Olschewski H . The challenge to decide between pulmonary hypertension due to chronic lung disease and PAH with chronic lung disease . Diagnostics (Basel) 2021. ; 11 ( 2 ): 311 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Hoeper MM , Dwivedi K , Pausch C , et al . Phenotyping of idiopathic pulmonary arterial hypertension: a registry analysis . Lancet Respir Med 2022. ; 10 ( 10 ): 937 – 948 . [Published correction appears in Lancet Respir Med 2023;11(2):e15.] [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Humbert M , Kovacs G , Hoeper MM , et al . 2022 ESC/ERS guidelines for the diagnosis and treatment of pulmonary hypertension . Eur Heart J 2022. ; 43 ( 38 ): 3618 – 3731 . [DOI] [PubMed] [Google Scholar]
  • 5. Kiely DG , Levin D , Hassoun P , et al . EXPRESS: statement on imaging and pulmonary hypertension from the Pulmonary Vascular Research Institute (PVRI) . Pulm Circ 2019. ; 9 ( 3 ): 2045894019841990 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Watadani T , Sakai F , Johkoh T , et al . Interobserver variability in the CT assessment of honeycombing in the lungs . Radiology 2013. ; 266 ( 3 ): 936 – 944 . [DOI] [PubMed] [Google Scholar]
  • 7. Widell J , Lidén M . Interobserver variability in high-resolution CT of the lungs . Eur J Radiol Open 2020. ; 7 : 100228 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Lewis RA , Thompson AAR , Billings CG , et al . Mild parenchymal lung disease and/or low diffusion capacity impacts survival and treatment response in patients diagnosed with idiopathic pulmonary arterial hypertension . Eur Respir J 2020. ; 55 ( 6 ): 2000041 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Dwivedi K , Sharkey M , Condliffe R , et al . Pulmonary hypertension in association with lung disease: quantitative CT and artificial intelligence to the rescue? State-of-the-art review . Diagnostics (Basel) 2021. ; 11 ( 4 ): 679 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Mamalakis M , Dwivedi K , Sharkey M , Alabed S , Kiely D , Swift AJ . A transparent artificial intelligence framework to assess lung disease in pulmonary hypertension . Sci Rep 2023. ; 13 ( 1 ): 3812 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Sharkey MJ , Dwivedi K , Alabed S , Swift AJ . Deep learning automated quantification of lung disease in pulmonary hypertension on CT pulmonary angiography: a preliminary clinical study with external validation . arXiv 2303.11130 [preprint] https://arxiv.org/abs/2303.11130. Posted March 20, 2023. Accessed May 5, 2023.
  • 12. Hurdman J , Condliffe R , Elliot CA , et al . Pulmonary hypertension in COPD: results from the ASPIRE registry . Eur Respir J 2013. ; 41 ( 6 ): 1292 – 1301 . [DOI] [PubMed] [Google Scholar]
  • 13. Dwivedi K , Condliffe R , Sharkey M , et al . Computed tomography lung parenchymal descriptions in routine radiological reporting have diagnostic and prognostic utility in patients with idiopathic pulmonary arterial hypertension and pulmonary hypertension associated with lung disease . ERJ Open Res 2022. ; 8 ( 1 ): 00549 – 2021 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Dwivedi K , Sharkey M , Alabed S , Langlotz CP , Swift AJ , Bluethgen C . External validation, radiological evaluation, and development of deep learning automatic lung segmentation in contrast-enhanced chest CT . Eur Radiol doi: 10.1007/s00330-023-10235-9. Published online September 29, 2023. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Hansell DM , Bankier AA , MacMahon H , McLoud TC , Müller NL , Remy J . Fleischner Society: glossary of terms for thoracic imaging . Radiology 2008. ; 246 ( 3 ): 697 – 722 . [DOI] [PubMed] [Google Scholar]
  • 16. Olsson KM , Hoeper MM , Pausch C , et al . Pulmonary vascular resistance predicts mortality in patients with pulmonary hypertension associated with interstitial lung disease: results from the COMPERA registry . Eur Respir J 2021. ; 58 ( 2 ): 2101483 . [DOI] [PubMed] [Google Scholar]
  • 17. Trip P , Nossent EJ , de Man FS , et al . Severely reduced diffusion capacity in idiopathic pulmonary arterial hypertension: patient characteristics and treatment responses . Eur Respir J 2013. ; 42 ( 6 ): 1575 – 1585 . [DOI] [PubMed] [Google Scholar]
  • 18. Zeder K , Avian A , Bachmaier G , et al . Elevated pulmonary vascular resistance predicts mortality in COPD patients . Eur Respir J 2021. ; 58 ( 2 ): 2100944 . [DOI] [PubMed] [Google Scholar]
  • 19. Hruby J , Butler J . Variability of routine pulmonary function tests . Thorax 1975. ; 30 ( 5 ): 548 – 553 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Magnussen H , Vaz Fragoso CA , Miller MR , Brusasco V . Spirometry variability must be critically interpreted before negating a clinical diagnosis of chronic obstructive pulmonary disease . Am J Respir Crit Care Med 2018. ; 197 ( 6 ): 835 – 836 . [DOI] [PubMed] [Google Scholar]
  • 21. Khalil N , Manganas H , Ryerson CJ , et al . Phase 2 clinical trial of PBI-4050 in patients with idiopathic pulmonary fibrosis . Eur Respir J 2019. ; 53 ( 3 ): 1800663 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Jacob J , Bartholmai BJ , Rajagopalan S , et al . Mortality prediction in idiopathic pulmonary fibrosis: evaluation of computer-based CT analysis with conventional severity measures . Eur Respir J 2017. ; 49 ( 1 ): 1601011 . [DOI] [PubMed] [Google Scholar]
  • 23. Hobbs S , Chung JH , Leb J , Kaproth-Joslin K , Lynch DA . Practical imaging interpretation in patients suspected of having idiopathic pulmonary fibrosis: official recommendations from the Radiology Working Group of the Pulmonary Fibrosis Foundation . Radiol Cardiothorac Imaging 2021. ; 3 ( 1 ): e200279 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Kiely DG , Condliffe R . Assessing pulmonary hypertension severity in lung disease is a key step to improving outcomes: embrace resistance and don’t be pressurised to go with the flow . Eur Respir J 2021. ; 58 ( 2 ): 2102008 . [DOI] [PubMed] [Google Scholar]
  • 25. Hoeper MM , Pausch C , Grünig E , et al . Idiopathic pulmonary arterial hypertension phenotypes determined by cluster analysis from the COMPERA registry . J Heart Lung Transplant 2020. ; 39 ( 12 ): 1435 – 1444 . [DOI] [PubMed] [Google Scholar]
  • 26. Hoeper MM , Meyer K , Rademacher J , Fuge J , Welte T , Olsson KM . Diffusion capacity and mortality in patients with pulmonary hypertension due to heart failure with preserved ejection fraction . JACC Heart Fail 2016. ; 4 ( 6 ): 441 – 449 . [DOI] [PubMed] [Google Scholar]
  • 27. Olsson KM , Fuge J , Meyer K , Welte T , Hoeper MM . More on idiopathic pulmonary arterial hypertension with a low diffusing capacity . Eur Respir J 2017. ; 50 ( 2 ): 1700354 . [DOI] [PubMed] [Google Scholar]
  • 28. Benza RL , Gomberg-Maitland M , Elliott CG , et al . Predicting survival in patients with pulmonary arterial hypertension: the REVEAL risk score calculator 2.0 and comparison with ESC/ERS-based risk assessment strategies . Chest 2019. ; 156 ( 2 ): 323 – 337 . [DOI] [PubMed] [Google Scholar]
  • 29. Kovacs G , Agusti A , Barberà JA , et al . Pulmonary vascular involvement in chronic obstructive pulmonary disease. Is there a pulmonary vascular phenotype? Am J Respir Crit Care Med 2018. ; 198 ( 8 ): 1000 – 1011 . [DOI] [PubMed] [Google Scholar]
  • 30. Walsh SLF , Calandriello L , Sverzellati N , Wells AU , Hansell DM ; UIP Observer Consort . Interobserver agreement for the ATS/ERS/JRS/ALAT criteria for a UIP pattern on CT . Thorax 2016. ; 71 ( 1 ): 45 – 51 . [DOI] [PubMed] [Google Scholar]
  • 31. Walsh SLF , Devaraj A , Enghelmayer JI , et al . Role of imaging in progressive-fibrosing interstitial lung diseases . Eur Respir Rev 2018. ; 27 ( 150 ): 180073 . [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Radiology are provided here courtesy of Radiological Society of North America

RESOURCES