Skip to main content
Annals of the American Thoracic Society logoLink to Annals of the American Thoracic Society
. 2025 Jan 1;22(1):83–92. doi: 10.1513/AnnalsATS.202401-009OC

Computed Tomography Radiomics Features Predict Change in Lung Density and Rate of Emphysema Progression

Pratim Saha 1,2,*,, Sandeep Bodduluri 1,3,4,*,, Arie Nakhmani 1,4, Muhammad F A Chaudhary 1,3, Praneeth R Amudala Puchakayala 1, Venkata Sthanam 1,4, Raul San Jose Estepar 5, Joseph M Reinhardt 6, Chengcui Zhang 2, Surya P Bhatt 1,3,
PMCID: PMC11708762  PMID: 39404745

Abstract

Rationale

Emphysema progression is heterogeneous. Predicting temporal changes in lung density and detecting rapid progressors may facilitate the selection of individuals for targeted therapies.

Objectives

To test whether computed tomography (CT) radiomics can be used to predict changes in lung density and detect rapid progressors.

Methods

We extracted radiomics features from inspiratory chest CT in 4,575 subjects with and without airflow obstruction at enrollment, who completed a follow-up visit at approximately 5 years. We quantified emphysema using adjusted lung density (ALD) and estimated emphysema progression as the annualized change in ALD (ΔALD/yr) between visits. We categorized participants into rapid progressors (>1% ΔALD/yr) and stable disease (≤1% ΔALD/yr). A gradient boosting model was used 1) to predict ALD at 5 years and 2) to identify rapid progressors. Four models using demographics (base clinical model), CT density, radiomics, and combined features (clinical, radiomics, and CT density) were evaluated and tested.

Results

There were 1,773 (38.8%) rapid progressors. For predicting ALD at 5 years in the 20% held-out data, the base model explained 31% of the variance (adjusted R2 = 0.31), whereas R2 was 0.74 for the CT density model, 0.66 for the radiomics-only model, and 0.77 for the combined-features model. For detecting rapid progressors, the base model (area under the receiver operating characteristic curve [AUC], 0.57 [95% confidence interval (CI), 0.53–0.61]) was outperformed by the radiomics-only model (AUC, 0.73 [95% CI, 0.69–0.76]; Δ = 0.15; P < 0.001) and the combined model (AUC, 0.74 [95% CI, 0.71–0.77]; Δ = 0.17; P < 0.001).

Conclusions

Parenchymal and airway radiomics features derived from inspiratory scans can be used to predict temporal changes in lung density and help identify rapid progressors.

Keywords: chronic obstructive pulmonary disease, emphysema, progression, radiomics


Chronic obstructive pulmonary disease (COPD) is a highly prevalent lung disease affecting more than 392 million adults and is the third leading cause of death globally (1). Although its course is variable, COPD is usually progressive. The predictors of lung function trajectories in COPD have been well described (2, 3). The structural changes underlying these trajectories, however, are not well understood. Emphysema, a major structural subset of COPD, is usually progressive, but the rate of progression is heterogeneous (4, 5). There are currently no specific therapies for emphysema, and the first step is to identify individuals who can be targeted in clinical trials of directed therapies to impede progression. The major causes of emphysema, such as cigarette smoking, biomass smoke, and occupational factors, with continued exposure, are also likely associated with its progression. Additional risk factors include advancing age and genetic conditions including alpha-1 antitrypsin deficiency (6).

There are now considerable data to suggest that several intrinsic factors related to alterations in the structure of the airways and lung parenchyma themselves may beget further disease (7, 8). Cross-sectional and longitudinal studies suggest that in some individuals, emphysema follows terminal small airway disease and may be a consequence of air trapping (4, 9). Other studies indicate that emphysema, once initiated, may itself impose mechanical stretch on adjacent normal lung regions and contribute to the further alveolar destruction (7, 8, 10). The location and distribution of emphysema may also affect its progression (11). In addition to lung density changes, which can be easily quantified, emphysema also alters lung tissue organization and texture. Recent advances in image analyses have enabled the quantification of lung density uniformity and contrast and thus an improvement in estimates of tissue texture. The remodeling of parenchyma in COPD does not happen in isolation and is almost always associated with remodeling of the small and medium-size airways, as either a cause or a consequence of the parenchymal changes. We hypothesized that a combination of parenchymal density and texture measures on computed tomography (CT) as well as features associated with airway shape and structure can help predict temporal changes in lung density and also aid the accurate detection of rapid emphysema progressors. We also aimed to extract important CT features predictive of progression, which may provide mechanistic insights.

Methods

Study Population

We analyzed data of participants enrolled in COPDGene (Genetic Epidemiology of COPD), which is a multicenter cohort study. The details of the study design have been described previously (12). Briefly, participants were current or former smokers with smoking histories of at least 10 pack-years between the ages of 45 and 80 years. Those with other lung diseases, such as interstitial lung diseases and bronchiectasis, or with prior lobar excision, active malignancy, and chest radiation were excluded. Spirometry and chest CT were acquired at enrollment and at a second visit approximately 5 years later. Participants underwent prebronchodilator and postbronchodilator spirometry with the administration of 180 μg of albuterol. COPD was defined by a postbronchodilator ratio of forced expiratory volume in 1 second to forced vital capacity of <0.70 (13, 14). The severity of COPD was determined as per the Global Initiative for Chronic Obstructive Lung Disease (GOLD) report (14).

CT Imaging

CT scans of the chest were obtained at full inspiration (total lung capacity) in the supine position. The images were reconstructed at B31f kernel with a slice thickness of 0.75 mm and a reconstruction interval of 0.5 mm. The lungs were segmented using a UNet-based deep learning framework (15), and airway trees were segmented using the imaging software LungQ (Thirona) (16).

Emphysema quantification

We calculated lung density on inspiratory CT scans as the 15th percentile Hounsfield units (HU) plus 1,000, to convert to grams per liter. To account for variance introduced by the volume of acquisition, we quantified emphysema using the adjusted lung density (ALD), which is the 15th percentile density of the lungs, with a correction factor of the actual total lung volume on CT divided by the predicted total lung volume using the MESA (Multi-Ethnic Study of Atherosclerosis) equations derived from a normative population, which account for age, sex, race, height, and body mass index (BMI) (5). Change in emphysema was quantified as the annualized change in ALD (ΔALD) between the two visits. Participants with ΔALD > 1% per year were categorized as rapid emphysema progressors, and those with ΔALD ≤ 1% per year were categorized as having stable disease (17). In sensitivity analyses, we also tested other thresholds of >0.5% and >1.5% annual ΔALD to define rapid progression.

Radiomics

We used the PyRadiomics (version 3.0.1) package, an open-source radiomics quantification platform, to compute features of 1) density, 2) texture, 3) lung shape, and 4) airway shape on the inspiratory chest CT scans (18). The lungs were segmented using a UNet-based algorithm (LungInsight) (15), and radiomics features were extracted from the segmented lungs. Density and texture features were computed for the whole lung, whereas shape features were computed for the individual lung separately. We applied the Laplacian of Gaussian filter to enhance the characterization of the texture features. This edge enhancement filter further highlights subtle textural variations in the lung parenchyma. We extracted the lung shape features from lung segmentation maps. A total of 116 density, texture, and shape features were obtained. The complete list of radiomics features and their definitions are provided in Table E1 in the data supplement.

Density features

The density-feature set included 18 features representing basic statistics of the inspiratory CT density histogram, such as mean, median, skewness, kurtosis, 10th percentile, and 90th percentile.

Texture features

The texture-based feature set included 56 features based on measurements derived from gray-level cooccurrence matrix, gray-level run length matrix, and gray-level size zone matrix such as inverse difference moment, gray-level nonuniformity (GLN), cluster prominence, and entropy.

Lung shape features

The lung shape–based feature set included 28 features derived from the right and left lungs separately. Lung shape features included elongation, flatness, major- and minor-axis lengths, maximum two- and three-dimensional diameters, and surface area.

Airway shape features

The airway shape–based feature set included 14 features derived from the airway tree and included the same shape features as were derived for the lungs.

Statistical analyses

We used a Light Gradient-Boosting Machine (LightGBM) regressor and classifier to point 1) predict the absolute ALD value at Year 5 and point 2) categorize participants into rapid progressors versus those with stable disease. LightGBM is a decision tree–based gradient boosting framework (19). LightGBM grows the decision tree leafwise, where the leaf yields maximum loss. This nature of growing the tree enables the achievement of global minimum loss at high speeds. LightGBM uses gradient-based one-side sampling (GOSS) and exclusive feature bundling techniques. GOSS retains every instance with a large gradient and randomly samples every instance with a small gradient. When calculating the information gain, GOSS inserts a constant multiplier for the data instances with small gradients to account for the effect of the data distribution. Without significantly altering the initial data distribution, GOSS increases its attention on the undertrained examples. Exclusive feature bundling is a near lossless technique for the identification of important features by bundling high-dimensional feature space into a low-dimensional space (19).

We tested four sets of predictors for both regression and classification analyses:

  1. Base clinical model: The base clinical model included six features: age, race, sex, BMI, pack-years of smoking, and current smoking status.

  2. CT density model: Baseline ALD and scanner type were included.

  3. Radiomics model: All 116 radiomics features were included in this model.

  4. Combined model: The base clinical model features, CT density model features, and radiomics features were combined (124 features).

Model Training and Evaluation

Regression

The LightGBM regressor was used to predict ALD at 5 years. The dataset was divided into training (80% of the sample) and held-out (20% of the sample) sets. The model was trained using 10-fold cross-validation on the training dataset and evaluated on the held-out sets separately on each of the four models (base, CT density, radiomics, and combined). Hyperparameter tuning was done on the following parameters of LightGBM: number of estimators, maximum depth of the tree, learning rate, minimum child weight, and number of leaves. We tuned hyperparameters for the four models separately using 10-fold cross-validation and extracted the best values for each of the estimators using tree of Parzen estimators (20, 21). The coefficient of determination (R2) was calculated for each model.

Classification

The LightGBM classifier was used to predict the classification of participants into rapid progressors and those with stable disease. We followed the same approach as in the regression analyses for splitting of the dataset into training and held-out sets for model training and testing. The area under the receiver operating characteristic curve (AUC) was calculated for each model. The DeLong test was used to calculate the confidence interval for AUCs of the training and held-out data. Differences in model discrimination were compared using the DeLong test (22).

Feature importance

We identified the top 10 features for both the regression and classification tasks for the radiomics-only and combined model using LightGBM’s built-in feature importance estimator that assigns a score for each feature on the basis of the feature’s contribution in the decision tree. The importance score reflects the frequency of the feature value applied toward prediction. We normalized all importance scores by the highest score.

Statistical analyses were performed in Python version 3.9 using PyRadiomics version 3.1.0 and LightGBM Python packages (18, 19). A two-sided α value of 0.05 was considered to indicate statistical significance for all analyses.

Results

Participants

Of 5,119 participants who completed 5-year follow-up visits, we included 4,575 subjects, excluding those with missing CT scans (n = 448), those with poor segmentation quality (n = 21), and lifetime nonsmokers (n = 75) (Figure 1). The mean (standard deviation [SD]) age of the cohort was 59.8 (8.6) years. There were 2,318 (49.8%) women, and 1,320 (28.4%) were of African American race. A total of 1,843 (40.3%) subjects had diagnoses of COPD. Of these, 409 (8.9%), 883 (19.3%), 451 (9.8%), and 100 (2.2%) had GOLD severity stages 1 through 4, respectively. There were 2,194 (48.0%) without airflow obstruction (GOLD stage 0), and 538 (11.8%) had preserved ratio impaired spirometry (Table 1) (23).

Figure 1.


Figure 1.

Consolidated Standards of Reporting Trials diagram. COPDGene = Genetic Epidemiology of COPD; CT = computed tomography.

Table 1.

Characteristics of participants at baseline and after 5 years (n = 4,575)

Parameter Visit 1 Visit 2
Age, yr 59.8 (8.6) 65.4 (8.6)
Female sex, n (%) 2,318 (49.8) 2,318 (49.8)
African American race, n (%) 1,320 (28.4) 1,320 (28.4)
Body mass index, kg/m2 29.2 (6.1) 29.1 (6.4)
Smoking, pack-years 42.5 (23.5) 44.0 (23.9)
Current smokers, n (%) 2,013 (46.3) 1626 (37.4)
FEV1, L 2.4 (0.8) 2.1 (0.8)
FEV1, pp 80.5 (22.7) 78.4 (24.5)
FVC, L 3.4 (1.0) 3.1 (0.9)
FVC, pp 89.4 (16.7) 87.2 (17.6)
FEV1:FVC ratio 0.7 (0.1) 0.7 (0.1)
GOLD severity, n (%)    
 PRISm 538 (11.8) 554 (12.1)
 Stage 0 2,194 (48.0) 2,013 (44.0)
 Stage 1 409 (8.9) 439 (9.6)
 Stage 2 883 (19.3) 907 (19.8)
 Stage 3 451 (9.8) 472 (10.3)
 Stage 4 100 (2.2) 190 (4.2)
Pi10, mm 2.3 (0.6) 2.3 (0.6)
Adjusted lung density, g/L 86.3 (24.4) 85.7 (25.6)
Total lung capacity, L 5.5 (1.4) 5.5 (1.4)
Percentage emphysema 5.2 (8.0) 5.6 (9.2)

Definition of abbreviations: FEV1 = forced expiratory volume in 1 second; FVC = forced vital capacity; GOLD = Global Initiative for Chronic Obstructive Lung Disease; Pi10 = square root of the wall area of a theoretical airway with 10-mm luminal perimeter; pp = percentage predicted, PRISm = preserved ratio impaired spirometry.

Data are expressed as mean (SD) unless otherwise specified.

At baseline, ALD ranged from 4.4 to 180.5 g/L, with a mean (SD) of 86.3 (24.4) g/L. The mean (SD) difference between the ALD at baseline and at 5 years was 0.6 (1.3) g/L. A total of 1,773 subjects (38.8%) were identified as rapid emphysema progressors, and 2,802 (61.2%) were identified as having stable disease. Table 2 shows the baseline characteristics of rapid progressors and those with stable disease.

Table 2.

Baseline characteristics for rapid progressors and stable disease (n = 4,575)

Parameter Rapid Stable
Age, yr 60.3 (8.5) 59.5 (8.7)
Female sex, n (%) 854 (48.2) 1428 (51.0)
African American race, n (%) 533 (30.1) 764 (27.3)
Body mass index, kg/m2 28.7 (6.2) 29.5 (6.0)
Smoking, pack-years 45.4 (23.3) 40.7 (23.4)
Current smokers, n (%) 937 (52.8) 1,251 (44.6)
FEV1, L 2.2 (0.9) 2.5 (0.8)
FEV1, pp 74.6 (24.2) 84.2 (20.8)
FVC, L 3.3 (1.0) 3.5 (1.0)
FVC, pp 87.7 (17.8) 90.5 (15.9)
FEV1:FVC ratio 0.6 (0.2) 0.7 (0.1)
GOLD severity, n (%)    
 PRISm 208 (11.7) 330 (11.8)
 Stage 0 641 (36.2) 1,553 (55.4)
 Stage 1 169 (9.5) 240 (8.5)
 Stage 2 421 (23.7) 462 (16.5)
 Stage 3 273 (15.5) 178 (6.4)
 Stage 4 61 (3.4) 39 (1.4)
Pi10, mm 2.4 (0.6) 2.2 (0.6)
ALD, g/L 86.6 (27.1) 84.7 (22.0)
ΔALD per year, g/L 2.9 (1.6) −1.8 (3.2)
Total lung capacity, L 5.6 (1.4) 5.5 (1.4)
Percentage emphysema 6.3 (9.5) 4.5 (6.8)

Definition of abbreviations: ALD = adjusted lung density; FEV1 = forced expiratory volume in 1 second; FVC = forced vital capacity; GOLD = Global Initiative for Chronic Obstructive Lung Disease; Pi10 = square root of the wall area of a theoretical airway with 10-mm luminal perimeter; pp = percentage predicted, PRISm = preserved ratio impaired spirometry.

Data are expressed as mean (SD) unless otherwise specified.

Prediction of ALD at 5 Years

In the training set for the prediction of ALD at 5 years (80% of the cohort), the base clinical model explained 32% variance (adjusted R2 = 0.32). R2 was 0.75 for the CT density model and 0.66 for the radiomics-only model. When radiomics were combined with the base clinical and CT density model, R2 was 0.78. In the held-out set (20% of the cohort), the base clinical model explained 31% of the variance (adjusted R2 = 0.31, Pearson correlation = 0.56) whereas R2 was 0.74 (Pearson correlation = 0.86) for the CT density model, 0.66 (Pearson correlation = 0.84) for the radiomics-only model, and 0.77 (Pearson correlation = 0.90) for the combined-features model. A scatterplot and Bland-Altman plot showing the relation between ground truth and predicted ALD at 5 years for the four models are shown in Figures E1–E4.

Figure 2 shows the top predictors of ALD at 5 years in each of the models. In the base clinical model, BMI was the best performing feature, followed by age and pack-years of smoking. In the radiomics model, the 10th percentile of CT density histogram was the best performing feature, followed by GLN of intensity values in the CT image and the ratio of surface area to volume of the airways. In the combined model, baseline ALD was the best performing feature, followed by 10th percentile of the CT density histogram and age.

Figure 2.


Figure 2.

Top features for predicting adjusted lung density at 5 years. Results are shown for the (A) base clinical model, (B) radiomics model, and (C) combined model. The x-axis represents the importance score of each feature generated by Light Gradient-Boosting Machine, and the corresponding feature names are shown along the y-axis. 10th percentile = 10th percentile of computed tomography density histogram; Correlation = correlation of gray-level values in lung parenchyma; Maximum = maximum gray-level intensity; Run entropy = run entropy of gray-level runs of parenchymal texture; Sphericity = roundness of the shape of emphysematous regions.

Classification of Rapid Progressors versus Stable Disease

In the training set, for classifying rapid progression versus stable disease, the AUC was 0.58 (95% confidence interval [CI], 0.57–0.60) for the base clinical model, 0.61 (95% CI, 0.60–0.62) for the CT density model, 0.74 (95% CI, 0.73–0.75) for the radiomics model, and 0.75 (95% CI, 0.74–0.76) for the combined model. In the held-out dataset (20% of the cohort), 355 (39%) and 560 (61%) participants were identified as rapid progressors and subjects with stable disease, respectively. For the base clinical model, the AUC was 0.57 (95% CI, 0.53–0.61). The positive and negative likelihood ratios for the base clinical model were 1.97 and 0.98, respectively. For the CT density model, the AUC was 0.60 (95% CI, 0.56–0.64). The positive and negative likelihood ratios for the CT density model were 1.58 and 0.88. For the radiomics model, the AUC was 0.73 (95% CI, 0.69–0.76). The positive and negative likelihood ratios for the radiomics model were 2.81 and 0.63, respectively. For the combined model, the AUC was 0.74 (95% CI, 0.71–0.77). The positive and negative likelihood ratios for the combined model were 2.71 and 0.62, respectively. Compared with the base clinical model (AUC, 0.57 [95% CI, 0.53–0.61]), the discriminative accuracy was higher for the CT density (AUC, 0.60 [95% CI, 0.56–0.64]; Δ = 0.03; P = 0.27), radiomics-only (AUC, 0.73 [95% CI, 0.69–0.76]; Δ = 0.15; P < 0.001), and combined (AUC, 0.74 [95% CI, 0.71–0.77]; Δ = 0.17; P < 0.001) models.

The discriminative accuracy worsened with increasing disease severity in the base clinical model (AUCs of 0.56, 0.67, 0.62, and 0.53 for GOLD stages 0, 1, 2, and combined 3 and 4, respectively) and CT density model (AUCs of 0.62, 0.72, 0.52, and 0.50, respectively). The AUCs for the radiomics-only (0.70, 0.73, 0.73, and 0.78, respectively) and combined (0.71, 0.80, 0.72, and 0.74, respectively) models were consistently better than those of the base clinical and CT density models across all GOLD stages.

Figure 3 shows the top features of each model. In the base clinical model, pack-years of smoking was the best performing feature. In the radiomics model, the total energy from gray-level cooccurrence matrix–based texture features was the best performing feature, followed by cluster prominence, and sphericity of the left lung. In the combined model, baseline ALD was the best performing feature, followed by pack-years of smoking and cluster prominence. Figure 4 shows the receiver operating characteristic curves for the four models for the classification of rapid progressors versus stable disease.

Figure 3.


Figure 3.

Top features for identifying rapid progressors versus stable disease. Results are shown for the (A) base clinical model, (B) radiomics model, and (C) combined model. The x-axis represents the importance score of each feature generated by Light Gradient-Boosting Machine, and the corresponding feature names are shown along the y-axis. Elongation = extent of deviation from being circular; Size zone non-uniformity = variability in the volumes of differently sized regions within lungs; Total energy = total energy of parenchymal texture.

Figure 4.


Figure 4.

Receiver operating characteristic (ROC) curve showing the discriminative accuracy for the detection of rapid progressors versus subjects with stable disease. Results are shown for four models on the 20% held-out test data. (A) The blue line represents the ROC curve for the base clinical model, (B) the orange line represents the CT density model, (C) the green line shows the ROC curve for the radiomics model, and (D) the red line represents the ROC curve for the combined model. CT = computed tomography.

Sensitivity analysis

Radiomics features were extracted from CT images that were rotated 45° and 90° to evaluate model robustness with changes in CT acquisition protocol and patient positioning. Compared with the discriminative accuracy of the radiomics feature set derived from original images (AUC, 0.74), the AUC was 0.71 for radiomics features extracted from CT images rotated by 45° and 0.73 when images were rotated by 90°. Similarly, for predicting ALD at 5 years, an R2 value of 0.66 was achieved using original CT image, and R2 values of 0.70 and 0.71 were achieved after rotating CT image by 45° and 90° respectively. When rapid progression was defined by >0.5% and >1.5% per year ΔALD, the AUCs for the radiomics model were 0.73 (95% CI, 0.70–0.76) and 0.74 (95% CI, 0.70–0.77), respectively. Performance of all models using these thresholds for rapid progression is shown in Table E2.

Discussion

In a cohort of current and former smokers at high risk for disease progression, we demonstrated that radiomics from inspiratory scans alone can be used to predict the progression of emphysema over a 5-year period and to detect rapid progressors. The radiomics features alone had discriminative accuracy that was similar to that of a model that included additional demographics and CT density features. These findings have significant implications for the selection of patients for clinical trials targeting emphysema progression.

There are currently no known treatments that affect structural disease progression in COPD. Several large multicenter clinical trials of pharmacologic therapies have shown an improvement in lung function that is sustained over the period of the trials (2426). There is no definitive evidence that existing medications decrease the rate of decline of lung function. Clinical trials of lung function decline are limited by the need for large sample sizes given the wide SD of spirometric measures. Spirometry is also nonspecific for the underlying structural processes of emphysema and airway remodeling. These factors have impeded the design and conduct of clinical trials targeting disease progression in COPD.

Recently, there has been more focus on targeting the structural disease underlying COPD using quantitative measures on chest imaging. These measures have less variance than spirometry, and hence studies targeting a change in these parameters are likely to need significantly lower sample sizes (27). This approach has been used in a subset of patients with COPD who have alpha-1 antitrypsin deficiency in which trials of augmentation with intravenous alpha-1 antitrypsin required a considerably smaller sample size to show meaningful effects of the medication (28, 29). However, there was considerable heterogeneity in the response to treatment in these trials. A recent placebo-controlled randomized trial of losartan, an angiotensin receptor blocker, did not result in slowing the progression of emphysema in those with mild to moderate disease (30). These data suggest that a more precision-based approach by targeting individuals at high risk for emphysema progression may result in the need for small sample sizes and a higher chance of impeding the rate of progression.

To enhance the likelihood of targeting individuals at high risk of progression of disease, risk predictors at baseline are needed. Data from the ECLIPSE (Evaluation of COPD Longitudinally to Identify Predictive Surrogate Endpoints) study suggest that blood concentrations of the biomarkers SP-D (surfactant protein D) and sRAGE (soluble receptor for advanced glycation end product) are both associated with emphysema progression, but no cutoffs were determined for prediction of disease progression (31). We improve on several prior models that have used a combination of clinical and imaging measures. Chen and colleagues used propensity-adjusted models using 27 clinical and imaging variables to define disease axes, which were then used to predict emphysema progression (32). These models explained approximately 42% of the variance in emphysema progression (32). Several variables used in this model, such as age of smoking initiation and 6-minute-walk distance, are not readily available in the electronic medical records and need to be prospectively collected. El Kaddouri and colleagues demonstrated that the pattern of emphysema on a semiquantitative scale on baseline scans (mild, moderate, and confluent centrilobular and paraseptal) is associated with emphysema progression but did not report whether these patterns can be used to discriminate between rapid progression and stable disease for a given individual (33). Furthermore, these patterns of emphysema often coexist, and hence these patterns may not accurately identify rapid progressors. We improve on these models by explaining a significantly larger proportion of the variance in emphysema progression and also by estimating the probability of rapid progression.

We identified several radiomic features on chest CT that were consistently associated with the degree of emphysema at 5 years and with the identification of rapid progressors, which add mechanistic insights into emphysema progression. The top features in the radiomics model were the 10th percentile of lung density histogram for the prediction of ALD at 5 years and total energy of parenchymal texture for the identification of rapid progressors. Tenth percentile represents the density (HU) value below which 10% of the lung voxels reside. The closer this value is to −1,000 HU (air), the higher the probability of emphysematous air trapping within the lung. An image’s energy is a metric for homogeneous patterns. A higher energy suggests that the image has more instances of intensity value pairings that are close to one another at higher frequencies. Total energy is calculated by multiplying the value of the energy feature by the volume of the voxel in cubic millimeters. A higher energy value of parenchyma represents more variations and heterogeneity in the density profile of a given region of interest. We, and others, have shown that the presence of emphysema begets more emphysema (7, 8). This is most likely by way of mechanical transduction. Periods of increased mechanical stress, such as during acute exacerbations, are associated with faster progression of emphysema (10). Other top features included aspects related to shape of the lungs that may be related to the remodeling associated with air trapping and hyperinflation noted with progressive emphysema. The ratio of airway surface area to volume was also noted to be an important feature, a measure we recently described as a useful metric for disease prognostication (34). In addition, GLN representing subtle texture changes associated with emphysema was also identified as a top feature in radiomics model. GLN reflects the distribution of intensities/gray levels in the region of interest. A higher GLN value represents how intensities in a texture deviate from a homogeneous or smooth textural pattern.

Strengths and Limitations

Our study has several strengths. The COPDGene study is a cohort of well-phenotyped participants that included both healthy control subjects and smokers with a wide spectrum of disease severity. Spirometry and CT images were gathered using a standard protocol and subject to stringent quality control. The robustness of the results was enhanced by running the models with 10-fold cross-validation and the validation of results in a separate held-out dataset. In sensitivity analyses, we rotated the input images by 45° and 90° and regenerated radiomics features. The performance of the models did not change significantly with rotation, supporting the robustness of the model to changes in model acquisition and patient positioning that may be encountered in clinical practice. We also note that in contrast to the clinical and CT density models, the performance of the radiomics models does not decrease with increasing disease severity, perhaps because of the inclusion of additional features of airway shape which may also be affected with increasing disease severity. The study also has a few limitations. The cohort was composed mostly of current and former smokers. This limits generalizability to a nonsmoking population, but smokers are at high risk for emphysema progression. We did not include any individuals with other coexisting lung diseases, which may alter the performance of the prediction models. More research is needed to validate these algorithms in nonsmokers as well as in the presence of coexisting chronic lung diseases. All parenchymal measures can be affected by the volume of acquisition. In COPDGene, breath-hold was not spirometry gated, but all participants were coached to inhale to total lung capacity, reflecting clinical practice. We used an arbitrary threshold of 1% ΔALD to define rapid progressors, but our models to predict ALD at 5 years are based on estimation of ALD as a continuous parameter. The 1% threshold has been used previously (17). Nonetheless, we conducted additional sensitivity analyses for >0.5% and >1.5% annual ΔALD, and the results were comparable with the primary results.

Conclusions

We demonstrate that lung radiomics can be used to predict lung density at approximately 5 years and to identify rapid progressors. Identifying rapid progressors is critical for the enrichment of future clinical trials targeting emphysema progression with those who are most likely to progress.

Supplemental Materials

ONLINE DATA SUPLLEMENT
DOI: 10.1513/AnnalsATS.202401-009OC

Footnotes

Supported by National Heart, Lung, and Blood Institute grants R01 HL151421 (S.P.B., A.N.), UH3HL155806 (S.P.B.), U01 HL089897, and U01 HL089856 and National Institutes of Health grant K01HL163249 (SB). COPDGene is also supported by the COPD Foundation through contributions made to an industry advisory board composed of AstraZeneca, Boehringer Ingelheim, Genentech, GlaxoSmithKline, Novartis, Pfizer, Siemens, and Sunovion.

Author Contributions: Study concept and design: P.S., S.B., and S.P.B. Acquisition, analysis, or interpretation of data: all authors. Drafting of the manuscript: P.S., S.B., and S.P.B. Critical revision of the manuscript for important intellectual content: all authors. Statistical analysis: P.S. and S.B. P.S., S.B., and S.P.B. had full access to data included in the study and take responsibility for the integrity of the data and the accuracy of the data analysis.

This article has a data supplement, which is accessible at the Supplements tab.

Author disclosures are available with the text of this article at www.atsjournals.org.

References

  • 1. Adeloye D, Song P, Zhu Y, Campbell H, Sheikh A, Rudan I, et al. NIHR RESPIRE Global Respiratory Health Unit Global, regional, and national prevalence of, and risk factors for, chronic obstructive pulmonary disease (COPD) in 2019: a systematic review and modelling analysis. Lancet Respir Med . 2022;10:447–458. doi: 10.1016/S2213-2600(21)00511-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Bui DS, Lodge CJ, Burgess JA, Lowe AJ, Perret J, Bui MQ, et al. Childhood predictors of lung function trajectories and future COPD risk: a prospective cohort study from the first to the sixth decade of life. Lancet Respir Med . 2018;6:535–544. doi: 10.1016/S2213-2600(18)30100-0. [DOI] [PubMed] [Google Scholar]
  • 3. Lange P, Celli B, Agustí A, Boje Jensen G, Divo M, Faner R, et al. Lung-function trajectories leading to chronic obstructive pulmonary disease. N Engl J Med . 2015;373:111–122. doi: 10.1056/NEJMoa1411532. [DOI] [PubMed] [Google Scholar]
  • 4. Pompe E, Moore CM, Hoesein FAM, de Jong PA, Charbonnier J-P, Han MK, et al. COPDGene investigators Progression of emphysema and small airways disease in cigarette smokers. Chronic Obstr Pulm Dis . 2021;8:198–212. doi: 10.15326/jcopdf.2020.0140. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. Pompe E, Strand M, van Rikxoort EM, Hoffman EA, Barr RG, Charbonnier JP, et al. COPDGene Investigators Five-year progression of emphysema and air trapping at CT in smokers with and those without chronic obstructive pulmonary disease: results from the COPDGene study. Radiology . 2020;295:218–226. doi: 10.1148/radiol.2020191429. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Petrache I, Hajjar J, Campos M. Safety and efficacy of alpha-1-antitrypsin augmentation therapy in the treatment of patients with alpha-1-antitrypsin deficiency. Biologics . 2009;3:193–204. doi: 10.2147/btt.2009.3088. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Ritter MC, Jesudason R, Majumdar A, Stamenović D, Buczek-Thomas JA, Stone PJ, et al. A zipper network model of the failure mechanics of extracellular matrices. Proc Natl Acad Sci U S A . 2009;106:1081–1086. doi: 10.1073/pnas.0808414106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Bhatt SP, Bodduluri S, Hoffman EA, Newell JD, Jr, Sieren JC, Dransfield MT, et al. COPDGene Investigators Computed tomography measure of lung at risk and lung function decline in chronic obstructive pulmonary disease. Am J Respir Crit Care Med . 2017;196:569–576. doi: 10.1164/rccm.201701-0050OC. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. McDonough JE, Yuan R, Suzuki M, Seyednejad N, Elliott WM, Sanchez PG, et al. Small-airway obstruction and emphysema in chronic obstructive pulmonary disease. N Engl J Med . 2011;365:1567–1575. doi: 10.1056/NEJMoa1106955. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Bhatt SP, Bodduluri S, Dransfield MT, Reinhardt JM, Crapo JD, Silverman EK, et al. Acute exacerbations are associated with progression of emphysema. Ann Am Thorac Soc . 2022;19:2108–2111. doi: 10.1513/AnnalsATS.202112-1385RL. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Boueiz A, Chang Y, Cho MH, Washko GR, Estépar RSJ, Bowler RP, et al. COPDGene Investigators Lobar emphysema distribution is associated with 5-year radiological disease progression. Chest . 2018;153:65–76. doi: 10.1016/j.chest.2017.09.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Regan EA, Hokanson JE, Murphy JR, Make B, Lynch DA, Beaty TH, et al. Genetic epidemiology of COPD (COPDGene) study design. COPD . 2011;7:32–43. doi: 10.3109/15412550903499522. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Bhatt SP, Balte PP, Schwartz JE, Cassano PA, Couper D, Jacobs DR, et al. Discriminative accuracy of FEV1: FVC thresholds for COPD-related hospitalization and mortality. JAMA . 2019;321:2438–2447. doi: 10.1001/jama.2019.7233. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Agustí A, Celli BR, Criner GJ, Halpin D, Anzueto A, Barnes P, et al. Global Initiative for Chronic Obstructive Lung Disease 2023 report: GOLD executive summary. Am J Respir Crit Care Med . 2023;207:819–837. doi: 10.1164/rccm.202301-0106PP. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Hofmanninger J, Prayer F, Pan J, Röhrich S, Prosch H, Langs G. Automatic lung segmentation in routine imaging is primarily a data diversity problem, not a methodology problem. Eur Radiol Exp . 2020;4 doi: 10.1186/s41747-020-00173-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Charbonnier J-P, Pompe E, Moore C, Humphries S, van Ginneken B, Make B, et al. COPDGene investigators Airway wall thickening on CT: relation to smoking status and severity of COPD. Respir Med . 2019;146:36–41. doi: 10.1016/j.rmed.2018.11.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Gerard SE, Dougherty TM, Nagpal P, Jin D, Han MK, Newell JD, Jr,, et al. SPIROMICS Research Group Vessel and airway characteristics in one-year CT-defined rapid emphysema progression: SPIROMICS. Ann Am Thorac Soc . 2024;21:1022–1033. doi: 10.1513/AnnalsATS.202304-383OC. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18. Van Griethuysen JJ, Fedorov A, Parmar C, Hosny A, Aucoin N, Narayan V, et al. Computational radiomics system to decode the radiographic phenotype. Cancer Res . 2017;77:e104–e107. doi: 10.1158/0008-5472.CAN-17-0339. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Ke G, Meng Q, Finley T, Wang T, Chen W, Ma W, et al. In: Advances in neural information processing systems 30 (NIPS 2017) Guyon I, von Luxburg U, Bengio S, Wallach H, Fergus R, Vishwanathan S, et al., editors. La Jolla, CA: Neural Information Processing Systems Foundation; 2017. LightGBM: a highly efficient gradient boosting decision tree. [Google Scholar]
  • 20.Bergstra J, Bardenet R, Bengio Y, Kégl B. In: Advances in neural information processing systems 24 (NIPS 2011) Shawe-Taylor J, Zemel R, Bartlett P, Pereira F, Weinberger KQ, editors. La Jolla, CA: Neural Information Processing Systems Foundation; 2011. Algorithms for hyper-parameter optimization. [Google Scholar]
  • 21. Bergstra J, Yamins D, Cox D. Proceedings of the 30th International Conference on Machine Learning. PMLR. 2013;28:115–123. [Google Scholar]
  • 22. DeLong ER, DeLong DM, Clarke-Pearson DL. Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics . 1988;44:837–845. [PubMed] [Google Scholar]
  • 23. Wan ES, Castaldi PJ, Cho MH, Hokanson JE, Regan EA, Make BJ, et al. COPDGene Investigators Epidemiology, genetics, and subtyping of preserved ratio impaired spirometry (PRISm) in COPDGene. Respir Res . 2014;15 doi: 10.1186/s12931-014-0089-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Tashkin DP, Li N, Halpin D, Kleerup E, Decramer M, Celli B, et al. Annual rates of change in pre-vs. post-bronchodilator FEV1 and FVC over 4 years in moderate to very severe COPD. Respir Med . 2013;107:1904–1911. doi: 10.1016/j.rmed.2013.08.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Lipson DA, Barnhart F, Brealey N, Brooks J, Criner GJ, Day NC, et al. IMPACT Investigators Once-daily single-inhaler triple versus dual therapy in patients with COPD. N Engl J Med . 2018;378:1671–1680. doi: 10.1056/NEJMoa1713901. [DOI] [PubMed] [Google Scholar]
  • 26. Bhatt SP, Rabe KF, Hanania NA, Vogelmeier CF, Cole J, Bafadhel M, et al. BOREAS Investigators Dupilumab for COPD with type 2 inflammation indicated by eosinophil counts. N Engl J Med . 2023;389:205–214. doi: 10.1056/NEJMoa2303951. [DOI] [PubMed] [Google Scholar]
  • 27. Martinez FJ, Agusti A, Celli BR, Han MK, Allinson JP, Bhatt SP, et al. Treatment trials in young patients with chronic obstructive pulmonary disease and pre–chronic obstructive pulmonary disease patients: time to move forward. Am J Respir Crit Care Med . 2022;205:275–287. doi: 10.1164/rccm.202107-1663SO. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28. Dirksen A, Piitulainen E, Parr D, Deng C, Wencker M, Shaker S, et al. Exploring the role of CT densitometry: a randomised study of augmentation therapy in α1-antitrypsin deficiency. Eur Respir J . 2009;33:1345–1353. doi: 10.1183/09031936.00159408. [DOI] [PubMed] [Google Scholar]
  • 29. Chapman KR, Burdon JG, Piitulainen E, Sandhaus RA, Seersholm N, Stocks JM, et al. RAPID Trial Study Group Intravenous augmentation treatment and lung density in severe α1 antitrypsin deficiency (RAPID): a randomised, double-blind, placebo-controlled trial. Lancet . 2015;386:360–368. doi: 10.1016/S0140-6736(15)60860-1. [DOI] [PubMed] [Google Scholar]
  • 30. Wise RA, Holbrook JT, Brown RH, Criner GJ, Dransfield MT, He J, et al. Clinical trial of losartan for pulmonary emphysema: Pulmonary Trials Cooperative Losartan Effects on Emphysema Progression Clinical Trial. Am J Respir Crit Care Med . 2022;206:838–845. doi: 10.1164/rccm.202201-0206OC. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31. Coxson HO, Dirksen A, Edwards LD, Yates JC, Agusti A, Bakke P, et al. Evaluation of COPD Longitudinally to Identify Predictive Surrogate Endpoints (ECLIPSE) Investigators The presence and progression of emphysema in COPD as determined by CT scanning and biomarker expression: a prospective analysis from the ECLIPSE study. Lancet Respir Med . 2013;1:129–136. doi: 10.1016/S2213-2600(13)70006-7. [DOI] [PubMed] [Google Scholar]
  • 32. Chen J, Cho M, Silverman EK, Hokanson JE, Kinney GL, Crapo JD, et al. Turning subtypes into disease axes to improve prediction of COPD progression. Thorax . 2019;74:906–909. doi: 10.1136/thoraxjnl-2018-213005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33. El Kaddouri B, Strand MJ, Baraghoshi D, Humphries SM, Charbonnier J-P, van Rikxoort EM, et al. Fleischner Society visual emphysema CT patterns help predict progression of emphysema in current and former smokers: results from the COPDGene study. Radiology . 2021;298:441–449. doi: 10.1148/radiol.2020200563. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34. Bodduluri S, Kizhakke Puliyakote A, Nakhmani A, Charbonnier J-P, Reinhardt JM, Bhatt SP. Computed tomography–based airway surface area–to–volume ratio for phenotyping airway remodeling in chronic obstructive pulmonary disease. Am J Respir Crit Care Med . 2021;203:185–191. doi: 10.1164/rccm.202004-0951OC. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

ONLINE DATA SUPLLEMENT
DOI: 10.1513/AnnalsATS.202401-009OC

Articles from Annals of the American Thoracic Society are provided here courtesy of American Thoracic Society

RESOURCES