Abstract
BACKGROUND
Currently recommended traditional spirometry outputs do not reflect the relative contributions of emphysema and airway disease to airflow obstruction. We hypothesized that machine-learning algorithms can be trained on spirometry data to identify these structural phenotypes.
METHODS
Participants enrolled in a large multicenter study (COPDGene) were included. The data points from expiratory flow-volume curves were trained using a deep-learning model to predict structural phenotypes of chronic obstructive pulmonary disease (COPD) on CT, and results were compared with traditional spirometry metrics and an optimized random forest classifier. Area under the receiver operating characteristic curve (AUC) and weighted F-score were used to measure the discriminative accuracy of a fully convolutional neural network, random forest, and traditional spirometry metrics to phenotype CT as normal, emphysema-predominant (>5% emphysema), airway-predominant (Pi10 > median), and mixed phenotypes. Similar comparisons were made for the detection of functional small airway disease phenotype (>20% on parametric response mapping).
RESULTS
Among 8980 individuals, the neural network was more accurate in discriminating predominant emphysema/airway phenotypes (AUC 0.80, 95%CI 0.79–0.81) compared with traditional measures of spirometry, FEV1/FVC (AUC 0.71, 95%CI 0.69–0.71), FEV1% predicted (AUC 0.70, 95%CI 0.68–0.71), and random forest classifier (AUC 0.78, 95%CI 0.77–0.79). The neural network was also more accurate in discriminating predominant emphysema/small airway phenotypes (AUC 0.91, 95%CI 0.90–0.92) compared with FEV1/FVC (AUC 0.80, 95%CI 0.78–0.82), FEV1% predicted (AUC 0.83, 95%CI 0.80–0.84), and with comparable accuracy with random forest classifier (AUC 0.90, 95%CI 0.88–0.91).
CONCLUSIONS
Structural phenotypes of COPD can be identified from spirometry using deep-learning and machine-learning approaches, demonstrating their potential to identify individuals for targeted therapies.
TRIAL REGISTRATION
ClinicalTrials.gov NCT00608764.
FUNDING
This study was supported by NIH grants K23 HL133438 and R21EB027891 and an American Thoracic Foundation 2018 Unrestricted Research Grant. The COPDGene study is supported by NIH grants NHLBI U01 HL089897 and U01 HL089856. The COPDGene study (NCT00608764) is also supported by the COPD Foundation through contributions made to an Industry Advisory Committee comprising AstraZeneca, Boehringer-Ingelheim, GlaxoSmithKline, Novartis, and Sunovion.
Keywords: Pulmonology
Keywords: COPD
Emphysema and airway predominant COPD can be identified from spirometry using machine learning.
Introduction
Chronic obstructive pulmonary disease (COPD) is an inflammatory disease of the lungs that is associated with substantial respiratory morbidity and health care costs and is now the fourth leading cause of death in the United States (1). COPD is defined by persistent airflow obstruction on spirometry, the result of a combination of 2 distinct structural processes: emphysema characterized by alveolar destruction and poor elastic recoil of the lungs as well as airway disease characterized by airway narrowing and remodeling (2, 3). Although spirometric measures of airflow obstruction correlate strongly with CT measures of both emphysema and airway disease, spirometry does not discern the relative contributions of these structural disease processes to overall airflow obstruction. Furthermore, recent studies demonstrate that approximately half of current and former smokers, with no evidence of spirometric airflow obstruction according to traditional criteria, have evidence of emphysema and/or airway disease (4, 5). These findings suggest that the existing spirometry criteria for airflow obstruction are not sensitive to the contributory structural changes.
The inability to accurately and easily differentiate predominant emphysema from predominant airway disease hinders the development of targeted therapies (6). Furthermore, these structural changes have significant consequences beyond those due to lung function impairment. The degree of emphysema and airway wall thickening on CT are both independently associated with worse respiratory quality of life, dyspnea, and mortality (7–13). Despite these associations, CT is often not recommended for diagnosis in clinical practice due to concerns about high costs and risk of radiation. There are currently no low-cost, low-risk tools to phenotype the structural components of COPD, and even its diagnosis relies on demonstrating abnormalities in discrete components of spirometry, such as the forced expiratory volume in first second (FEV1) and the ratio of FEV1 to the forced vital capacity (FEV1/FVC). Specific components of the spirometric flow-volume and volume-time curves have been analyzed to identify early and mild COPD but have not been successful in distinguishing emphysema-predominant disease from airway disease predominance (14–20).
We hypothesized that machine-learning approaches trained on all the data points contained in the expiratory flow-volume curve would accurately distinguish individuals with predominant emphysema from those with predominant airway disease. We used fully convolutional network (FCN) and random forest classifier to test our hypothesis.
Results
Participant characteristics
After exclusions (Figure 1), the expiratory flow-volume curves of 8980 participants were included in the analyses (Table 1). The mean (SD) age was 57.8 years (8.4 years); 4177 participants (39%) were female, 6085 participants (67.7%) were non-Hispanic white, and 2895 participants (32.3%) were African Americans. 4705 participants (52.9%) were active smokers at enrollment. The cohort included 3926 participants (44.1%) without airflow obstruction (GOLD 0) and 3901 participants (43.4%) with airflow obstruction, including 724 (8.1%), 1696 (19.1%), 984 (11.1%), and 497 (5.6%) with GOLD stages 1–4, respectively. 1066 participants (11.9%) had preserved ratio impaired spirometry (PRISm).
Table 1. Baseline characteristics of participants in each structural phenotype.
Structurally normal CT scans were seen in 3085 participants (34.3%). Airway-predominant disease, defined by Pi10, was seen in 3207 participants (35.7%), emphysema-predominant disease was seen in 1390 participants (15.4%), and a mixed phenotype was seen in 1298 participants (14.4%) (Table 1). Airway-predominant disease, defined by functional small airway disease, was seen in 826 participants (10.25%), emphysema-predominant disease was seen in 200 participants (2.54%), and a mixed phenotype was seen in 1636 participants (20.79%).
Classification results for phenotyping emphysema/airway disease
Training.
Results presented here are aggregated means from 10 replications of Monte-Carlo cross-validation in the training data set. The average out-of-the-bag error for the random forest model was 0.45 (95%CI 0.44 to 0.46), whereas the average validation loss for the neural network was 1.01 (95%CI 1.00 to 1.02). The parameters and weights of the model with minimum validation loss (neural network) and minimum out-of-the-bag error (random forest) among the 10 training/validation splits were used to evaluate the model performance on the held-out test set.
Held-out test data set.
In the test data (20% of cohort), (1796 participants), 617 participants (34.3%) were normal, 641 participants (35.6%) had predominant airway disease, 278 participants (15.4%) had emphysema predominant disease, and 260 participants (14.4%) participants had a mixed phenotype. For the prediction of structural phenotypes, the AUCs for FEV1% predicted and FEV1/FVC were 0.70 (95%CI 0.68 to 0.71) and 0.71 (95%CI 0.69 to 0.71), respectively. The area under the receiver operating characteristic curve (AUC) for the random forest classification was 0.78 (95%CI 0.77 to 0.79). The neural network outperformed traditional measures of spirometry and also the optimized random forest classifier with AUC of 0.80 (95%CI 0.79 to 0.81) (Table 2). The F1 score for the neural network was 0.56 compared with 0.45, 0.43, and 0.54 for FEV1% predicted, FEV1/FVC, and random forest classifier, respectively (Figure 2 and Figure 3). Within each structural disease class, neural network again outperformed the traditional spirometry measures for classification (Table 2 and Figures 2 and 3). Results for feature importance using SHapley Additive exPlanation (SHAP) values are shown in Supplemental Figures 1 and 2 (supplemental material available online with this article; https://doi.org/10.1172/jci.insight.132781DS1).
Table 2. Discriminative accuracy of traditional spirometry metrics, random forest classifier, and deep-learning model for emphysema/medium size airway disease.
Classification results for phenotyping emphysema/functional small airway disease
Training.
The average out-of-the-bag error for the random forest model was 0.19 (95%CI 0.20 to 0.18), whereas the average validation loss for the neural network was 0.57 (95%CI 0.55 to 0.59).
Held out test data set.
In the test data (20% of cohort), (1574 participants), 1041 (66.1%) were normal, 165 (10.4%) had predominant small airway disease, 40 (2.5%) had emphysema predominant disease, and 328 (20.8%) participants had a mixed phenotype. For the prediction of structural phenotypes, the AUCs for FEV1% predicted and FEV1/FVC were 0.83 (95%CI 0.80 to 0.84) and 0.80 (95%CI0.78 to 0.82), respectively. The AUC for the random forest classification was 0.90 (95%CI 0.88 to 0.91). The neural network outperformed traditional measures of spirometry and had similar discrimination compared with the random forest classifier with AUC of 0.91 (95%CI 0.90 to 0.92) (Table 3). The F1 score for the neural network was 0.79 compared with 0.73, 0.71, and 0.76 for FEV1% predicted, FEV1/FVC, and random forest classifier, respectively (Figure 4 and Figure 5). Within each structural disease class, FCN again outperformed the traditional spirometry measures for classification (Table 3 and Figures 4 and 5).
Table 3. Discriminative accuracy of traditional spirometry metrics, random forest classifier, and deep-learning model for emphysema/functional small airway disease.
Discussion
In a large multicenter cohort of current and former smokers, machine-learning approaches, including deep-learning methods, trained on spirometry data outperformed traditional spirometry measures for the phenotyping of COPD into its structural components, including predominant small airway disease, and provided flow-volume curve signatures for predominant structural disease categories. These results will enhance patient identification for phenotypic characterization and targeting therapies.
Spirometric impairment is a summary metric, and the development of targeted therapies is hindered by the inability to identify predominant COPD phenotypes. Existing threshold-based spirometry criteria are also insensitive to early and mild damage in the lungs. As much as 20%–25% of the lung may be affected by emphysema before these changes manifest on spirometry (21). Substantial airway remodeling and loss also occur before the development of significant spirometric impairment (22, 23). Because airflow in the middle part of the flow-volume curve disproportionately results from flow in the small airways, multiple studies have evaluated various methods of analyzing the mid and distal parts of the curve. These include the forced expiratory flow in the 25th to the 75th percentile, forced expiratory flow in the first 3 seconds, the shape of the maximum expiratory curve, and change in angle of flow during forced exhalation (14–20, 24–29). None of these studies, however, validated their measures against structural lung disease and, hence, were unable to separate emphysema from airway predominant disease.
Distinguishing predominant emphysema and airway disease is relevant for optimizing and advancing clinical care. Current therapy for COPD includes bronchodilators and inhaled corticosteroids, and only half of patients treated with these medications have a clinically meaningful improvement in their respiratory quality of life (30). These therapies target airway tone and inflammation and do not target emphysema. Although no specific pharmacologic therapies are currently approved that separately target emphysema and airway disease, interventional and pharmacologic therapies are being developed that are likely to benefit carefully phenotyped and selected individuals. Surgical and bronchoscopic lung volume reduction procedures are approved for severe emphysema. New interventions that target chronic bronchitis and airway remodeling are being developed, and there are ongoing trials that specifically target emphysema (NCT02696564). The results of this study can help identify these patients for clinical trials and eventually therapy.
Several aspects related to deep learning that are pertinent to this study should be considered. The prediction of structural lung disease from a sequence of flow values derived from a forced expiratory effort is effectively a sequence classification task. Capitalizing on recent advances in deep learning, several studies implemented convolutional neural networks and long short-term memory models to classify sequential data from natural language processing and speech recognition tasks. In the current study, we applied a fully convolutional network (FCN) as well as a random forest classifier on flow sequence generated from the expiratory flow-volume curve to phenotype structural lung disease as the outcome. Wang et al. proposed the use of FCNs to analyze sequential data and achieved robust results on several data sets from the University of California, Riverside, time series classification archive (31, 32). The FCN architecture was initially proposed for semantic image segmentation tasks, where the architecture is composed of 3 computation blocks and each block performs convolution operations followed by batch normalization and ReLU activation layers. The resulting output from the 3 convolution operations is fed into a global average pooling layer, which drastically reduces the number of training parameters and further enables the visualization of class activations specific to each class. The minimal requirement for preprocessing and feature crafting before classification, and the visualization of feature activations specific to each class through the pooling layer, make FCN an effective choice for classification of sequential or time series data in the medical domain. Nonetheless, a random forest classifier with optimized parameters performed almost equally well as the neural network, suggesting that a number of other machine-learning and deep-learning algorithms may be applied to the sequence of raw data points that constitute the spirometry curves. Random forest relies on decision points and avoids correlated points, whereas FCN uses near neighbors that may be correlated.
Although further improvement in accuracy may be possible with other algorithms, the overall results reflect the frequently observed overlapping and interrelated structural changes in both airways and the parenchyma in varying proportions that occurs in the majority of smokers. There is considerable airway-parenchymal interdependence; the presence of emphysema can untether airways and result in a predisposition to airway collapse, and peribronchial fibrosis and airway loss can result in distal emphysema. Although this inherent biological complexity limits the information that can be ascertained from spirometry alone, the probability scores that result from the FCN model for each individual raise the likelihood of accurately identifying the predominant structural category and represent a substantial advance in the identification of structural phenotypes in COPD.
The study has several strengths. Data from a large multicenter cohort of participants whose disease spanned the range of severity were included. Extensive CT phenotyping was performed with stringent quality control of both CT and spirometry. The structural phenotypes were classified using quantitative CT data that are more objective than labels applied by experts and are less subject to variability. COPDGene (Genetic Epidemiology of COPD) included a substantial number of African Americans and women. The training of the neural network and subsequent hyperparameter optimization was performed over 10 replications of Monte-Carlo cross-validation to ensure robustness of the model. The final evaluation of the classifier was performed on a hold-out test data set, which was not seen by the model previously.
Limitations.
The study also has several limitations. First, COPDGene included current and former smokers, and hence, these results should be validated in cohorts that include nonsmokers with and at risk for COPD. Second, CT scans were not spirometry gated. Participants were, however, coached to reproducibly achieve maximal inhalation. Third, the outcome variables were numeric values for CT parameters, and it is not known how factors that cause variability in CT assessments, such as scanner type and field of view, effect the performance of the machine-learning models. These aspects need further analysis. Fourth, although the top 5 flows at a given volume that are associated with each of the phenotypic classes were identified, we are unable to ascribe a physiologic explanation to these findings. The flow values in combination appear to reflect structural processes that are not detected when discrete single values are used. Although we used SHAP to identify the top features, interpretation is limited, as these features may slightly differ in a different data set, and hence, the inherent black box nature of FCN remains (33). Fifth, machine-learning algorithms can be affected by underfitting and overfitting biases (34), but we obtained similar results in a hold-out test data set.
Conclusions.
Structural phenotypes of COPD can be identified from spirometry using a deep neural network and machine-learning approaches, demonstrating their potential to identify individuals for targeted therapies. Further research is necessary to evaluate the applicability of the deep-learning model to improve COPD outcomes.
Methods
Study population and physiologic assessments.
Spirometry data from participants enrolled in the COPDGene study were included (35). COPDGene is a large multicenter cohort study of current and former smokers aged between 45 and 80 years, with a smoking history of at least 10-pack years; the details of this study have been previously published.
All participants underwent a standard protocol, which included prebronchodilator and postbronchodilator spirometry using the New Diagnostic Design Easy-One spirometer per the American Thoracic Society criteria. Postbronchodilator spirometry was performed 20 minutes after administration of 180 μg albuterol HFA with a spacer (Aerochamber, Monaghan Medical Corporation). Quality control was performed by including only those spirometry efforts that met at least grade 2 ATS standards (repeatable between 100 and 150 ml). The postbronchodilator ratio of FEV1/FVC < 0.70 was used to confirm the presence of airflow obstruction (36), and FEV1% predicted was used to estimate the severity of airflow obstruction per Global initiative for Obstructive Lung Disease (GOLD) recommendations (37). Participants with FEV1/FVC >0.70 but with FEV1% predicted <80% were categorized as having PRISm (38). We selected the postbronchodilator effort with the highest sum of FEV1 and FVC for the analysis as per ATS criteria. The raw data points that constitute the expiratory flow-volume curve were decomposed incrementally as flow data at every 30 mL volume exhaled (39, 40).
CT-based phenotyping.
Quantitative CT scans were acquired at maximal inspiration (total lung capacity). Emphysema was quantified on inspiratory CT as the percentage of low attenuation areas <–950 Hounsfield units using Slicer 3D software (11). Clinically significant emphysema was defined as ≥5% low attenuation areas. This threshold was selected as there appears to be an inflection point at 5%, above which the frequency of exacerbations and mortality increases considerably (12). Large and medium size airway disease was quantified by the Pi10, the square root of the wall area of a hypothetical airway with internal perimeter of 10 mm, using Apollo Software (VIDA Diagnostics) (11). Because there is no established threshold for clinically significant airway wall disease, Pi10 > median in the COPDGene cohort was used for categorization as significant airway disease. Functional small airway disease (fSAD) phenotype was quantified by >20% lung affected by small airway disease measured on parametric response mapping, where fSAD is nonemphysematous air trapping and, hence, an indirect measure of small airway disease (41, 42).
Using these emphysema and Pi10 thresholds, we classified participants into 1 of 4 CT categories: normal, <5% emphysema and < median Pi10; airway predominant, <5% emphysema but with Pi10 ≥ median; emphysema predominant, ≥5% emphysema and Pi10 < median; and mixed emphysema/airway, ≥5% emphysema and ≥ median Pi10. We also classified participants into 4 groups based on fSAD: normal, <5% emphysema and <20% fSAD; airway predominant, <5% emphysema but with fSAD ≥20%; emphysema predominant, ≥5% emphysema and fSAD <20%; and mixed emphysema/airway, ≥5% emphysema and fSAD ≥20%.
Deep neural network.
FCN was developed for image segmentation tasks and has shown significant improvements in efficiency and overall performance as compared with traditional deep convolutional networks. In this study, we used FCN as a feature extractor of a time series (or sequential) data, where these features were further fed into a global average pooling layer and a soft-max layer to classify the sequences into different labels. The basic architecture of FCN includes 3 stacked computation blocks, where each block consists of 1D convolutional layer followed by a batch normalization layer and a rectified linear unit activation layer (Supplemental Figure 3). Convolution on the 1D input sequence was performed by the convolutional layers followed by the batch normalization layer to improve generalizability and faster convergence. The penultimate global average pooling layer reduces the number of weights and prevents overfitting. This FCN architecture has been previously shown to achieve superior performance in several 1D sequence classification tasks.
Model training and evaluation.
The flow data points in each expiratory flow-volume curve were used as a 1D input sequence, and each sequence was standardized to have a length of 200 points using data padding with zeros at the end of the sequence. The expiratory flow data was divided into input (80%) and hold-out test (20%) data sets. All possible combinations of number of filters (32, 64, 128, 256 filters) in the convolutional layers, learning rate in the range of 0.00001–0.1, and batch sizes of 64, 128, and 256 were evaluated on the training set to select the best hyperparameters. The hyperparameter tuning was performed using TALOS library in Python. The model with the best hyperparameters, where the 3 convolutional layers with filter sizes of 128, 256, and 128, corresponding kernel sizes of 9, 5, and 3, at a learning rate of 0.0001, with batch size of 64 over 100 epochs, was selected for further evaluation. The input data set was further divided into 10 random splits of training (80%) and validation (20%) to train the FCN model. The weights of the neural network with minimum loss on the validation set were used for subsequent evaluation on the hold-out test data set. Early stopping of the training was implemented when there was no decline in the validation loss for at least 25 epochs. The learning rate was reduced by a factor of 0.01 after 15 epochs of no decline in the validation loss. The primary outcome was classification of each participant into 1 of the 4 structural disease categories on quantitative CT. Supplemental Figure 4 shows the visualization of the FCN training process with the chosen hyperparameters to classify spirometry data into the 4 different structural COPD phenotypes. The performance of the FCN was compared by implementing optimized random forest model (parameters were chosen by 5-fold cross validation and selected the model with minimum out-of-the-bag error) on the same input sequences and also with the performance of the traditional spirometry variables (FEV1/FVC and FEV1% predicted). Computation of feature importance using SHAP values is described in the Supplemental Methods.
Statistics.
AUC analyses were computed to evaluate the accuracy of the FCN and the random forest classifier. Their discriminative accuracies were compared with 2 traditional spirometry measurements (FEV1/FVC and FEV1% predicted) based on logistic regression. Sensitivity, specificity, Youden index (sensitivity + specificity −1), and F1 score for structural disease classification were tested for each model (43). The nonparametric DeLong test was used to compare AUCs between the models (44). A 2-tailed P value of < 0.05 was considered significant for all analyses. Analyses were performed using Python ≥ 3.0, R version ≥ 3.6.0 (R Project for Statistical Computing), and MedCalc Statistical Software.
Study approval.
All participants provided written informed consent before enrollment, and the COPDGene study protocol was approved by the University of Alabama at Birmingham Institutional Review Board (IRB) for human use (F070712014). The COPDGene study was approved by the IRBs of all 21 participating clinical centers: Ann Arbor VA Medical Center IRB (no. 2014-060462), Ann Arbor, Michigan, USA; Baylor College of Medicine IRB (H-22209); Brigham and Women’s Hospital Partners Human Research Committee (no. 2007P000554); Columbia University IRB (AAAC9324), New York, New York, USA; Duke University Health System IRB (no. Pro00004464), Durham, North Carolina, USA; Johns Hopkins Medicine IRB (NA_00011524), Baltimore, Maryland, USA; Los Angeles Biomedical Research Institute Human Subjects Committee (no. 12756-03), Torrance, California, USA; Michael E. DeBakey VA Medical Center IRB (no. H-22202), Houston, Texas, USA; Minneapolis VA Health Care System Minnesota (no. 4128-A), Minneapolis, Minnesota, USA; Health Partners Twin Cities IRB, (no. 07-127) Minneapolis–Saint Paul, Minnesota, USA; Morehouse School of Medicine IRB (no. 97826), Atlanta, Georgia, USA; National Jewish Health IRB (no. 1883a); Reliant Medical Group IRB (Fallon) (no. 1441), Worcester, Massachusetts, USA; Temple University IRB (no. 21659), Philadelphia, Pennsylvania, USA; University California, San Diego, Human Research Protections Program (no. 140070), San Diego, California, USA; University of Iowa IRB (no. 200710717); University of Michigan Medical School IRB (HUM00014973), Ann Arbor, Michigan, USA; University of Minnesota IRB Human Subjects Committee (no. 0801M24949), Minneapolis, Minnesota, USA; University of Pittsburgh IRB (no. 07120059); and University of Texas Health Science Center at San Antonio IRB (HSC20070644H), San Antonio, Texas, USA.
Author contributions
SB and SPB had full access to all of the data in the study and take responsibility for the integrity of the data and the accuracy of the data analysis. SB and SPB conceived and designed the study. SB, AN, JMR, CGW, MLM, RR, BCJ, NRB, PJC, FCS, CZ, PVB, SPB acquired, analyzed, or interpreted data. SB and SPB drafted the manuscript. SB, AN, JMR, CGW, MLM, RR, BCJ, NRB, PJC, FCS, CZ, PVB, SPB provided critical revision of the manuscript for important intellectual content. SB, BCJ, and SPB provided statistical analysis. SPB supervised the study.
Supplementary Material
Acknowledgments
The authors thank the staff and participants of the COPDGene study for their important contributions. This study was supported by NIH grants K23 HL133438 and R21EB027891 (to SPB) and an American Thoracic Foundation 2018 Unrestricted Research Grant (to SB). JMR was supported by an NIH grant. MLM was supported by grants from the NIH and the US Department of Veteran Affairs. The COPDGene study is supported by NIH grants NHLBI U01 HL089897 and U01 HL089856. The COPDGene study (NCT00608764) is also supported by the COPD Foundation through contributions made to an Industry Advisory Committee comprising AstraZeneca, Boehringer-Ingelheim, GlaxoSmithKline, Novartis, and Sunovion.
Version 1. 06/18/2020
In-Press Preview
Version 2. 07/09/2020
Electronic publication
Footnotes
Conflict of interest: JMR reports holding stock in VIDA Diagnostics and personal fees from Boehringer Ingelheim. PJC reports personal fees and grant support from GlaxoSmithKline and personal fees from Novartis in the past 3 years. SPB has served on advisory boards for Sunovion and GlaxoSmithKline.
Copyright: © 2020, American Society for Clinical Investigation.
Reference information: JCI Insight. 2020;5(13):e132781.https://doi.org/10.1172/jci.insight.132781.
Contributor Information
Sandeep Bodduluri, Email: sbodduluri@uabmc.edu.
Arie Nakhmani, Email: anry@uab.edu.
Joseph M. Reinhardt, Email: joe-reinhardt@uiowa.edu.
Carla G. Wilson, Email: WilsonC@NJHealth.org.
Ramaraju Rudraraju, Email: rrudraraju@uabmc.edu.
Nirav R. Bhakta, Email: nirav.bhakta@ucsf.edu.
Peter J. Castaldi, Email: repjc@channing.harvard.edu.
Frank C. Sciurba, Email: sciurbafc@upmc.edu.
Chengcui Zhang, Email: czhang02@uab.edu.
Purushotham V. Bangalore, Email: puri@uab.edu.
Surya P. Bhatt, Email: sbhatt@uabmc.edu.
References
- 1.US Burden of Disease Collaborators, et al. The State of US Health, 1990-2016: Burden of Diseases, Injuries, and Risk Factors Among US States. JAMA. 2018;319(14):1444–1472. doi: 10.1001/jama.2018.0158. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Hogg JC, et al. The nature of small-airway obstruction in chronic obstructive pulmonary disease. N Engl J Med. 2004;350(26):2645–2653. doi: 10.1056/NEJMoa032158. [DOI] [PubMed] [Google Scholar]
- 3.McDonough JE, et al. Small-airway obstruction and emphysema in chronic obstructive pulmonary disease. N Engl J Med. 2011;365(17):1567–1575. doi: 10.1056/NEJMoa1106955. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Regan EA, et al. Clinical and radiologic disease in smokers with normal spirometry. JAMA Intern Med. 2015;175(9):1539–1549. doi: 10.1001/jamainternmed.2015.2735. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Woodruff PG, et al. Clinical significance of symptoms in smokers with preserved pulmonary function. N Engl J Med. 2016;374(19):1811–1821. doi: 10.1056/NEJMoa1505971. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Elbehairy AF, Parraga G, Webb KA, Neder JA, O’Donnell DE, Canadian Respiratory Research Network (CRRN) Mild chronic obstructive pulmonary disease: why spirometry is not sufficient! Expert Rev Respir Med. 2017;11(7):549–563. doi: 10.1080/17476348.2017.1334553. [DOI] [PubMed] [Google Scholar]
- 7.Bhatt SP, et al. Imaging Advances in Chronic Obstructive Pulmonary Disease. Insights from the Genetic Epidemiology of Chronic Obstructive Pulmonary Disease (COPDGene) Study. Am J Respir Crit Care Med. 2019;199(3):286–301. doi: 10.1164/rccm.201807-1351SO. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Coxson HO. Quantitative computed tomography assessment of airway wall dimensions: current status and potential applications for phenotyping chronic obstructive pulmonary disease. Proc Am Thorac Soc. 2008;5(9):940–945. doi: 10.1513/pats.200806-057QC. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Coxson HO, Rogers RM. Quantitative computed tomography of chronic obstructive pulmonary disease. Acad Radiol. 2005;12(11):1457–1463. doi: 10.1016/j.acra.2005.08.013. [DOI] [PubMed] [Google Scholar]
- 10.Gietema HA, Edwards LD, Coxson HO, Bakke PS, ECLIPSE Investigators Impact of emphysema and airway wall thickness on quality of life in smoking-related COPD. Respir Med. 2013;107(8):1201–1209. doi: 10.1016/j.rmed.2013.04.016. [DOI] [PubMed] [Google Scholar]
- 11.Grydeland TB, et al. Quantitative computed tomography measures of emphysema and airway wall thickness are related to respiratory symptoms. Am J Respir Crit Care Med. 2010;181(4):353–359. doi: 10.1164/rccm.200907-1008OC. [DOI] [PubMed] [Google Scholar]
- 12.Han MK, et al. Association between emphysema and chronic obstructive pulmonary disease outcomes in the COPDGene and SPIROMICS Cohorts: A Post Hoc Analysis of Two Clinical Trials. Am J Respir Crit Care Med. 2018;198(2):265–267. doi: 10.1164/rccm.201801-0051LE. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Johannessen A, et al. Mortality by level of emphysema and airway wall thickness. Am J Respir Crit Care Med. 2013;187(6):602–608. doi: 10.1164/rccm.201209-1722OC. [DOI] [PubMed] [Google Scholar]
- 14.Li H, Liu C, Zhang Y, Xiao W. The concave shape of the forced expiratory flow-volume curve in 3 seconds is a practical surrogate of FEV1/FVC for the diagnosis of airway limitation in inadequate spirometry. Respir Care. 2017;62(3):363–369. doi: 10.4187/respcare.05016. [DOI] [PubMed] [Google Scholar]
- 15.Nève V, Edmé JL, Baquet G, Matran R. Reference ranges for shape indices of the flow-volume loop of healthy children. Pediatr Pulmonol. 2015;50(10):1017–1024. doi: 10.1002/ppul.23112. [DOI] [PubMed] [Google Scholar]
- 16.O’Donnell CR, Rose RM. The flow-ratio index. An approach for measuring the influence of age and cigarette smoking on maximum expiratory flow-volume curve configuration. Chest. 1990;98(3):643–646. doi: 10.1378/chest.98.3.643. [DOI] [PubMed] [Google Scholar]
- 17.O’Donnell CR, Sneddon SL, Schenker M, Garshick E, Speizer FE, Mead J. Accuracy of spirometric and flow-volume indices obtained by digitizing volume-time tracings. Am Rev Respir Dis. 1987;136(1):108–112. doi: 10.1164/ajrccm/136.1.108. [DOI] [PubMed] [Google Scholar]
- 18.Ohwada A, Takahashi K. Concave pattern of a maximal expiratory flow-volume curve: a sign of airflow limitation in adult bronchial asthma. Pulm Med. 2012;2012:797495. doi: 10.1155/2012/797495. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Zheng CJ, Adams AB, McGrail MP, Marini JJ, Greaves IA. A proposed curvilinearity index for quantifying airflow obstruction. Respir Care. 2006;51(1):40–45. [PubMed] [Google Scholar]
- 20.Wang W, Xie M, Dou S, Cui L, Xiao W. Computer quantification of “angle of collapse” on maximum expiratory flow volume curve for diagnosing asthma-COPD overlap syndrome. Int J Chron Obstruct Pulmon Dis. 2016;11:3015–3022. doi: 10.2147/COPD.S118415. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Bergin C, et al. The diagnosis of emphysema. A computed tomographic-pathologic correlation. Am Rev Respir Dis. 1986;133(4):541–546. doi: 10.1164/arrd.1986.133.4.541. [DOI] [PubMed] [Google Scholar]
- 22.Bodduluri S, et al. Airway fractal dimension predicts respiratory morbidity and mortality in COPD. J Clin Invest. 2018;128(12):5374–5382. doi: 10.1172/JCI120693. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Koo HK, et al. Small airways disease in mild and moderate chronic obstructive pulmonary disease: a cross-sectional study. Lancet Respir Med. 2018;6(8):591–602. doi: 10.1016/S2213-2600(18)30196-6. [DOI] [PubMed] [Google Scholar]
- 24.Kapp MC, Schachter EN, Beck GJ, Maunder LR, Witek TJ. The shape of the maximum expiratory flow volume curve. Chest. 1988;94(4):799–806. doi: 10.1378/chest.94.4.799. [DOI] [PubMed] [Google Scholar]
- 25.Omland O, Sigsgaard T, Pedersen OF, Miller MR. The shape of the maximum expiratory flow-volume curve reflects exposure in farming. Ann Agric Environ Med. 2000;7(2):71–78. [PubMed] [Google Scholar]
- 26.Orlowska K, Grebska E. [Examination of the accuracy of spirometric parameters: comparison of FMF (forced midexpiratory flow) and FEV 1 (forced expiratory volume)] Gruzlica. 1972;40(5):425–430. [PubMed] [Google Scholar]
- 27.Schachter EN, Kapp MC, Maunder LR, Beck G, Witek TJ. Smoking and cotton dust effects in cotton textile workers: an analysis of the shape of the maximum expiratory flow volume curve. Environ Health Perspect. 1986;66:145–148. doi: 10.1289/ehp.8666145. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Tien YK, Elliott EA, Mead J. Variability of the configuration of maximum expiratory flow-volume curves. J Appl Physiol Respir Environ Exerc Physiol. 1979;46(3):565–570. doi: 10.1152/jappl.1979.46.3.565. [DOI] [PubMed] [Google Scholar]
- 29.Varga J, et al. Relation of concavity in the expiratory flow-volume loop to dynamic hyperinflation during exercise in COPD. Respir Physiol Neurobiol. 2016;234:79–84. doi: 10.1016/j.resp.2016.08.005. [DOI] [PubMed] [Google Scholar]
- 30.Jones PW, et al. Responder analyses for treatment effects in COPD using the St George’s Respiratory Questionnaire. Chronic Obstr Pulm Dis. 2017;4(2):124–131. doi: 10.15326/jcopdf.4.2.2017.0130. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31. Wang Z, Yan W, Oates T. Time series classification from scratch with deep neural networks: A strong baseline. 2017 International Joint Conference on Neural Networks (IJCNN). doi: 10.1109/IJCNN.2017.7966039. https://arxiv.org/abs/1611.06455 Accessed on June 23, 2020.
- 32. Dau HA, et al. UCR Time Series Classification Archive. https://www.cs.ucr.edu/~eamonn/time_series_data_2018/ Accessed June 23, 2020.
- 33.Wang F, Kaushal R, Khullar D. Should health care demand interpretable artificial intelligence or accept “black box” medicine? Ann Intern Med. 2020;172(1):59–60. doi: 10.7326/M19-2548. [DOI] [PubMed] [Google Scholar]
- 34.Wang F, Casalino LP, Khullar D. Deep learning in medicine-promise, progress, and challenges. JAMA Intern Med. 2019;179(3):293–294. doi: 10.1001/jamainternmed.2018.7117. [DOI] [PubMed] [Google Scholar]
- 35.Regan EA, et al. Genetic epidemiology of COPD (COPDGene) study design. COPD. 2010;7(1):32–43. doi: 10.3109/15412550903499522. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Bhatt SP, et al. Discriminative accuracy of FEV1:FVC thresholds for COPD-related hospitalization and mortality. JAMA. 2019;321(24):2438–2447. doi: 10.1001/jama.2019.7233. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Vogelmeier CF, et al. Global Strategy for the Diagnosis, Management, and Prevention of Chronic Obstructive Lung Disease 2017 Report. GOLD Executive Summary. Am J Respir Crit Care Med. 2017;195(5):557–582. doi: 10.1164/rccm.201701-0218PP. [DOI] [PubMed] [Google Scholar]
- 38.Wan ES, et al. Epidemiology, genetics, and subtyping of preserved ratio impaired spirometry (PRISm) in COPDGene. Respir Res. 2014;15:89. doi: 10.1186/s12931-014-0089-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Bhatt SP, et al. New spirometry indices for detecting mild airflow obstruction. Sci Rep. 2018;8(1):17484. doi: 10.1038/s41598-018-35930-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Bhatt SP, et al. The peak index: spirometry metric for airflow obstruction severity and heterogeneity. Ann Am Thorac Soc. 2019;16(8):982–989. doi: 10.1513/AnnalsATS.201811-812OC. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Galbán CJ, et al. Computed tomography-based biomarker provides unique signature for diagnosis of COPD phenotypes and disease progression. Nat Med. 2012;18(11):1711–1715. doi: 10.1038/nm.2971. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Bhatt SP, et al. Association between functional small airway disease and FEV1 decline in chronic obstructive pulmonary disease. Am J Respir Crit Care Med. 2016;194(2):178–184. doi: 10.1164/rccm.201511-2219OC. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Youden WJ. Index for rating diagnostic tests. Cancer. 1950;3(1):32–35. doi: 10.1002/1097-0142(1950)3:1<32::AID-CNCR2820030106>3.0.CO;2-3. [DOI] [PubMed] [Google Scholar]
- 44.DeLong ER, DeLong DM, Clarke-Pearson DL. Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics. 1988;44(3):837–845. doi: 10.2307/2531595. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.