Skip to main content
American Journal of Respiratory and Critical Care Medicine logoLink to American Journal of Respiratory and Critical Care Medicine
letter
. 2021 Apr 15;203(8):1033–1037. doi: 10.1164/rccm.202011-4174LE

Spirometric Classifications of Chronic Obstructive Pulmonary Disease Severity as Predictive Markers for Clinical Outcomes: The HUNT Study

Laxmi Bhatta 1,*, Linda Leivseth 2, Xiao-Mei Mai 1, Anne Hildur Henriksen 1,3, David Carslake 4, Yue Chen 5, Pablo Martinez-Camblor 6, Arnulf Langhammer 7,8,, Ben Michael Brumpton 1,3,4,
PMCID: PMC8048755  PMID: 33332249

The classification of chronic obstructive pulmonary disease (COPD) severity is important in guiding therapy and prognosis (1). The Global Initiative for Chronic Obstructive Lung Disease (GOLD) has recommended GOLD grades (1) based on post-bronchodilator percentage-predicted FEV1 (ppFEV1), which is widely used in respiratory medicine. However, ppFEV1 has been criticized because of its susceptibility to physiological variation (24). Studies have recommended alternative expressions of FEV1 that could be used for the classification of COPD severity (2, 3, 59). For the first time, we have compared the predictive abilities of a broad range of FEV1 expressions for cause-specific mortality and hospitalization.

Some of the results of these studies have been previously reported in the form of a preprint (https://doi.org/10.1101/2020.11.03.20221432).

Methods

This study included people aged ≥40 years who participated in the HUNT2 Study (Trøndelag Health Study 2) during 1995–1997 (n = 44,384; 75.2% participation). A 5% random sample (n = 2,300) and people reporting asthma-related symptoms, diagnosis, or use of medication (n = 7,123) were invited to perform spirometry (10). For the analysis, we included 890 people with COPD who met both the fixed ratio (post-bronchodilator FEV1/FVC <0.70) and lower limit of normal criteria and had respiratory symptoms (daily cough in periods, cough with phlegm, wheezing, or dyspnea) and/or self-reported doctor-diagnosed COPD (1, 11).

Post-bronchodilator spirometry was performed 30 minutes after inhalation of 1 mg of terbutaline according to the 1994 American Thoracic Society guidelines (12, 13). Quality assurance of spirometric measurements is described in detail elsewhere (13, 14).

We defined expressions of FEV1 such as ppFEV1, FEV1 z-score, FEV1Ht−2, FEV1 Ht−3, and FEV1Q (described in detail in Reference 15) as suggested by the previous studies (13, 5, 6, 8, 9, 16). The Global Lung Function Initiative 2012 reference equation was used to calculate ppFEV1, ppFVC, FEV1 z-scores, and FVC z-scores (11, 13). FEV1 was standardized by the square of height in meters to calculate FEV1 Ht−2 (6, 9) and by the cube of height in meters to calculate FEV1 Ht−3 (5, 8). FEV1 was standardized by sex-specific lowest percentile (0.5 L for men and 0.4 L for women) of FEV1 distribution among patients to calculate FEV1Q, as suggested by Miller and Pederson in a large European population consisting of three cohorts (5).

Follow-up and outcomes

The study outcomes were all-cause mortality, respiratory mortality, cardiovascular mortality, the first unplanned COPD hospitalization, and the first unplanned pneumonia hospitalization. Participants were followed for 5 years, and right-censoring events were emigration (n = 3) or end of follow-up. Cause-specific mortality and hospitalizations were identified using International Classification of Diseases codes from medical records and are described in detail elsewhere (15).

Statistical analysis

Cumulative incidence curves for all-cause mortality were constructed through Kaplan-Meier estimates, and log-rank tests were used to test differences.

A regression tree method (17) that accounts for time and multiple outcomes was applied to obtain optimal cutoffs of FEV1Q (2.8, 4.1, and 5.2), termed FEV1Q grades.

We applied incident/dynamic time-dependent areas under the receiver operating characteristic curves (AUCs) that account for time to compare the predictive abilities of FEV1 expressions and their respective methods of classification of COPD severity to predict clinical outcomes (1821). For cause-specific mortality and hospitalization, AUCs accounting for competing risks were calculated (20). We used crude models to compare AUCs because the clinical decision does not explicitly take into account other factors (5). We used 10,000 bootstrap iterations to calculate the 95% confidence interval for AUCs (22). A general bootstrap algorithm (23) was applied to compare the AUCs.

Statistical analysis was performed using R 3.6.1 software (http://www.r-project.org).

Ethics

Ethical approval was obtained from the Regional Committees for Medical and Health Research Ethics (2015/1461/REK midt). All participants gave informed written consent.

Results

During the follow-up period, 146, 30, and 56 subjects died because of all causes, respiratory diseases, and cardiovascular diseases, respectively, and 172 and 96 were hospitalized because of COPD and pneumonia, respectively. At baseline, the average age of participants was 63.8 years, 6 of 10 participants were men, and more than half (53.3%) of participants were current smokers (15). A trend for increasing cumulative incidence of all-cause mortality with worsening categories of classifications of COPD severity was observed (Figure 1).

Figure 1.

Figure 1.

Cumulative incidence curves of classifications of chronic obstructive pulmonary disease (COPD) severity for all-cause mortality among participants with COPD aged ≥40 years in the HUNT2 Study (Trøndelag Health Study 2) (1995–1997) followed for 5 years. FEV1 Ht−2 = FEV1 standardized by square of height in meters; FEV1 Ht−3 = FEV1 standardized by cube of height in meters; FEV1Q = FEV1 standardized by sex-specific lowest percentile (0.5 L for men and 0.4 L for women) of FEV1 distribution; FEV1 z-score = FEV1z-score based on the Global Lung Function Initiative 2012 equation; GOLD = Global Initiative for Chronic Obstructive Lung Disease; ppFEV1 = percentage-predicted FEV1 based on the Global Lung Function Initiative 2012 equation.

When using FEV1 expressions as continuous measures, the AUCs for all-cause mortality were 64.5 for ppFEV1, 58.8 for FEV1 z-score, 68.9 for FEV1 Ht−2, 69.3 for FEV1 Ht−3, and 70.2 for FEV1Q (P value for AUCs between ppFEV1 and FEV1Q <0.001). Similar patterns of AUCs were observed for cause-specific mortality and hospitalization, except for respiratory mortality (P = 0.062) (Figure 2).

Figure 2.

Figure 2.

The areas under the receiver operating characteristic curves (AUCs) for different expressions of FEV1 and their respective methods of classification of chronic obstructive pulmonary disease (COPD) severity for all-cause mortality (n = 146), respiratory mortality (n = 30), cardiovascular mortality (n = 56), COPD hospitalization (n = 172), and pneumonia hospitalization (n = 96) among participants with COPD aged ≥40 years in the HUNT2 Study (Trøndelag Health Study 2) (1995–1997) followed for 5 years. #Continuous variables. *Grades/quartiles 3–4 were combined because of zero cases in grade/quartile 4 in Global Initiative for Chronic Obstructive Lung Disease (GOLD) grades, FEV1 z-score grades, and FEV1 standardized by square of height in meters (FEV1 Ht−2) grades. **Grades/quartiles 2–4 were analyzed because of zero cases in grade/quartile 1 of GOLD grades, FEV1 Ht−2 quartiles, FEV1 Ht−3 quartiles, FEV1 standardized by sex-specific lowest percentile (0.5 L for men and 0.4 L for women) of FEV1 distribution (FEV1Q) quartiles, and FEV1Q grades. Similar differences in AUCs were observed when grade/quartiles 1–2 were combined for respiratory mortality. P value represents the differences between ppFEV1 vs. FEV1Q, ppFEV1 quartiles versus FEV1Q quartiles, and GOLD grades versus FEV1Q grades. CI = confidence interval; FEV1 Ht−3 = FEV1 standardized by cube of height in meters; FEV1 z-score = FEV1 z-score based on the Global Lung Function Initiative 2012 equation; ppFEV1 = percentage-predicted FEV1 based on the Global Lung Function Initiative 2012 equation.

The FEV1Q grades had higher AUCs compared with GOLD grades for predicting all-cause mortality (P < 0.001), cardiovascular mortality (P = 0.005), COPD hospitalization (P < 0.001), and pneumonia hospitalization (P < 0.001) but not for respiratory mortality (P = 0.464) (Figure 2). Similar patterns of AUCs were observed when using FEV1 expressions as ppFEV1 quartiles and FEV1Q quartiles, except for respiratory mortality (P = 0.848) and cardiovascular mortality (P = 0.381) (Figure 2).

Discussion

In this population-based study, we found that among all FEV1 expressions, FEV1Q was the best predictor of clinical outcomes studied, followed by FEV1 Ht−2 or FEV1 Ht−3, across 5 years of follow-up. For respiratory mortality, the smaller sample size gives imprecise estimates, resulting in a marginally similar predictive ability for FEV1Q and ppFEV1.

FEV1 is a continuous variable, the expression of FEV1 is used for indicating lung function impairments in respiratory medicine, and ppFEV1 is most commonly used for this purpose (1). The GOLD grades based on ppFEV1 have been widely used for clinical purposes in classifying COPD severity (1). However, they have been criticized because of their susceptibility to physiological variation and poor prediction ability (24, 6). The FEV1 z-score avoids this bias due to physiological variation (2, 3). In addition, ppFEV1 and FEV1 z-scores are based on reference values and depend on the choice of reference equation; therefore, performance might vary with reference values (11, 13, 24, 25). Miller and colleagues (57) found that FEV1 expressions such as FEV1 Ht−2, FEV1 Ht−3, and FEV1Q, which are independent of reference equations, were better correlated with mortality than those that are dependent on reference equations. In addition, Miller and Pedersen (5) found that FEV1Q predicted mortality better than other FEV1 expressions. Extending this knowledge, our study supports FEV1Q as a stronger predictor than other FEV1 expressions in predicting multiple clinical outcomes. This indicates that the severity in people with COPD appears to be better related to how far the FEV1 of that person is from the “bottom line” rather than how far it is from a “predicted value.”

The predictive ability of a classification of COPD severity based on a FEV1 expression largely depends on the choice of cutoffs. For example, the GOLD grades and ppFEV1 quartiles had different predictive abilities in our study. Huang and colleagues (4) observed similar results. Therefore, the optimal cutoffs of FEV1 expressions for classification of COPD severity were investigated in this study, and we found that cutoffs for FEV1Q (2.8, 4.1, and 5.2; FEV1Q grades) were generally best in predicting clinical outcomes. The optimal cutoffs should be further investigated in a large multiethnic population with a wide age range. In a clinical setting, information such as age, sex, and height of patients with COPD is easily available. Therefore, using FEV1Q (or other expressions of FEV1 that are independent of reference equations) for risk classification of patients with COPD might be easy to apply and avoid variation due to dependence on reference equations (5). Furthermore, multidimensional prognostic indices that combine reference independent FEV1 expressions with symptoms, exacerbations, risk factors, and/or biomarkers should be investigated further.

This study also had certain limitations. Our methods may not capture nonlinear associations between FEV1 expressions and mortality (26) or hospitalization, and further studies investigating these approaches are needed.

In summary, these findings highlight improved prediction of outcomes by the use of FEV1Q.

Supplementary Material

Supplements
Author disclosures

Acknowledgments

Acknowledgment

The Trøndelag Health Study (HUNT) is a collaboration between HUNT Research Centre (Faculty of Medicine and Health Science, Norwegian University of Science and Technology), Trøndelag County Council, and the Norwegian Institute of Public Health. The Lung Study in HUNT2 was partly funded through a nondemanding grant from AstraZeneca Norway. Data on date of death and hospitalizations were obtained from the Norwegian Cause of Death Registry and The Nord-Trøndelag Hospital Trust, respectively.

Footnotes

Supported by ExtraStiftelsen Helse og Rehabilitering and Landsforeningen for hjerte-og-lungesyke (the Norwegian Extra Foundation for Health and Rehabilitation and the Norwegian Heart and Lung Patient Organization) (project number 2016/FO79031) and the liaison committee of the Central Norway Regional Health Authority Norwegian University of Science and Technology (NTNU). B.M.B. works in a research unit funded by Stiftelsen Kristian Gerhard Jebsen; Faculty of Medicine and Health Sciences, NTNU; The Liaison Committee for Education, Research and Innovation in Central Norway; the Joint Research Committee between St. Olavs Hospital and the Faculty of Medicine and Health Sciences, NTNU; and the Medical Research Council Integrative Epidemiology Unit at the University of Bristol, which is supported by the Medical Research Council and the University of Bristol (MC_UU_12013/1). D.C. works in a unit funded by the UK Medical Research Council (MC_UU_00011/1) and the University of Bristol.

Author Contributions: L.B., L.L., A.L., and B.M.B. conceived and designed the study. L.B. analyzed the data. L.B. wrote the first draft of the manuscript. All authors interpreted the results and revised and approved the manuscript for submission. L.B. and B.M.B. are accountable for the accuracy and integrity of all parts of the work. As project leader for the Lung Study in HUNT2, A.L. was responsible for planning, data collection, and quality assurance of data in the lung study.

Originally Published in Press as DOI: 10.1164/rccm.202011-4174LE on December 17, 2020

Author disclosures are available with the text of this letter at www.atsjournals.org.

References

  • 1.Global Initiative for Chronic Obstructive Lung Disease. Global strategy for the diagnosis, management, and prevention of chronic obstructive pulmonary disease. 2019 [accessed 2020 Jan 1]. Available from: https://goldcopd.org/
  • 2.Fragoso CA, Concato J, McAvay G, Yaggi HK, Van Ness PH, Gill TM. Staging the severity of chronic obstructive pulmonary disease in older persons based on spirometric Z-scores. J Am Geriatr Soc. 2011;59:1847–1854. doi: 10.1111/j.1532-5415.2011.03596.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Quanjer PH, Pretto JJ, Brazzale DJ, Boros PW. Grading the severity of airways obstruction: new wine in new bottles. Eur Respir J. 2014;43:505–512. doi: 10.1183/09031936.00086313. [DOI] [PubMed] [Google Scholar]
  • 4.Huang TH, Hsiue TR, Lin SH, Liao XM, Su PL, Chen CZ. Comparison of different staging methods for COPD in predicting outcomes. Eur Respir J. 2018;51:1700577. doi: 10.1183/13993003.00577-2017. [DOI] [PubMed] [Google Scholar]
  • 5.Miller MR, Pedersen OF. New concepts for expressing forced expiratory volume in 1 s arising from survival analysis. Eur Respir J. 2010;35:873–882. doi: 10.1183/09031936.00025809. [DOI] [PubMed] [Google Scholar]
  • 6.Miller MR, Pedersen OF, Dirksen A. A new staging strategy for chronic obstructive pulmonary disease. Int J Chron Obstruct Pulmon Dis. 2007;2:657–663. [PMC free article] [PubMed] [Google Scholar]
  • 7.Miller MR, Pedersen OF, Lange P, Vestbo J. Improved survival prediction from lung function data in a large population sample. Respir Med. 2009;103:442–448. doi: 10.1016/j.rmed.2008.09.016. [DOI] [PubMed] [Google Scholar]
  • 8.Fletcher C, Peto R, Tinker C, Speizer FE. The natural history of chronic bronchitis and emphysema: an eight-year study of early chronic obstructive lung disease in working men in London. London: Oxford University Press; 1976. [Google Scholar]
  • 9.Sorlie PD, Kannel WB, O’Connor G. Mortality associated with respiratory function and symptoms in advanced age: the Framingham Study. Am Rev Respir Dis. 1989;140:379–384. doi: 10.1164/ajrccm/140.2.379. [DOI] [PubMed] [Google Scholar]
  • 10.Bhatta L, Leivseth L, Mai X-M, Chen Y, Henriksen AH, Langhammer A, et al. Prevalence and trend of COPD from 1995-1997 to 2006-2008: the HUNT study, Norway. Respir Med. 2018;138:50–56. doi: 10.1016/j.rmed.2018.03.020. [DOI] [PubMed] [Google Scholar]
  • 11.Quanjer PH, Stanojevic S, Cole TJ, Baur X, Hall GL, Culver BH, et al. ERS Global Lung Function Initiative. Multi-ethnic reference values for spirometry for the 3-95-yr age range: the global lung function 2012 equations. Eur Respir J. 2012;40:1324–1343. doi: 10.1183/09031936.00080312. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Standardization of spirometry. update: American Thoracic Society. Am J Respir Crit Care Med. 1994;1995;152:1107–1136. doi: 10.1164/ajrccm.152.3.7663792. [DOI] [PubMed] [Google Scholar]
  • 13.Langhammer A, Johannessen A, Holmen TL, Melbye H, Stanojevic S, Lund MB, et al. Global Lung Function Initiative 2012 reference equations for spirometry in the Norwegian population. Eur Respir J. 2016;48:1602–1611. doi: 10.1183/13993003.00443-2016. [DOI] [PubMed] [Google Scholar]
  • 14.Hankinson JL, Eschenbacher B, Townsend M, Stocks J, Quanjer PH. Use of forced vital capacity and forced expiratory volume in 1 second quality criteria for determining a valid test. Eur Respir J. 2015;45:1283–1292. doi: 10.1183/09031936.00116814. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Bhatta L, Leivseth L, Mai X-m, Henriksen AH, Carslake D, Chen Y, et al. Spirometric classifications of COPD severity as predictive markers for clinical outcomes: the HUNT Study [preprint] medRxiv 2020. Available from: https://www.medrxiv.org/content/10.1101/2020.11.03.20221432v1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Pellegrino R, Viegi G, Brusasco V, Crapo RO, Burgos F, Casaburi R, et al. Interpretative strategies for lung function tests. Eur Respir J. 2005;26:948–968. doi: 10.1183/09031936.05.00035205. [DOI] [PubMed] [Google Scholar]
  • 17.De’ath G. Multivariate regression trees: a new technique for modeling species-environment relationships. Ecology. 2002;83:1105–1117. [Google Scholar]
  • 18.Heagerty PJ, Zheng Y. Survival model predictive accuracy and ROC curves. Biometrics. 2005;61:92–105. doi: 10.1111/j.0006-341X.2005.030814.x. [DOI] [PubMed] [Google Scholar]
  • 19.Kamarudin AN, Cox T, Kolamunnage-Dona R. Time-dependent ROC curve analysis in medical research: current methods and applications. BMC Med Res Methodol. 2017;17:53. doi: 10.1186/s12874-017-0332-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Saha P, Heagerty PJ. Time-dependent predictive accuracy in the presence of competing risks. Biometrics. 2010;66:999–1011. doi: 10.1111/j.1541-0420.2009.01375.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Martínez-Camblor P, Pardo-Fernández JC. Smooth time-dependent receiver operating characteristic curve estimators. Stat Methods Med Res. 2018;27:651–674. doi: 10.1177/0962280217740786. [DOI] [PubMed] [Google Scholar]
  • 22.Bansal A, Heagerty PJ. A tutorial on evaluating the time-varying discrimination accuracy of survival models used in dynamic decision making. Med Decis Making. 2018;38:904–916. doi: 10.1177/0272989X18801312. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Martínez-Camblor P, Corral N. A general bootstrap algorithm for hypothesis testing. J Stat Plan Inference. 2012;142:589–600. [Google Scholar]
  • 24.Langhammer A, Johnsen R, Gulsvik A, Holmen TL, Bjermer L. Forced spirometry reference values for Norwegian adults: the Bronchial Obstruction in Nord-Trøndelag Study. Eur Respir J. 2001;18:770–779. doi: 10.1183/09031936.01.00255301. [DOI] [PubMed] [Google Scholar]
  • 25.Moreira S, Fernandes M, Silva M, Escaleira D, Staats R, Valença J, et al. Comparison of the FEV1 value from five reference equations: ESCS 71|83|93, NHANES and GLI. Eur Respir J. 2017;50:PA2507. [Google Scholar]
  • 26.Tejero E, Prats E, Casitas R, Galera R, Pardo P, Gavilán A, et al. Classification of airflow limitation based on z-score underestimates mortality in patients with chronic obstructive pulmonary disease. Am J Respir Crit Care Med. 2017;196:298–305. doi: 10.1164/rccm.201611-2265OC. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplements
Author disclosures

Articles from American Journal of Respiratory and Critical Care Medicine are provided here courtesy of American Thoracic Society

RESOURCES