Abstract
Background
A user-friendly method for assessing lung cancer risk may help standardize selection of current and former smokers for screening. We evaluated a simple 4-factor model, the Pittsburgh Predictor, against two well-known, but more complicated models for predicting lung cancer risk.
Methods
Trained against outcomes observed in the National Lung Screening Trial (NLST), the Pittsburgh Predictor used four risk factors, duration of smoking, smoking status, smoking intensity, and age, to predict 6-year lung cancer incidence. After calibrating the Bach and PLCOM2012 models to outcomes observed in the low-dose computed tomography arm of the NLST, we compared model calibration, discrimination, and clinical usefulness (net benefit) in the NLST and Pittsburgh Lung Screening Study (PLuSS) populations.
Results
The Pittsburgh Predictor, Bach, and PLCOM2012 represented risk equally well, except for the tendency of PLCOM2012 to overestimate risk in subjects at highest risk. Relative to the Pittsburgh Predictor, Bach and PLCOM2012 increased the area under the receiver operator characteristic curve by 0.007 to 0.009 and 0.012 to 0.021 units, respectively, depending on study population. Across a clinically relevant span of 6-year lung cancer risk thresholds (0.01 to 0.05), Bach and PLCOM2012 increased net benefit by less than 0.1% in NLST and 0.3% in PLuSS.
Conclusion
In exchange for a small reduction in prediction accuracy, a simpler lung cancer risk prediction model may facilitate standardized procedures for advising and selecting patients with respect to lung cancer screening.
Keywords: lung cancer screening, LDCT, Pittsburgh predictor, PLuSS
Lung cancer remains the leading cause of cancer mortality in the U. S. [1]. Enrolling 55 to 74 year-old ≥30 pack-year current or former smokers (quit <15 years), the National Lung Screening Trial (NLST) recently reported 20% lower lung cancer mortality after three annual screenings with low-dose computed tomography (LDCT) vs. chest radiography [2]. Guided by the success of the NLST and a comparative modeling study which validated the selection criteria used by NLST [3], the United States Preventive Services Task Force (USPSTF) endorsed LDCT in 55 to 80 year-old ≥30 pack-year smokers, with annual screening to stop after a 15-year smoke-free interval [4].
As a matter of policy, the USPSTF guideline may be used to define eligibility for LDCT. However, several lung cancer risk prediction models have been published [4–11]. Unlike the straightforward USPSTF guideline, these models place smokers on a continuum of risk, determined by a panel of risk factors. Some models define complex non-linear relationships between the smoking-related variables and lung cancer risk [5,11]. According to some analysts, the use of a model to select smokers for screening may improve the efficiency of LDCT, including the number of persons needed to screen to detect a fixed number with lung cancer [11].
Against this backdrop, we conceived a simple 4-factor lung cancer risk prediction model, less complicated than currently available models, perhaps more amenable to widespread application, yet devised to place individuals on a lung-cancer risk continuum. It is important to point out that we derived and validated our model in populations preselected to be high risk for lung cancer, that are consistent with the USPSTF recommendations for screening, but not representative of all smokers. We systematically evaluated our model, the Pittsburgh Predictor, against two other models, one validated by Bach et al. [5] and a second by investigators from the Prostate, Lung Colorectal, and Ovarian (PLCO) Cancer Screening Trial [11].
METHODS
Data sources
We used data from the National Lung Screening Trial (NLST) [2,12,13] (Cancer Data Access System, September 9, 20121) and the Pittsburgh Lung Screening Study (PLuSS) [14]. Between August 2002 and September 2007, NLST enrolled 53,454 55 to 74 year-old current and former (quit <15 years) cigarette smokers (≥30 pack years) and randomized 26,722 and 26,732 to three annual screenings with low-dose computed tomography (LDCT) or chest radiography (CXR), respectively. Between January 2002 and April 2005, the Pittsburgh Lung Screening Study (PLuSS) recruited 3654 50–79 year-old current and former (quit <10 years) cigarette smokers (>10 cigarettes per day for ≥25 years) and completed baseline LDCT in 3642 and repeat LDCT one year later in 3423, with selected subjects receiving additional protocol directed LDCT in later years.
Questionnaire data
With the following exceptions, NLST and PLuSS collected comparable risk factor information from self-completed questionnaires administered at the beginning of follow-up. PLuSS and the non-ACRIN portion of NLST determined duration of smoking as the difference between current age (active smoker) or age quit (former smoker) and age smoking started, whereas the ACRIN portion of NLST recorded the number of years total a subject had smoked. NLST recorded smoking intensity as integer cigarettes per day, whereas PLuSS recorded smoking intensity in five discrete categories. Questionnaires used in NLST, but not PLuSS, included two items about asbestos. All data analyses represented age as age at study enrollment, history of chronic obstructive pulmonary disease (COPD) as history of a doctor diagnosis of emphysema, family history of lung cancer and personal cancer history.2
Predictive models
We considered two published lung cancer prediction models [5,11], the Bach and PLCOM2012 (Table 1). Bach [5] modeled data for 14,254 smokers and 3918 asbestos-exposed persons who participated in the Carotene and Retinol Efficacy Trial (CARET). Between June 1985 and September 1994, CARET enrolled 50–69 year-old current or former cigarette smokers. In addition to cigarette smoking (current or quit <15 years, ≥ 30 pack years), eligibility by virtue of asbestos exposure required radiologic evidence or relevant job history. With factors for sex, age, asbestos, and cigarette smoking (years smoking, years quit, and cigarettes per day), Bach predicts cumulative lung cancer risk from two proportional hazards regression equations, one equation for lung cancer risk and a second equation for mortality from causes other than lung cancer. Based on 1070 lung cancers in CARET over a mean 9.3 year follow-up, the Bach equations predict risks in ≥55 year-old persons followed to age 85 years.
Table 1.
Risk factors included in the Pittsburgh Predictor (PP), Bach [5] and PLCOM2012 [11] lung cancer risk prediction models
| Factor | PP | Bach | PLCOM2012 |
|---|---|---|---|
| Age | X | X | X |
| Duration of smoking | X | X | X |
| Smoking intensity | X | X | X |
| Smoking status | X | X | |
| Smoking quit time | X | X | |
| Sex | X | ||
| Asbestos exposure | X | ||
| Race or ethnic group | X | ||
| Education | X | ||
| Personal history of cancer | X | ||
| Family history of lung cancer | X | ||
| Chronic obstructive pulmonary disease | X | ||
| Body-mass index | X |
PLCOM2012 [11] modeled data for the 39,928 current or former smokers (709 lung cancers over six years of follow-up) in the control arm of the Prostate, Lung, Colorectal, and Ovarian (PLCO) Cancer Screening Trial. Between November 1993 and July 2001, PLCO enrolled 55–74 year-old persons without a history of prostate, lung, colorectal, or ovarian cancer. With factors for age, cigarette smoking (years smoking, smoking status, years quit, and cigarettes per day), race or ethnic group, education, personal history of cancer, family history of lung cancer, history of chronic obstructive pulmonary disease, and BMI, PLCOM2012 uses a logistic equation to predict 6-year lung cancer risk.
Our simple Pittsburgh Predictor model included four risk factors, duration of smoking (four categories, <30, 30–39, 40–49, and ≥50 years), smoking status (two categories, currently smoking or quit <1 year and quit ≥1 year), smoking intensity (four categories, <20, 20–29, 30–39, and ≥40 cigarettes per day) and age. To moderate its correlation with duration of smoking, younger and older age groups were determined in relation to the smoking duration-specific median ages observed in NLST (Table 1). Model training excluded 793 (3.0%) and 1082 (4.0%) participants in NLST LDCT and CXR, respectively. Reasons for exclusion included absent baseline LDCT or CXR and post randomization evidence indicating a possible violation of NLST entry criteria [12]. Retaining 25,929 LDCT and 25,648 CXR NLST-eligible and screened participants, we used our four risk factors and lung cancer occurrence over six years to fit two logistic regression models. One model used data from NLST LDCT (1000 lung cancers; area under the receiver operator characteristic curve – AuROC, 0.679) and the second model used data from NLST CXR (854 lung cancers; AuROC, 0.687). Obtaining roughly equal parameter estimates with LDCT and CXR data, we averaged the risk factor-specific parameter estimates from LDCT and CXR models, multiplied by 10, and rounded to the nearest integer (Table 2). Relative to a reference category, the resulting integer point value associated with each risk factor category represents 10 times the increase in the logit of the expected 6-year lung cancer risk. A summation of the integer point values associated with each of our four risk factors produces a total risk score with possible values ranging between -17 points (<57 year-old former smoker of <20 cigarettes/day for <30 years) and 23 points (≥68 year-old current smoker of ≥40 cigarettes/day for ≥50 years). A total score of 0 points captures the risk associated with the reference categories, <59 year-old current smoker of 20–29 cigarettes/day for 30–39 years (Figure 1).
Table 2.
Pittsburgh Predictor, a 4-factor lung cancer risk prediction model
| Risk factor | Points | |
|---|---|---|
| 1. Duration of smoking, years | <30 | −10 |
| 30–39 | 0 | |
| 40–49 | 8 | |
| ≥50 | 14 | |
| 2. Age, years [1] | younger | 0 |
| older | 4 | |
| 3. Smoking status [2] | current | 0 |
| former | −3 | |
| 4. Smoking intensity, cigarettes/day | <20 | −4 |
| 20–29 | 0 | |
| 30–39 | 2 | |
| ≥40 | 5 |
Older age defined as age ≥57, ≥59, ≥61, and ≥68 years for persons smoking <30, 30–39, 40–49, and ≥50 years, respectively.
The quit smoking category includes former smokers with current attained age at least one year greater than age quit.
Figure 1.
Screen shot from Pittsburgh Predictor
As a measure of clinical usefulness, we calculated net benefit (for details see the Supplement). Using net benefit, our analysis aimed, in part, to develop a quantitative understanding of the relative clinical usefulness of our simple model, the Pittsburgh Predictor.
Data analysis
NLST LDCT and NLST CXR data were used to evaluate model calibration (Bach and PLCOM2012) and discrimination (Pittsburgh Predictor, Bach, and PLCOM2012). Approximately 0.2% and 4.7% of Bach and PLCOM2012 predictions in NLST replaced missing or extreme risk factor data with imputed values (data not shown). After recalibrating Bach and PLCOM2012 against NLST LDCT, we evaluated model calibration and discrimination in PLuSS, restricted to the 3642 subjects with baseline LDCT. Lacking data about asbestos, the application of Bach to PLuSS assumed an 8.1% and 0.8% frequency of asbestos exposure in men and women, respectively, the frequency of asbestos work or asbestosis reported by men and women in NLST. Limited to smoking intensity data in discrete categories, the application of Bach and PLCOM2012 to PLuSS assigned values of 10.0, 15.8, 20.6, 30.4, and 45.3 cigarettes per day to smokers of <10, 10–19, 20–29, 30–39, and ≥40 cigarettes per day, the average cigarettes per day reported by NLST participants in corresponding categories.
To evaluate calibration, we estimated the intercept term in logistic regressions with the logit of model predictions entered as an offset, examined discretized and local regression (LOESS) plots of observed vs. expected lung cancer risk, and calculated the Hosmer–Lemeshow χ2 test statistic [15]. Comparisons of the discrimination achieved by different models used integrated plots of predictiveness and classification performance and estimates of the AuROC, net reclassification improvement (NRI), continuous NRI (cNRI), and integrated discrimination improvement (IDI) [16]. Finally, measures of net benefit were used to compare clinical usefulness [17]. Data analyses used SAS for Windows (version 9.3).
RESULTS
Description of the study cohorts
PLuSS had 3642 subjects, 51% men, median age 58 years (interquartile range, IQR, 54–64 years), and 60% current smokers with median 39 years duration (IQR 34–45 years) and 9% ≥2 packs per day. NLST had 51,577 subjects (25,929 LDCT and 25,648 CXR), 59% men, median age 60 years (IQR 57–65 years), and 49% current smokers with median duration 40 years (IQR 35–45 years) and 23% ≥2 packs per day. The mean numbers of protocol-based lung cancer screenings in PLuSS and NLST were 2.5 and 2.8 per subject, respectively. Six-year lung cancer risks in PLuSS, NLST LDCT, and NLST CXR were 3.9% (95% confidence interval, CI, 3.3–4.6%, 143 cases), 3.9% (95% CI 3.6–4.1%, 1000 cases), and 3.3% (95% CI 3.1–3.6%, 854 cases), respectively. Overall, 0.7%, 2.2%, and 1.6% of lung cancer-free survivors in PLuSS, NLST LDCT, and NLST CXR, respectively, were lost to follow-up before the fifth anniversary of study enrollment.
Model calibration
The published Bach and PLCOM2012 models predicted average lung cancer risk in NLST CXR after slight (<10%) recalibration (ORrecal 0.99 and 1.06, respectively; Figure 2) and in NLST LDCT after moderate (≥10%) recalibration (ORrecal 1.16 and 1.24, respectively; Figure 2). Apart from over estimation of risk by PLCOM2012 for NLST LDCT subjects in the highest risk category, the Pittsburgh Predictor, Bach, and PLCOM2012 represented risk well across levels of risk (Figure 2). Lung cancer occurred more frequently in PLuSS than predicted by the Pittsburgh Predictor (ORrecal 1.24, 95% CI 1.04–1.46), Bach (ORrecal 1.21, 95% CI 1.02–1.43), or PLCOM2012 (ORrecal 1.16, 95% CI 0.97–1.37), with expected lung cancer occurrence determined by models calibrated to NLST LDCT. After recalibration to PLuSS, the Pittsburgh Predictor, Bach, and PLCOM2012 represented risk well across levels of risk (Figure 3), except again for the tendency of PLCOM2012 to overestimate risk for subjects at higher risk.
Figure 2.
Pittsburgh Predictor, Bach, and PLCOM2012 calibration plots in NLST LDCT and NLST CXR. The Pittsburgh Predictor calculates 6-year lung cancer risks as , where α = −4.0641 after LDCT or −4.2195 after CXR and S = point total calculated according to the method shown in Table 2. The Bach and PLCOM2012 predictions use equations in Bach et al. [5] and Tammemagi et al. [11], inflated in accordance with ORrecal, the ratio between the lung cancer odds with and without recalibration. ORrecal estimates and 95% confidence intervals were derived from the intercept terms of logistic regressions with the logit of Bach or PLCOM2012 predictions entered as offsets. Abbreviations: H-L p-value, statistical significance of the Hosmer–Lemeshow χ2 goodness-of-fit statistic; AuROC, lung cancer discrimination represented as the area (95% confidence interval) under the receiver operator characteristic curve.
Figure 3.
Pittsburgh Predictor, Bach, and PLCOM2012 calibration plots in PLuSS. Predicted risks derive from NLST LDCT-calibrated equations, as defined for Figure 2, inflated further by amounts shown in each panel as ORrecal. See Figure 2 legend for definitions.
Scatter plots showed a linear association in PLuSS between the risk predictions offered by the Pittsburgh Predictor and Bach, but curvilinear associations for comparisons that involved PLCOM2012, with the PLCOM2012 producing greater estimates of risk for persons with 6-year risks >6% according to either the Pittsburgh Predictor or Bach (Figure 4).
Figure 4.
Six-year lung cancer risks for individuals (‘•’ symbols in gray) participating in the Pittsburgh Lung Screening Study (PLuSS; N=3642), according to the Pittsburgh Predictor, Bach, and PLCOM2012 prediction models calibrated to NLST LDCT. The gray line marks the points of equivalence between the predictions offered by two models, one shown on the x-axis and the second on the y-axis. The red curves show results from local regressions of the y-axis prediction on the x-axis prediction.
Model discrimination
Depending on the cohort, models discriminated with AuROC values between 0.678 and 0.721 (Figures 2 and 3). Some improvements in AuROC were statistically significant (e.g., PLCOM2012 relative to the Pittsburgh Predictor in NLST CXR, AuROC difference 0.015, p-value 0.016. Across a clinically relevant span of 6-year lung cancer risk thresholds (0.01 to 0.05), the Bach and PLCOM2012, relative to the Pittsburgh Predictor, increased net benefit by less than 0.1% in NLST and by less than 0.3% in PLuSS (see Supplement for details).
Lung cancer risk calculator
For illustration, we programmed Excel to calculate the Pittsburgh Predictor (Figure 1). The calculator defines four risk levels (≤0, 1–8, 9–14, and 15–23 total points), corresponding to 1.4%, 2.6%, 5.4%, and 9.5% average 6-year lung cancer risks in NLST LDCT.
DISCUSSION
We describe a simple 4 factor model, the Pittsburgh Predictor, for assessing lung cancer risk in the high risk population potentially eligible for LDCT lung cancer screening. LDCT will likely enter routine clinical practice and potentially affect as many as nine million current or former U.S. smokers [18]. Appropriately, the imprecision of the information available regarding the benefits and harms of LDCT provokes discussion about risk thresholds for screening, with higher thresholds expected to maximize benefit in relation to harm. In a clinical context, the optimal threshold might vary from smoker to smoker in accordance with personal valuations of the many potential benefits and risks of screening.
In this context, we formulated and evaluated a simple lung cancer risk prediction model, suitable for use in populations similar to NLST and PLuSS, and most importantly, the population for which the USPSTF has advocated lung cancer screening. This model, the Pittsburgh Predictor, calculates 6-year lung cancer risk based on four factors, duration of smoking, smoking status, smoking intensity, and age (Table 2). With two previously published models (Bach [5] and PLCOM2012 [11]) as benchmarks, the Pittsburgh Predictor calibrated well (Figures 2–4), with qualifications noted in the following paragraph. As expected, the Pittsburgh Predictor discriminated less well overall, as expressed by reductions in the AuROC of 0.007 to 0.021 units (Figures 2–3), depending on study population and choice of model for comparison. The simplicity of the PP (4 factors vs. 6 factors for Bach vs. 12 factors for PLCOM2012) explains the lower AUC. Yet, across a clinically relevant span of decision thresholds, a span that includes the average risks observed in NLST and PLuSS, this weakening in prediction accuracy corresponded to reductions of net benefit no greater than 0.1% in NLST and 0.3% in PLuSS (see Supplement for details).
The results described here reflect model performance in a setting of active lung cancer screening. As a consequence, the accuracy of the absolute predictions delivered by the Pittsburgh Predictor and the recalibrated Bach and PLCOM2012 models depends on comparability between NLST and a target population with respect to the screening, diagnostic, and follow-up experiences of the two populations. In this context, 6-year lung cancer risks in PLuSS were ~20% greater than predicted by NLST LDCT-calibrated models, as demonstrated by values for ORrecal > 1.00 (Figure 3; Pittsburgh Predictor 1.24, Bach 1.21, PLCOM2012 1.16). Other explanations for the higher than expected lung cancer risk observed in PLuSS include 1) risk determinants (e.g., indoor radon), uniquely prevalent in PLuSS, but not captured by any model and 2) random chance (Type I error). For two reasons, we anticipated that models calibrated to NLST LDCT might underpredict in PLuSS; 1) PLuSS, in its early days, pursued over aggressively a surgical diagnosis for LDCT-detected abnormalities [14] and 2) PLuSS, continued for some participants, as noted above, periodic LDCT after the initial screening rounds, factors expected to contribute to over-diagnosis and lead time still unresolved at the end of follow-up, respectively. Similarly, differences in clinical practice might explain some of the variability in risk-adjusted lung cancer incidence observed center to center in NLST. As a final note, the amplified risks calculated by PLCOM2012 for PLuSS participants in the highest decile of risk (Figure 4) explain completely (data not shown) the superior calibration achieved overall by PLCOM2012 relative to Bach (ORrecal 1.16 vs. 1.21). This apparent superiority of PLCOM2012 could reflect more accurate estimation of risk among person placed at very high risk by virtue of risk factors unique to PLCOM2012 (e.g., family history of lung cancer or personal history of chronic obstructive pulmonary disease.
Using a predictor of lung cancer death, Kovalchik et al. [19] estimated that 99% of the lung cancer death prevented by LDCT could have been achieved by restricting LDCT screening to the 80% of NLST at highest risk. For comparison, we determined that 93% of 6-year lung cancer incidence in NLST LDCT occurred among persons who belonged to the top 80% of risk, as determined by the Pittsburgh Predictor (i.e., >0 total score, >1.7% predicted risk). These examples illustrate the manner in which stratification according to risk might be used to limit LDCT to those persons most likely to benefit.
To demonstrate its simplicity, we programmed the Pittsburgh Predictor as a Microsoft Excel® application, available for download. As shown in figure 1, this application uses a 5-item questionnaire to ascertain the four factors that determine the Pittsburgh Predictor total score (Figure 1, table 2). The simplicity of the Pittsburgh Predictor would not preclude use of other less common factors predictive of lung cancer for decision making purposes (e.g., family history of lung cancer, quit time, occupational or environmental exposures, and personal history of emphysema), particularly for persons with calculated risks close to a decision threshold, but our model does not take these factors into account.
There are several limitations to our model. First, it is derived in preselected high risk populations (PLuSS and NLST) and not necessarily applicable to the general population of smokers. It is applicable to the population eligible for LDCT based on USPSTF recommendations. Second, the USPSTF criteria for screening include some low risk individuals and exclude some high risk individuals. Our model may help identify a relatively low risk subpopulation within a preselected high risk population. Third, the PP has a ceiling effect of about 12% risk, although for decision making purposes our focus is on lower risk individuals. Fourth, our model was derived and tested in the United States and applicability to other populations will need to be tested.
In conclusion, we present a detailed and transparent description of the development and validation of the Pittsburgh Predictor. While results from NLST established the benefits of LDCT, the utility in clinical practice of risk-based strategies for advising at-risk persons about LDCT, while intuitive, has not been proven. Nonetheless, simplicity, a motivating principle behind the Pittsburgh Predictor, may facilitate this process.
Supplementary Material
Highlights.
The Pittsburgh predictor is a simple 4 factor prediction model for lung cancer occurrence.
For 1 – 5 % 6 year lung cancer risk prediction, the Pittsburgh predictor performs well.
The Pittsburgh predictor may facilitate standardized procedures for advising and selecting patients with respect to lung cancer screening.
Acknowledgments
Supported by the University of Pittsburgh Cancer Institute’s Specialized Program of Research Excellence (SPORE) in Lung Cancer (NCI P50-CA90440) and the Cancer Center Core Grant (NCI 2P30 CA047904)
The authors thank Dr. Brenda Diergaarde for her critical review of this manuscript.
Abbreviations
- ACRIN
American College of Radiology Imaging Network
- AuROC
area under the receiver operator characteristic curve
- CDAS
Cancer Data Access System
- CXR
chest radiography
- LDCT
low-dose computed tomography
- NLST
National Lung Screening Trial
- NRI
net reclassification improvement
- PLCO
Prostate, Lung, Colorectal, and Ovarian Cancer Screening Trial
- PLuSS
Pittsburgh Lung Screening Study
Footnotes
Drs. Wilson and Weissfeld have no conflicts or financial disclosures.
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
References
- 1.Siegel R, Naishadham D, Jemal A. Cancer statistics, 2013. CA Cancer J Clin. 2013;63:11–30. doi: 10.3322/caac.21166. [DOI] [PubMed] [Google Scholar]
- 2.National Lung Screening Trial Research Team. Aberle DR, Adams AM, et al. Reduced lung-cancer mortality with low-dose computed tomographic screening. N Engl J Med. 2011;365:395–409. doi: 10.1056/NEJMoa1102873. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.de Koning HJ, Meza R, Plevritis SK, et al. Benefits and harms of computed tomography lung cancer screening strategies: A comparative modeling study for the U.S. Preventive Services Task Force. Ann Intern Med. 2014;160:311–20. doi: 10.7326/M13-2316. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Humphrey LL, Deffebach M, Pappas M, et al. Screening for lung cancer with low-dose computed tomography: a systematic review to update the US Preventive services task force recommendation. Ann Intern Med. 2013;159:411–20. doi: 10.7326/0003-4819-159-6-201309170-00690. [DOI] [PubMed] [Google Scholar]
- 5.Bach PB, Kattan MW, Thornquist MD, et al. Variations in lung cancer risk among smokers. J Natl Cancer Inst. 2003;95:470–8. doi: 10.1093/jnci/95.6.470. [DOI] [PubMed] [Google Scholar]
- 6.Cassidy A, Myles JP, van Tongeren M, et al. The LLP risk model: an individual risk prediction model for lung cancer. Br J Cancer. 2008;98:270–6. doi: 10.1038/sj.bjc.6604158. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Maisonneuve P, Bagnardi V, Bellomi M, et al. Lung cancer risk prediction to select smokers for screening CT--a model based on the Italian COSMOS trial. Cancer Prev Res (Phila) 2011;4:1778–89. doi: 10.1158/1940-6207.CAPR-11-0026. [DOI] [PubMed] [Google Scholar]
- 8.Spitz MR, Etzel CJ, Dong Q, et al. An expanded risk prediction model for lung cancer. Cancer Prev Res (Phila) 2008;1:250–4. doi: 10.1158/1940-6207.CAPR-08-0060. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Spitz MR, Hong WK, Amos CI, et al. A risk model for prediction of lung cancer. J Natl Cancer Inst. 2007;99:715–26. doi: 10.1093/jnci/djk153. [DOI] [PubMed] [Google Scholar]
- 10.Tammemagi CM, Pinsky PF, Caporaso NE, et al. Lung cancer risk prediction: Prostate, Lung, Colorectal And Ovarian Cancer Screening Trial models and validation. J Natl Cancer Inst. 2011;103:1058–68. doi: 10.1093/jnci/djr173. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Tammemagi MC, Katki HA, Hocking WG, et al. Selection criteria for lung-cancer screening. N Engl J Med. 2013;368:728–36. doi: 10.1056/NEJMoa1211776. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.National Lung Screening Trial Research Team. Aberle DR, Berg CD, et al. The National Lung Screening Trial: overview and study design. Radiology. 2011;258:243–53. doi: 10.1148/radiol.10091808. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.National Lung Screening Trial Research Team. Church TR, Black WC, et al. Results of initial low-dose computed tomographic screening for lung cancer. N Engl J Med. 2013;368:1980–91. doi: 10.1056/NEJMoa1209120. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Wilson DO, Weissfeld JL, Fuhrman CR, et al. The Pittsburgh Lung Screening Study (PLuSS): outcomes within 3 years of a first computed tomography scan. Am J Respir Crit Care Med. 2008;178:956–61. doi: 10.1164/rccm.200802-336OC. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Hosmer DW, Hosmer T, Le Cessie S, Lemeshow S. A comparison of goodness-of-fit tests for the logistic regression model. Stat Med. 1997;16:965–80. doi: 10.1002/(sici)1097-0258(19970515)16:9<965::aid-sim509>3.0.co;2-o. [DOI] [PubMed] [Google Scholar]
- 16.Pencina MJ, D’Agostino RB, Sr, D’Agostino RB, Jr, Vasan RS. Evaluating the added predictive ability of a new marker: from area under the ROC curve to reclassification and beyond. Stat Med. 2008;27:157–72. doi: 10.1002/sim.2929. discussion 207–12. [DOI] [PubMed] [Google Scholar]
- 17.Vickers AJ, Elkin EB. Decision curve analysis: a novel method for evaluating prediction models. Med Decis Making. 2006;26:565–74. doi: 10.1177/0272989X06295361. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Ma J, Ward EM, Smith R, Jemal A. Annual number of lung cancer deaths potentially avertable by screening in the United States. Cancer. 2013;119:1381–5. doi: 10.1002/cncr.27813. [DOI] [PubMed] [Google Scholar]
- 19.Kovalchik SA, Tammemagi M, Berg CD, et al. Targeting of low-dose CT screeningaccording to the risk of lung-cancer death. N Engl J Med. 2013;369:245–54. doi: 10.1056/NEJMoa1301851. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.




