Abstract
Background
The PROspective Multicenter Imaging Study for Evaluation of chest pain (PROMISE) minimal-risk tool was recently developed to identify patients with suspected stable angina at very low risk of coronary artery disease (CAD) and clinical events. We assessed the external validity of this tool within the context of the Scottish Computed Tomography of the HEART (SCOT-HEART) multicenter randomised controlled trial of patients with suspected stable angina due to coronary disease.
Methods
The minimal-risk tool was applied to 1764 patients with complete imaging and follow-up data. External validity was compared with the guideline-endorsed CAD Consortium (CADC) risk score and determined through tests of model discrimination and calibration.
Results
A total of 531 (30.1%, mean age 52.4 years, female 62.0%) patients were classified as minimal-risk. Compared to the remainder of the validation cohort, this group had lower estimated pre-test probability of coronary disease according to the CADC model (30.0% vs 47.0%, p < 0.001). The PROMISE minimal-risk tool improved discrimination compared with the CADC model (c-statistic 0.785 vs 0.730, p < 0.001) and was improved further following re-estimation of covariate coefficients (c-statistic 0.805, p < 0.001). Model calibration was initially poor (χ2 197.6, Hosmer-Lemeshow [HL] p < 0.001), with significant overestimation of probability of minimal risk, but improved significantly following revision of the PROMISE minimal-risk intercept and covariate coefficients (χ2 5.6, HL p = 0.692).
Conclusion and relevance
Despite overestimating the probability of minimal-risk, the PROMISE minimal-risk tool outperforms the CADC model with regards to prognostic discrimination in patients with suspected stable angina, and may assist clinicians in decisions regarding non-invasive testing.
Trial registration
ClinicalTrials.gov identifier: NCT01149590
Abbreviations: PROMISE, PROspective Multicenter Imaging Study for Evaluation of chest pain; SCOT-HEART, Scottish COmputed Tomography of the HEART; CCTA, coronary computed tomography angiography; CAD, coronary artery disease; CADC, Coronary Artery Disease Consortium; HL, Hosmer-Lemeshow; MICE-PMM, multiple imputation using regression switching with predictive mean matching
Keywords: Stable angina, Coronary artery disease, Coronary computed tomography angiography
1. Introduction
Chest pain is responsible for > 1% of all presentations to family physicians, although stable coronary artery disease (CAD) is the underlying cause in only a minority [1], [2]. Increased community awareness of CAD risk and improvements in primary prevention have led to progressively lower disease prevalence within this patient population and the frequency of abnormal results on ischaemia testing is now < 10% [3]. There is a clear need to refine the assessment of suspected stable angina to optimise the efficient use of diagnostic resources and minimise unnecessary investigations. Recently, investigators from the North American PROspective Multicenter Imaging Study for Evaluation of chest pain (PROMISE) trial developed a risk model to identify individuals at very low risk of CAD [4]. To investigate the generalisability of this risk score, we undertook an external validation in a United Kingdom-based study of computed tomography in the diagnosis of CAD: the Scottish COmputed Tomography of the HEART (SCOT-HEART) trial.
2. Methods
The SCOT-HEART study was a prospective multicenter randomised controlled trial investigating the role of coronary computed tomography angiography (CCTA) in patients referred to a specialist clinic with suspected angina due to CAD. The study design [5] and principal findings [6] have previously been reported. Briefly, participants were recruited from 12 cardiology chest pain clinics across Scotland and those randomised to the intervention arm underwent CCTA imaging in addition to routine clinical care. In contrast with PROMISE, there was a higher prevalence of obstructive coronary disease in the SCOT-HEART population (25.4% vs 10.7%) [7]. For the purposes of this analysis, we limited the validation cohort to those individuals randomised to assessment with CCTA who had sufficient data to determine minimal-risk. The mean period of follow-up was 3.3 ± 1.0 years.
Consistent with PROMISE [4], minimal-risk was defined as requiring a coronary calcium score of 0, no CCTA evidence of coronary atherosclerosis, and the absence of any cardiovascular events (including all-cause death, non-fatal myocardial infarction or coronary revascularisation) during follow-up. All variables included in the published model were evaluated in this analysis with the exception of ethnicity because of the disparate ethnic composition of the study populations. The remaining variables include: age, sex, smoking history, diabetes mellitus, dyslipidemia, family history of premature coronary artery disease, hypertension, symptoms related to stress and high-density lipoprotein (HDL) concentration. In cases where HDL-cholesterol concentrations were not available, multiple imputation using regression switching with predictive mean matching (MICE-PMM) was employed [8].
Multivariable binomial logistic regression using the published model coefficients was used to estimate the probability of minimal-risk for each participant and the predicted risk was compared to the observed outcomes for these individuals. Model discrimination was determined from area under the receiver-operator curve (AUC), or c-statistic, and compared to the established CAD Consortium (CADC) risk score, which has recently been demonstrated to outperform the older Diamond-Forrester score [9], [10]. The variables included in the CADC model include age, sex and typicality of presenting symptoms (typical, atypical or non-anginal). Discrimination reflects the ability of the model to correctly distinguish between minimal-risk (no plaque and no events) and other-risk individuals (i.e. place all subjects in the correct rank order of risk). Calibration describes the agreement between predicted and observed likelihood of minimal risk for an individual. Model calibration was assessed visually by plotting predicted versus observed risk in deciles and quantified with the Hosmer-Lemeshow (HL) statistic. Tests of discrimination and calibration were performed sequentially after step-wise updating of the model intercept (‘recalibration-in-the-large’) and slope to allow for differences in baseline risk between the derivation and validation populations. Finally, these tests were repeated following model revision, retaining the initial covariates but with coefficients re-estimated within the SCOT-HEART cohort. Statistical analysis was performed using R version 3.3.0 (R Foundation for Statistical Computing, Vienna, Austria).
3. Results
Within the 2073 participants randomised to the intervention arm, 1778 underwent CCTA scanning. In total, 1764 patients (57.6, SD 9.5 years, female 44%) had complete imaging and outcome data available for analysis of whom 531 (30.1%; 52.4, SD 9.6 years, female 62.0%) fulfilled all criteria for minimal-risk (Supplementary Table 1). High-density lipoprotein cholesterol (HDL-C) concentrations were unavailable in 506 patients (28.7%) and values were imputed.
In comparison with the remainder of the cohort, patients with minimal-risk were less likely to have symptoms of typical angina (24.1% vs 42.6%, p < 0.001) and had lower pre-test probability of obstructive CAD as determined from the CADC risk score (30.0% vs 47.0%, p < 0.001).
Table 1 reports the observed probability of minimal risk, findings on CCTA, revascularisation and observed probability of all-cause death or non-fatal myocardial infarction within the SCOT-HEART population grouped by decile of predicted probability of no risk.
Table 1.
Test results and event rates by probability of minimal risk.
Predicted probability of no risk | N | Observed probability of no risk, SD | Findings on CCTA |
Revascularisation | Death or MI | |||
---|---|---|---|---|---|---|---|---|
Normal | Mild disease | Moderate disease | Obstructive disease | |||||
0.0–0.1 | 2 (0.1) | 0.00 (0.00) | 0 (0.0) | 0 (0.0) | 0 (0.0) | 2 (100.0) | 1 (50.0) | 0 (0.0) |
0.1–0.2 | 168 (9.5) | 0.02 (0.15) | 7 (4.2) | 27 (16.1) | 41 (24.4) | 93 (55.4) | 43 (25.6) | 17 (10.1) |
0.2–0.3 | 313 (17.7) | 0.12 (0.33) | 54 (17.3) | 72 (23.0) | 63 (20.1) | 124 (39.6) | 72 (23.0) | 10 (3.2) |
0.3–0.4 | 322 (18.3) | 0.18 (0.38) | 82 (25.5) | 79 (24.5) | 55 (17.1) | 106 (32.9) | 65 (20.2) | 10 (3.1) |
0.4–0.5 | 278 (15.8) | 0.22 (0.41) | 82 (29.5) | 68 (24.5) | 58 (20.9) | 70 (25.2) | 41 (14.7) | 8 (2.9) |
0.5–0.6 | 250 (14.2) | 0.40 (0.49) | 115 (46.0) | 63 (25.2) | 41 (16.4) | 31 (12.4) | 17 (6.8) | 6 (2.4) |
0.6–0.7 | 221 (12.5) | 0.56 (0.50) | 138 (62.4) | 37 (16.7) | 27 (12.2) | 19 (8.6) | 10 (4.5) | 3 (1.4) |
0.7–0.8 | 137 (7.8) | 0.66 (0.47) | 103 (75.2) | 15 (10.9) | 15 (10.9) | 4 (2.9) | 1 (0.7) | 2 (1.5) |
0.8–0.9 | 60 (3.4) | 0.75 (0.44) | 50 (83.3) | 7 (11.7) | 2 (3.3) | 1 (1.7) | 1 (1.7) | 0 (0.0) |
0.9–1.0 | 13 (0.7) | 1.00 (0.00) | 13 (100.0) | 0 (0.0) | 0 (0.0) | 0 (0.0) | 0 (0.0) | 0 (0.0) |
Data is presented as number (percentage) of patients unless otherwise stated.
SD, standard deviation; CCTA, coronary computed tomography angiography; MI, myocardial infarction.
Model discrimination (Supplementary Fig. 1) was greater using the PROMISE minimal-risk score compared with the CADC model (c-statistic 0.785, 95% confidence interval [95% CI] 0.762–0.808, vs 0.730, 95% CI 0.706–0.755; p < 0.001). Discrimination was unaffected by re-calibration of model intercept or slope but the c-statistic improved further following revision of the model coefficients (c-statistic 0.805, 95% CI 0.784–0.827; p < 0.001).
Goodness-of-fit was not demonstrated for either established CADC model (χ2 1010.1, HL p < 0.001) or the PROMISE minimal-risk risk tool (χ2 197.6, HL p < 0.001). Goodness-of-fit for the PROMISE minimal-risk risk tool remained suboptimal following recalibration of the model intercept (χ2 32.2, HL p < 0.001) and calibration slope (χ2 23.7, HL p = 0.026) but was improved by the further addition of re-estimated model coefficients (χ2 5.6, HL p = 0.692; Supplementary Table 2) resulting in good calibration (Fig. 1).
Fig. 1.
Model calibration.
Plot demonstrates poor calibration of predicted probability vs observed proportion of minimal risk using initial model coefficients and intercept (blue) in addition to the established Coronary Artery Disease Consortium (CADC) model (green). Calibration remains poor despite updating the model intercept (grey) and slope (purple). Good model calibration (HL, Hosmer-Lemeshow) is demonstrated once the model coefficients are re-estimated within the validation cohort (red). The dashed line represents perfect calibration [22].
4. Discussion
When applied to the SCOT-HEART population, the PROMISE minimal-risk tool improved model discrimination for excluding CAD when compared with the established CADC model. Indeed, the present c-statistic is greater than that reported in the original PROMISE model derivation (0.785 versus 0.725) [4]. The improved discrimination of the PROMISE score compared with the CADC score likely reflects the different intended purpose of these scores. The CADC model was derived with the objective of accurately estimating the pre-test probability of obstructive coronary artery disease whilst the PROMISE model adopts a broader approach of excluding the presence of any coronary atherosclerosis or any clinical events throughout follow-up.
Model calibration – i.e. the ability to estimate accurately an individuals' absolute risk – was poor when applied in the SCOT-HEART trial population. Such a finding is common when assessing model performance in clinically divergent settings [11], [12], [13], [14] and causes may be patient-related (including differences in case-mix, event rates, or predictor definitions), or model-related, such as over (or under-) fitting of coefficients or omission of important predictive variables.
Poor goodness-of-fit does not necessarily signify lack of model value, and the robust discrimination demonstrated in this report suggests appropriate and informative covariate selection. Therefore, we adopted the recommended stepwise approach to model updating [11] that successfully improved model calibration within the SCOT-HEART population. It is plausible, that the PROMISE risk model remains correct in its initial form for application within the North American context, whilst the updated coefficients may provide more accurate risk estimates for European populations. Regardless, to achieve accurate predictions of absolute risk, it is likely that such population-specific recalibration would be desirable for each setting in which the model was to be applied. However, even in the absence of this recalibration, our findings have important implications for determining appropriate diagnostic pathways and highlight that clinicians need to be aware of the imprecision of the estimates determined from the minimal-risk score. In recognition of this, the current guidelines for prevention of atherosclerotic cardiovascular disease already recommend incorporating explanation of the uncertainty of prognostic models in the clinician-patient risk discussion [15].
In order to understand the reasons for risk estimate imprecision we need to consider carefully the differences in patient populations between the PROMISE and SCOT-HEART trials. Such differences have previously been reported [16], but specific areas of relevance to the current findings warrant mention. First, the improvement in discrimination seen with the SCOT-HEART population is likely to reflect the greater breadth of baseline risk which gives rise to a broader spread of the linear predictors within the validation cohort [17]. Relatedly, there is an apparent difference in the proportion of the trial populations fulfilling the criteria for minimal-risk between the studies, with slightly more low-risk patients identified in the SCOT-HEART cohort. This ‘miscalibration-in-the-large’ is a frequent challenge but can be straightforwardly addressed, if the average patient risk is known within the external clinical setting in which the model is being applied, by adjustment of the model intercept. Furthermore, although the exact nature of the effects of this difference in case-mix or ‘spectrum bias’ is difficult to quantify, some insight can be gained from examination of the change in specific variable coefficients. With one exception, all the coefficients increased in magnitude when re-estimated. Those covariates where the increase was most substantial, for example female sex, appear to be more powerful predictors of minimal-risk in the SCOT-HEART population than was identified in PROMISE.
Ultimately, the clinical value of any prognostic score relies on a previously defined and broadly accepted threshold of risk that suitably assists clinicians in identifying those patients who can have further testing safely deferred. In this regard, the accuracy of absolute predicted risk may be of secondary importance to how reliably a model categorizes or reclassifies an individual into appropriate diagnostic pathways. In the absence of updated clinical guidelines incorporating the minimal-risk tool, it is unclear where this threshold should be set. However, it is reassuring to note that the probability of obstructive coronary artery disease remains < 10% in the highest four deciles of predicted probability of minimal risk, whilst the corresponding risk of death or myocardial infarction remains below 2% in these groups. International guidelines have previously adopted low-risk thresholds in this range, below which further investigation is unhelpful [18], [19]. Should such an approach be continued using the PROMISE model, it would identify 1 in 4 patients who could safely avoid further testing, enabling potentially important reductions in diagnostic resource use.
It should be noted that within the United Kingdom, the National Institute for Health and Care Excellence (NICE) has recently updated guidance for the diagnosis of suspected stable angina [20]. This guideline has endorsed a shift to a symptom-focused approach to chest pain assessment, moving away from the broader cardiovascular risk-based strategy previously recommended [19]. Such an approach appears to more appropriately target the use of non-invasive testing and improve clinical outcomes [21]. Whether the cardiovascular risk assessment provided by the PROMISE risk tool can offer advantages to the NICE approach remains unclear and would need to be prospectively determined.
This study has some limitations. Failure to undergo CCTA as randomised, non-diagnostic images and the absence of coronary calcium scans, precluded the determination of minimal-risk in 309 (14.9%) patients. HDL-C concentrations were not available in a fifth of patients although the MICE-PMM technique for imputation achieves minimally biased estimates and satisfactory model performance with up to 50% missingness [8]. Finally, we excluded unstable angina not leading to revascularisation as a minimal risk exclusion criterion since unlike PROMISE, unstable angina events were not adjudicated in SCOT-HEART. However, with the widespread use of high-sensitivity troponin in Scotland, unstable angina accounts for < 5% of all acute coronary syndromes.
5. Conclusion
When assessing patients with suspected stable angina, the PROMISE minimal-risk tool outperforms the CADC model and improves discrimination of the pre-test probability of normal coronary arteries and no clinical events. Suboptimal model calibration may overestimate probability of minimal risk in external populations. Nevertheless, the PROMISE minimal-risk tool may assist clinicians in decisions regarding non-invasive testing.
Author contributions
Drs Adamson and Newby had full access to all the data in the study and take responsibility for the integrity of the data and the accuracy of the data analysis.
Study concept and design: Adamson, McAllister, Newby.
Acquisition, analysis, interpretation of data: Adamson, Fordyce, McAllister, Udelson, Douglas, Newby.
Drafting of the manuscript: Adamson, Newby.
Critical revision of the manuscript: All authors.
Acknowledgments
Acknowledgments
Dr. McAllister is the recipient of a Wellcome Trust Intermediate Clinical Fellowship (201492/Z/16/Z).
Dr. Douglas reported receiving institutional research funds and serving on a data safety monitoring board for GE Healthcare and receiving institutional research grants from HeartFlow.
Dr. Udelson reported receiving consultant fees from Lantheus Medical Imaging; serving on the data safety monitoring board for Gilead and GSK; receiving other support from Abbott Laboratories; serving as an editor for Circulation Heart Failure, clinical trial committee work for Pfizer/GSK; and receiving research grants from the National Heart, Lung, and Blood Institute (R01 HL098237, R01 HL098236, R01 HL098305, R01 HL098235) and Otsuka.
Dr. Newby is the recipient of a Wellcome Trust Senior Investigator Award (WT103782AIA) and has previously received honoraria and consultancy from Toshiba Medical Systems.
Role of the funder/sponsor
The funding source had no role in the design and conduct of the study; collection, management, analysis, and interpretation of the data; preparation, review, or approval of the manuscript; and the decision to submit the manuscript for publication.
Footnotes
Supplementary data to this article can be found online at https://doi.org/10.1016/j.ijcard.2017.09.033.
Appendix A. Supplementary data
Supplementary material
References
- 1.Bosner S., Becker A., Haasenritter J. Chest pain in primary care: epidemiology and pre-work-up probabilities. Eur. J. Gen. Pract. 2009;15(3):141–146. doi: 10.3109/13814780903329528. [DOI] [PubMed] [Google Scholar]
- 2.Frese T., Mahlmeister J., Heitzer M., Sandholzer H. Chest pain in general practice: frequency, management, and results of encounter. J. Fam. Med. Prim. Care. 2016;5(1):61–66. doi: 10.4103/2249-4863.184625. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Rozanski A., Gransar H., Hayes S.W. Temporal trends in the frequency of inducible myocardial ischemia during cardiac stress testing: 1991 to 2009. J. Am. Coll. Cardiol. 2013;61(10):1054–1065. doi: 10.1016/j.jacc.2012.11.056. [DOI] [PubMed] [Google Scholar]
- 4.Fordyce C.B., Douglas P.S., Roberts R.S. Identification of patients with stable chest pain deriving minimal value from noninvasive testing: the PROMISE minimal-risk tool, a secondary analysis of a randomized clinical trial. JAMA Cardiol. 2017;2(4):400–408. doi: 10.1001/jamacardio.2016.5501. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.The SCOT-HEART investigators Role of multidetector computed tomography in the diagnosis and management of patients attending the rapid access chest pain clinic, the Scottish computed tomography of the heart (SCOT-HEART) trial: study protocol for randomized controlled trial. Trials. 2012;13:184. doi: 10.1186/1745-6215-13-184. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.The SCOT-HEART investigators CT coronary angiography in patients with suspected angina due to coronary heart disease (SCOT-HEART): an open-label, parallel-group, multicentre trial. Lancet. 2015;385(9985):2383–2391. doi: 10.1016/S0140-6736(15)60291-4. [DOI] [PubMed] [Google Scholar]
- 7.Douglas P.S., Hoffmann U., Patel M.R. Outcomes of anatomical versus functional testing for coronary artery disease. N. Engl. J. Med. 2015;372(14):1291–1300. doi: 10.1056/NEJMoa1415516. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Marshall A., Altman D.G., Holder R.L. Comparison of imputation methods for handling missing covariate data when fitting a Cox proportional hazards model: a resampling study. BMC Med. Res. Methodol. 2010;10(1):112. doi: 10.1186/1471-2288-10-112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.CAD Consortium A clinical prediction rule for the diagnosis of coronary artery disease: validation, updating, and extension. Eur. Heart J. 2011;32(11):1316–1330. doi: 10.1093/eurheartj/ehr014. [DOI] [PubMed] [Google Scholar]
- 10.Bittencourt M.S., Hulten E., Polonsky T.S. European Society of Cardiology-recommended Coronary Artery Disease Consortium pretest probability scores more accurately predict obstructive coronary disease and cardiovascular events than the Diamond and Forrester score: the partners registry. Circulation. 2016;134(3):201–211. doi: 10.1161/CIRCULATIONAHA.116.023396. [DOI] [PubMed] [Google Scholar]
- 11.Moons K.G., Kengne A.P., Grobbee D.E. Risk prediction models: II. External validation, model updating, and impact assessment. Heart. 2012;98(9):691–698. doi: 10.1136/heartjnl-2011-301247. [DOI] [PubMed] [Google Scholar]
- 12.Rana J.S., Tabada G.H., Solomon M.D. Accuracy of the atherosclerotic cardiovascular risk equation in a large contemporary, multiethnic population. J. Am. Coll. Cardiol. 2016;67(18):2118–2130. doi: 10.1016/j.jacc.2016.02.055. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Van Calster B., Van Hoorde K., Vergouwe Y. Validation and updating of risk models based on multinomial logistic regression. Diagn. Progn. Res. 2017;1(1):2. doi: 10.1186/s41512-016-0002-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.D'Agostino R.B., Sr., Grundy S. Validation of the Framingham coronary heart disease prediction scores: results of a multiple ethnic groups investigation. JAMA. 2001;286(2):180–187. doi: 10.1001/jama.286.2.180. [DOI] [PubMed] [Google Scholar]
- 15.Martin S.S., Sperling L.S., Blaha M.J. Clinician-patient risk discussion for atherosclerotic cardiovascular disease prevention: importance to implementation of the 2013 ACC/AHA guidelines. J. Am. Coll. Cardiol. 2015;65(13):1361–1368. doi: 10.1016/j.jacc.2015.01.043. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Fordyce C.B., Newby D.E., Douglas P.S. Diagnostic strategies for the evaluation of chest pain: clinical implications from SCOT-HEART and PROMISE. J. Am. Coll. Cardiol. 2016;67(7):843–852. doi: 10.1016/j.jacc.2015.11.055. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Debray T.P., Vergouwe Y., Koffijberg H. A new framework to enhance the interpretation of external validation studies of clinical prediction models. J. Clin. Epidemiol. 2015;68(3):279–289. doi: 10.1016/j.jclinepi.2014.06.018. [DOI] [PubMed] [Google Scholar]
- 18.European Society of Cardiology Task Force 2013 ESC guidelines on the management of stable coronary artery disease: the Task Force on the management of stable coronary artery disease of the European Society of Cardiology. Eur. Heart J. 2013;34(38):2949–3003. doi: 10.1093/eurheartj/eht296. [DOI] [PubMed] [Google Scholar]
- 19.National Institute for Health and Clinical Excellence . NICE; London: 2010. Chest Pain of Recent Onset: Assessment and Diagnosis of Recent Onset Chest Pain or Discomfort of Suspected Cardiac Origin. Clinical Guideline 95. [Google Scholar]
- 20.National Institute for Health and Care Excellence . NICE; London: 2016. Chest Pain of Recent Onset: Assessment and Diagnosis of Recent Onset Chest Pain or Discomfort of Suspected Cardiac Origin (Update). Clinical Guideline 95. [PubMed] [Google Scholar]
- 21.Adamson PD, Hunter A, Williams MC, et al. Diagnostic and prognostic benefits of computed tomography coronary angiography using the 2016 National Institute for Health and Care Excellence guidance within a randomised trial. Heart 2017. [DOI] [PMC free article] [PubMed]
- 22.Steyerberg E.W., Borsboom G.J., van Houwelingen H.C., Eijkemans M.J., Habbema J.D. Validation and updating of predictive logistic regression models: a study on sample size and shrinkage. Stat. Med. 2004;23(16):2567–2586. doi: 10.1002/sim.1844. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Supplementary material