Abstract
Background
Testing for COVID-19 remains limited in the United States and across the world. Poor allocation of limited testing capacity leads to misutilization of health system resources, which complementary rapid testing tools could ameliorate.
Objective
To predict SARS-CoV-2 PCR positivity based on complete blood count components and patient sex.
Study design
A retrospective case-control design was used for data collection, and a logistic regression prediction model was developed. Participants were emergency department patients > 18 years old who had concurrent complete blood counts and SARS-CoV-2 PCR testing. Model training used 33 confirmed SARS-CoV-2 PCR-positive and 357 PCR-negative patients at Stanford Health Care. Validation cohorts consisted of emergency department patients > 18 years old with concurrent complete blood counts and SARS-CoV-2 PCR testing in Northern California (41 PCR positive, 495 PCR negative), Seattle, Washington (40 PCR positive, 306 PCR negative), Chicago, Illinois (245 PCR positive, 1015 PCR negative), and South Korea (9 PCR positive, 236 PCR negative).
Results
A decision support tool that uses components of the complete blood count and patient sex to predict SARS-CoV-2 PCR positivity demonstrated a C-statistic of 78 %, an optimized sensitivity of 93 %, and generalizability to other emergency department populations. In a hypothetical scenario of 1000 patients requiring testing but testing resources limited to 60 % of patients, restricting PCR testing to predicted-positive patients would allow a 33 % increase in properly allocated resources.
Conclusions
A prediction tool based on complete blood count results can better allocate SARS-CoV-2 testing and other health care resources such as personal protective equipment during a pandemic surge.
Keywords: COVID-19, SARS-CoV-2, Rapid testing, Machine learning, Prediction tool
Health care systems worldwide are struggling to meet the demands of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) pandemic [1]. Weeks after local transmission was first recognized in the US, laboratory testing for COVID-19, the disease caused by SARS-CoV-2, was limited by assay complexity and reagent shortages. Limited testing and long turnaround times led to misutilization of resources, straining health systems in high-burden regions. While testing capacity is improving, alternative testing approaches and specimen sources are needed to handle COVID-19 surges, particularly in low-resource health systems. The aim of this study was to develop a decision support tool that integrates readily available routine laboratory values to predict negative SARS-CoV-2 results in patients presenting to the emergency department (ED). Use of this tool could reserve confirmatory SARS-CoV-2 testing and health system resources such as personal protective equipment and isolation rooms for those patients more likely to have COVID-19 (Fig. 1).
A number of studies have reported associations between certain non-SARS-CoV-2 test results and COVID-19 disease [2,3]. We therefore collected complete blood count (CBC) data from 3/1/2020 to 3/20/2020 for CBCs ordered within 24 h of an order for SARS-CoV-2 PCR [4] (based on the WHO assay), which resulted in a dataset of 33 confirmed SARS-CoV-2 PCR-positive and 357 PCR-negative ED patients at Stanford Health Care. CBCs were generally ordered concurrently for ED patients with an Emergency Severity Index (ESI) [5] of 1–3, and 50 % of ED patients with SARS-CoV-2 PCR tests had an accompanying CBC. ED patients with both CBC and SARS-CoV-2 PCR tests were therefore older (median age 59 years) than ED patients with SARS-CoV-2 PCR tests alone (median age 37 years), with a similar sex ratio and rate of SARS-CoV-2 PCR positivity. CBC orders were placed before positive SARS-CoV-2 PCR results were reported, and 83 % of patients had CBC orders placed within 4 h of the PCR order.
For training, we selected 3 CBC components, absolute neutrophil count (ANC), absolute lymphocyte count (ALC), and hematocrit (HCT), based on a univariate analysis suggesting association with positive SARS-CoV-2 PCR, evidence in the literature of an association of ANC and ALC with disease severity [2,3], and a low Pearson correlation of these features with each other. We included male sex as a feature in our model because it was associated with positive SARS-CoV-2 PCR status independent of hematocrit. In cross-validation within the training set, this manual variable selection method performed comparably to model-based variable selection methods (e.g. recursive feature elimination, L1-penalization, and L2-penalization). Using ANC, ALC, HCT, and patient sex, we trained an L2-regularized logistic regression model [6]. ANC and ALC were negative predictors, while male sex and HCT were positive predictors of SARS-CoV-2 PCR positivity. Using a receiver operating characteristic curve, we chose a test probability threshold to optimize for >50 % specificity and >80 % sensitivity. Validation sets consisted of data not seen by the model during training, drawn from emergency department patients who received SARS-CoV-2 PCR testing and a concurrent CBC.
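The modeling step described above can be made concrete with a minimal sketch: an L2-regularized logistic regression on ANC, ALC, HCT, and male sex, with an operating threshold chosen from the training ROC curve to target >80 % sensitivity and >50 % specificity. This is illustrative only; the input file, column names, feature standardization, and scikit-learn defaults are assumptions, not the authors' actual pipeline.

```python
# Illustrative sketch only (not the authors' code): L2-regularized logistic
# regression on ANC, ALC, HCT, and male sex, with an operating threshold
# chosen from the training ROC curve. File and column names are placeholders.
import pandas as pd
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_curve
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

train = pd.read_csv("training_cohort.csv")            # hypothetical file
X = train[["anc", "alc", "hct", "male_sex"]]           # features named in the text
y = train["sars_cov_2_pcr_positive"]                   # 1 = PCR positive, 0 = negative

# Standardization is an assumption; scikit-learn's LogisticRegression applies
# an L2 penalty by default, written out here to make the regularization explicit.
model = make_pipeline(StandardScaler(),
                      LogisticRegression(penalty="l2", max_iter=1000))
model.fit(X, y)

# Operating threshold: the highest probability cutoff on the training ROC curve
# that still yields >80 % sensitivity and >50 % specificity.
probs = model.predict_proba(X)[:, 1]
fpr, tpr, thresholds = roc_curve(y, probs)
ok = (tpr > 0.80) & ((1 - fpr) > 0.50)
operating_threshold = thresholds[ok].max() if ok.any() else 0.5
```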
In testing on a validation set of patients presenting to Stanford Health Care from 3/21/2020 to 4/7/2020 (confirmed SARS-CoV-2 PCR positive, n = 41; negative, n = 495), the decision support tool showed a diagnostic C-statistic of 0.78 (Fig. 2A). We examined the tradeoff between negative predictive value (NPV) and specificity using our model and found that high NPV was maintained across a range of specificities, with a specificity-weighted average NPV of 98 %. Using the operating threshold defined on the training set, the model accurately ruled out SARS-CoV-2 in 40 % of total test patients, with an NPV of 99 % and sensitivity of 93 %. Positive results on a seasonal virus rapid respiratory panel (RRP) PCR have been used by health systems to avoid the need for COVID-19 testing. We therefore trained an L2-regularized logistic regression model that included the CBC features as well as RRP results as predictors. The predictive performance of this RRP model was indistinguishable from that of the CBC-only model (data not shown). Consistent with previous reports [7], these results suggest that the RRP does not add predictive value beyond the CBC.
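As a companion to the sketch above, the NPV/specificity tradeoff on a held-out validation set can be tabulated directly from the predicted probabilities. The "specificity-weighted average NPV" computed at the end is one plausible reading of that summary statistic (NPV at each candidate threshold, averaged with the corresponding specificity as the weight); the authors' exact definition is not restated here, and the validation-cohort variable names are placeholders.

```python
# Illustrative sketch: NPV and specificity at every candidate threshold on a
# held-out validation set, plus a specificity-weighted average NPV (one
# plausible interpretation of the summary statistic named in the text).
import numpy as np
from sklearn.metrics import roc_auc_score, roc_curve

def npv_specificity_curve(y_true, probs):
    """Return (threshold, specificity, NPV) rows for each ROC threshold,
    where patients with predicted probability below the threshold are
    treated as ruled out (predicted negative)."""
    y_true = np.asarray(y_true)
    probs = np.asarray(probs)
    _, _, thresholds = roc_curve(y_true, probs)
    rows = []
    for t in thresholds:
        pred_neg = probs < t
        tn = np.sum(pred_neg & (y_true == 0))
        fn = np.sum(pred_neg & (y_true == 1))
        spec = tn / max(np.sum(y_true == 0), 1)
        npv = tn / max(tn + fn, 1)
        rows.append((t, spec, npv))
    return np.array(rows)

# Hypothetical usage with a validation cohort:
# probs_val = model.predict_proba(X_val)[:, 1]
# c_statistic = roc_auc_score(y_val, probs_val)          # diagnostic C-statistic
# curve = npv_specificity_curve(y_val, probs_val)
# weighted_npv = np.average(curve[:, 2], weights=curve[:, 1])
```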
We note that these models were trained on our local population and could be sensitive to the choice of patients for whom our ED providers ordered CBCs, in addition to other institutional factors affecting CBC and SARS-CoV-2 testing. We therefore sought to test whether the predictive value of these CBC and sex features generalizes beyond our local population. Our model, trained on a Northern California population, performed similarly in a Seattle, Washington-based patient cohort, with a diagnostic C-statistic of 0.75 and a specificity-weighted average NPV of 95 %. Applying our Northern California-defined operating threshold, the model accurately ruled out SARS-CoV-2 in 31 % of total patients, with an NPV of 96 % and sensitivity of 88 % (Fig. 2B). In a Chicago, Illinois-based patient cohort, the model demonstrated a C-statistic of 0.75 and a specificity-weighted average NPV of 90 %. Applying our Northern California-defined operating threshold, the model accurately ruled out SARS-CoV-2 in 39 % of total patients, with an NPV of 92 % and sensitivity of 85 % (Fig. 2C). The model also performed similarly in a South Korea-based patient cohort, with a diagnostic C-statistic of 0.81 and a specificity-weighted average NPV of 99 %. Applying our Northern California-defined operating threshold, the model accurately ruled out SARS-CoV-2 in 53 % of patients in this South Korean population, with an NPV of 99 % and sensitivity of 89 % (Fig. 2D).
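External validation with a pre-specified operating point, as applied above to the Seattle, Chicago, and South Korea cohorts, amounts to holding the Northern California-derived threshold fixed and reporting the fraction of patients ruled out, the NPV, and the sensitivity. A minimal sketch follows; the cohort variable names are hypothetical.

```python
# Illustrative sketch: apply a threshold fixed in advance (from the training
# population) to an external cohort and summarize rule-out performance.
import numpy as np

def apply_fixed_threshold(y_true, probs, threshold):
    """Fraction ruled out, NPV, and sensitivity when patients with predicted
    probability below the threshold are ruled out (predicted PCR negative)."""
    y_true = np.asarray(y_true)
    probs = np.asarray(probs)
    ruled_out = probs < threshold
    tn = np.sum(ruled_out & (y_true == 0))
    fn = np.sum(ruled_out & (y_true == 1))
    tp = np.sum(~ruled_out & (y_true == 1))
    return {
        "fraction_ruled_out": float(ruled_out.mean()),
        "npv": tn / max(tn + fn, 1),
        "sensitivity": tp / max(tp + fn, 1),
    }

# Hypothetical usage for one external cohort:
# summary = apply_fixed_threshold(
#     y_seattle, model.predict_proba(X_seattle)[:, 1], operating_threshold)
```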
Our results demonstrate that a decision support tool based on CBC component results has high negative predictive value for SARS-CoV-2 PCR results in diverse patient populations from the Western and Midwestern US and South Korea. In a setting of limited testing in which only 60 % of 1000 patients entering emergency departments could be tested, our tool would result in a 33 % better allocation of testing and other hospital resources (Fig. 1B). As a proof of concept, we made the CBC model available as a simple web-based tool, where individuals can enter ANC, ALC, HCT, and sex and receive a prediction of whether SARS-CoV-2 testing is expected to be negative (http://web.stanford.edu/~gscott2/cgi-bin/CovidTool/). Given that this predictive model trades specificity for high sensitivity, cost-benefit analyses that include resource scarcity, model operating point, and model performance as factors should be performed prior to implementation in a given local setting. Our predictive model and others like it should be useful for COVID-19 and future pandemics as a stopgap early in a pandemic and during surges when testing is limited. Integration of this tool can aid in optimizing utilization of health care resources, reducing the turnaround time of viral testing, and improving patient care.
Declaration of Competing Interest
The authors declare no competing interests.
Acknowledgements
We thank Ethan Steinberg for review of data curation and the prediction model. Our study was supported by the Stanford University Department of Pathology.
Footnotes
Supplementary material related to this article can be found in the online version at https://doi.org/10.1016/j.jcv.2020.104502.
References
- 1. Emanuel E.J. Fair allocation of scarce medical resources in the time of Covid-19. N. Engl. J. Med. 2020. doi: 10.1056/NEJMsb2005114.
- 2. Guan W. Clinical characteristics of coronavirus disease 2019 in China. N. Engl. J. Med. 2020. doi: 10.1056/NEJMoa2002032.
- 3. Chen N. Epidemiological and clinical characteristics of 99 cases of 2019 novel coronavirus pneumonia in Wuhan, China: a descriptive study. Lancet. 2020;395:507–513. doi: 10.1016/S0140-6736(20)30211-7.
- 4. Hogan C.A., Sahoo M.K., Pinsky B.A. Sample pooling as a strategy to detect community transmission of SARS-CoV-2. JAMA. 2020. doi: 10.1001/jama.2020.5445.
- 5. Eitel D.R., Travers D.A., Rosenau A.M., Gilboy N., Wuerz R.C. The emergency severity index triage algorithm version 2 is reliable and valid. Acad. Emerg. Med. 2003;10:1070–1080. doi: 10.1111/j.1553-2712.2003.tb00577.x.
- 6. Hastie T., Tibshirani R., Friedman J. The Elements of Statistical Learning: Data Mining, Inference, and Prediction. 2nd ed. Springer; 2016.
- 7. Kim D., Quinn J., Pinsky B., Shah N.H., Brown I. Rates of co-infection between SARS-CoV-2 and other respiratory pathogens. JAMA. 2020. doi: 10.1001/jama.2020.6266.