Abstract
Background.
We sought to develop an automatable score to predict hospitalization, critical illness, or death in patients at risk for COVID-19 presenting for urgent care during the Massachusetts outbreak.
Methods.
Single-center study of adult outpatients seen in respiratory illness clinics (RICs) or the emergency department (ED), including development (n = 9381, March 7-May 2) and prospective (n = 2205, May 3–14) cohorts. Data was queried from Partners Enterprise Data Warehouse. Outcomes were hospitalization, critical illness or death within 7 days. We developed the COVID-19 Acuity Score (CoVA) using automatically extracted data from the electronic medical record and learning-to-rank ordinal logistic regression modeling. Calibration was assessed using predicted-to-observed ratio (E/O). Discrimination was assessed by C-statistics (AUC).
Results.
In the development cohort, 27.3%, 7.2%, and 1.1% of patients experienced hospitalization, critical illness, or death, respectively; and in the prospective cohort, 26.1%, 6.3%, and 0.5%. CoVA showed excellent performance in the development cohort (concurrent validation) for hospitalization (E/O: 1.00, AUC: 0.80); for critical illness (E/O: 1.00, AUC: 0.82); and for death (E/O: 1.00, AUC: 0.87). Performance in the prospective cohort (prospective validation) was similar for hospitalization (E/O: 1.01, AUC: 0.76); for critical illness (E/O 1.03, AUC: 0.79); and for death (E/O: 1.63, AUC=0.93). Among 30 predictors, the top five were age, diastolic blood pressure, blood oxygen saturation, COVID-19 testing status, and respiratory rate.
Conclusions.
CoVA is a prospectively validated automatable score to assessing risk for adverse outcomes related to COVID-19 infection in the outpatient setting.
Keywords: COVID-19, Risk Prediction, Outpatient, Interpretable machine learning
INTRODUCTION
The COVID-19 pandemic has presented unparalleled challenges for healthcare systems around the world[1–8]. The SARS-CoV-2 virus first appeared in Wuhan, China in December 2019. The first US case was confirmed on January 20[9], followed by exponential spread within the US[10]. By the end of April, Massachusetts was the third hardest hit state, trailing only New York and New Jersey[11]. Within Massachusetts, Boston and Chelsea were epicenters for the spread of COVID-19.
In anticipation of the surge of COVID-19 patients and to help limit viral spread, Massachusetts General Hospital (MGH) closed most outpatient and urgent care clinics and set up new Respiratory Illness Clinics (RICs) on March 7, 2020 to manage patients with symptoms of a respiratory infection. These clinics were staffed by clinicians and nurses reassigned from other areas, most with little urgent care experience. COVID-19 suspected cases were also screened in the Emergency Department (ED). Due to high case volumes and precautions for staff and patients, visit duration and extent of clinical assessments for most patients were curtailed. In addition to limited clinical assessment, triage decisions were complicated by COVID-19’s biphasic clinical course. Patients who initially present with mild symptoms often later return for hospital admission, and many subsequently suffer adverse events including ICU transfer, mechanical ventilation (MV) or death. Various prediction rules have been proposed, but to our knowledge few have been prospectively validated, and most were developed for the inpatient setting rather than outpatient screening.
To help frontline clinicians appropriately triage and plan follow-up care for patients presenting for COVID-19 screening and not requiring immediate hospitalization, we developed an outpatient screening score, COVID-19 Acuity Score (CoVA), that could be incorporated into an electronic medical record (EMR) system. We used data from MGH’s newly formed RIC clinics and the emergency department. CoVA assigns acuity levels based on demographic, clinical, radiographic, and medical history variables, and provides predicted probabilities for hospital admission, ICU admission or MV, or death within 7 days.
METHODS
Study Population
We included data from patients seen in MGH RIC clinics and ED between March 7 and May 14. Patients were divided into two mutually exclusive cohorts, a development cohort (March 7 to May 2, 2020, n = 9381); and prospective cohort (May 3 to May 14, 2020, n = 2205).
Inclusion criteria were: (1) MGH RIC or ED visit between March 7th and May 14th, 2020; (2) age ≥ 18 years; and (3) reason for visit was cough, fever, shortness of breath, COVID-related, or potentially related (see Supplemental Table S6). Exclusion criteria were: (1) Patients seen solely for a nasopharyngeal PCR swab without a clinical assessment were excluded; and (2) Patients with visits in both development and prospective cohorts are excluded, to ensure the score is not overfit to those patients. A data flowchart is provided in Supplemental Figure S1. Note that one patient contributes only once in either cohort. In the development cohort, we randomly sampled one visit from each patient. In the prospective cohort, we chose the most recent visit from each patient.
Primary Outcome
The primary outcome was the occurrence of an adverse event within 7 days following an outpatient medical encounter, including either hospitalization at MGH, critical illness (defined as ICU care and/or mechanical ventilation), or death. The prediction horizon was set to 7 days because this period was considered meaningful by our frontline teams for clinical decision-making (e.g., regarding whether to send a patient home or to the emergency department, or to follow up with phone calls), and because empirically, within the model development cohort most adverse events occurred within 7 days of initial presentation.
Predictors
We selected 98 variables that were routinely available in the outpatient setting during the COVID-19 pandemic to serve as candidate predictors. These included demographic variables: age, gender, tobacco use history, most recent body mass index (BMI) within the preceding month (represented as binary variables designated as high BMI, >35kg/m2, and low BMI, <18.5kg/m2), the most recent vital signs (blood pressure, respiratory rate, heart rate, temperature, blood oxygen saturation level (SpO2)) within the preceding 3 days; COVID testing status (See Supplemental Section “Predictor encoding” for encoding details); specific symptoms suggestive of COVID-19 (anosmia, dysgeusia based on ICD-10 codes), and pre-existing medical diagnoses, coded as present or absent based on groups of billing codes (ICD-10 codes; Supplemental Table S3) in the EMR. To be coded as present, the diagnostic code had to be recorded on or before the day of presentation. The Weighted Charlson Comorbidity Index (CCI) was computed based on groups of ICDs using the “wscore” from the R package “comorbidity”[14]. We omitted race and ethnicity as predictors because (1) we have found that these variables were often unavailable or inaccurately recorded, especially ethnicity; and (2) we wanted to limit the influence of local demographic patterns on the score, which might limit the ability to generalize to external populations.
For patients who underwent chest X-ray (CXR) imaging during these encounters, we identified groups of common findings based on radiology reports that tend to indicate adverse events in patients with COVID-19. These groups were identified in two steps. First, we manually reviewed 50 CXR reports and liberally extracted key words, phrases, and word patterns describing abnormal findings. Next, a pulmonary and critical care medicine physician (LB) categorized these phrases into groups. Five groups were identified: multifocal, patterns typical for COVID-19 (pneumonia, bronchopneumonia, Acute Respiratory Distress Syndrome (ARDS)), patchy consolidation, peripheral or interstitial opacity, or hazy or airspace opacities. The phrases and groupings are shown in Supplemental Table S4. CXR predictors were coded as present, not present or unavailable.
Data Preprocessing and Selection of Predictor Variables
We treated predictors outside of physiologically plausible ranges as unavailable (see Supplemental Table S5). Unavailable values were imputed using K-nearest neighbors (KNN)[15], where the value of K was determined by minimizing the imputation error on a subset without unavailable data by randomly masking variables according to the pattern of unavailability in the overall data. Note that we did not impute CXR predictors or COVID status for patients who did not have either of them available. Instead, for these we coded unavailability explicitly (described above).
Predictors were standardized to zero mean and unit standard deviation using z-score transformation (Supplemental Table S1). Predictors were selected for inclusion in the CoVA model in two stages. First, we used ANOVA statistics to identify predictors associated with hospitalization, ICU care or MV, or death. At 0.05 significance level, we identified 65 predictors to carry forward into the model fitting procedure (p-values shown in Supplemental Table S7). Second, we used least absolute shrinkage and selection (LASSO) regression during the model fitting procedure to select a reduced subset of highly predictive variables.
Model Development
We assigned an ordinal scale to adverse events, including no event, hospitalized, ICU care and/or MV, and death. We used an implementation of ordinal regression: pairwise learning to rank (LTR) with LASSO penalized logistic regression[16,17]. Training involves learning to predict which of a pair of patients will have a worse outcome. On biological grounds, and to address co-linearity among predictors, we constrained the model optimization to allow only non-negative coefficients for CXR predictors, medical comorbidities, CCI, and history of current or past tobacco use; and unconstrained for other predictors. Model training and preliminary evaluation of model performance was performed using the development cohort, using nested 5-fold cross validation (CV) (Supplemental Materials and Supplementary Figure S3). The final model provides acuity scores between 0 to 100 and predicted probabilities for hospitalization and for critical illness or death within 7 days. The final model was tested on the prospective validation cohort. Performance is reported both for the development cohort (results from cross validation), and for the prospective cohort.
Statistical Analysis of CoVA Predictive Performance
We summarized the distribution of cohort characteristics and adverse events using event counts and proportions for categorical predictors, and mean and standard deviation for continuous variables. For model calibration, an expected-to-observed ratio (E/O)[18] of 1.0 indicates that the number of expected events equals the number of observed events; and calibration slope (CS) is defined as the linear correlation between the observed O and expected probabilities E, where the expected (predicted) values (E) are binned into quintiles.
We calculated the area under the Receiver Operator Characteristic (ROC) curve (AUC, also called the C-statistic) to quantify how well CoVA scores discriminate between individuals who within 7 days were vs. not hospitalized, and between those who did vs. not experience critical illness (ICU care, MV, or death). We considered an AUC between 0.50 and 0.55 to be poor; 0.55 and 0.65, moderate; 0.65 and 0.75, acceptable; and >0.75, excellent. We also calculated specificity, positive predictive value (PPV), and negative predictive values (NPV) at the 90% sensitivity level.
We also looked at the time course of adverse events as a function of CoVA score over 4 weeks (28 days) following initial presentation to the RIC or ED to understand the properties of CoVA score.
RESULTS
Cohort Characteristics
From March 7 to May 2, 2020, 9381 patients met inclusion criteria and were included in the development cohort. The average age was 51 years old with 49% being female. Among these, 3344 (35.6%) had adverse events within 7 days of presentation: 2562 (27.3%) were hospitalized, 679 (7.2%) received ICU care and/or were mechanically ventilated, and 103 (1.1%) died.
From May 3–14, 2205 additional patients met inclusion criteria and were included in the prospective cohort. The average age was 53 years old with 49% being female. Among these, 726 (32.9%) had adverse events: 575 (26.1%) were hospitalized, 139 (6.3%) received ICU care and/or were mechanically ventilated, and 12 (0.5%) died.
Cohort characteristics are summarized in Table 1. Compared to the model development period, during the prospective validation period there were modest increases in the proportion of patients with CXRs (prospective 52.3% vs. development 41.0%), the proportion of outpatient evaluations performed in RIC clinics (prospective 26.5% vs. development 21.4%), and in testing rates for COVID-19 (prospective 78.8% vs. development 59.5%). Several other small but likely clinically insignificant differences between cohorts also reached statistical significance, due to the large cohort sizes. COVID-19 infections and clinical adverse events by age and decade of life are shown in Supplemental Figure S2.
Table 1. Characteristics of Study Participants.
Development cohort | Prospective cohort | p-value | |
---|---|---|---|
# Patient | 9381 | 2205 | -- |
RIC: n (%) | 2013 (21.5%) | 584 (26.5%) | <0.01** |
ED: n (%) | 7368 (78.5%) | 1621 (73.5%) | <0.01** |
COVID+ | |||
Yes: n (%) | 1404 (15.0%) | 243 (11.0%) | <0.01** |
No: n (%) | 4178 (44.5%) | 1494 (67.8%) | <0.01** |
Untested or Unknown: n (%) | 3799 (40.5%) | 468 (21.2%) | <0.01** |
Outcome in 7 days | |||
Hospitalization: n (%) | 2562 (27.3%) | 575 (26.1%) | 0.241 |
ICU or Intubation: n (%) | 679 (7.2%) | 139 (6.3%) | 0.123 |
Death: n (%) | 103 (1.1%) | 12 (0.5%) | 0.018* |
Age, year: Mean (SD) | 51.1 (19.2) | 52.6 (18.6) | <0.01** |
Female: n (%) | 4587 (48.9%) | 1087 (49.3%) | 0.735 |
BMI, kg/m2: Mean (SD) | 28.4 (7.0) | 28.4 (6.8) | 0.792 |
Race | |||
Asian: n (%) | 373 (4.0%) | 79 (3.6%) | 0.391 |
Black: n (%) | 1023 (10.9%) | 218 (9.9%) | 0.164 |
Pacific Islander: n (%) | 15 (0.2%) | 4 (0.2%) | 0.822 |
Native American: n (%) | 6 (0.1%) | 1 (0.0%) | 0.749 |
White: n (%) | 5673 (60.5%) | 1434 (65.0%) | <0.01** |
Other or Unknown: n (%) | 2291 (24.4%) | 469 (21.3%) | <0.01** |
Hispanic Ethnicity: n (%) | 1879 (20.0%) | 377 (17.1%) | <0.01** |
Smoking | |||
Yes / Quit: n (%) | 3190 (34.0%) | 834 (37.8%) | <0.01** |
Never / Passive: n (%) | 5245 (55.9%) | 1165 (52.8%) | <0.01** |
Not Asked / Unknown: n (%) | 946 (10.1%) | 206 (9.3%) | 0.295 |
Weighted Charlson Score: Mean (SD) | 1.5 (2.4) | 1.6 (2.6) | 0.031* |
SpO2, %: Mean (SD) | 97.3 (2.3) | 97.3 (2.2) | 0.152 |
Chest X-ray (percentage is among available patients except the first one) | |||
Chest X-ray available: n (%) | 3851 (41.1%) | 1154 (52.3%) | <0.01** |
Multifocal: n (%) | 1214 (31.5%) | 242 (21.0%) | <0.01** |
Typical pattern for COVID-19: n (%) | 548 (14.2%) | 119 (10.3%) | <0.01** |
Patchy consolidation: n (%) | 739 (19.2%) | 245 (21.2%) | 0.126 |
Peripheral/interstitial opacity: n (%) | 72 (1.9%) | 8 (0.7%) | <0.01** |
Hazy or airspace opacity: n (%) | 480 (12.5%) | 101 (8.8%) | <0.01** |
Predictive Performance
CoVA showed excellent calibration and discrimination in the development cohort for hospitalization (E/O: 1.00 [0.98, 1.02], CS: 0.99 [0.98, 0.99], AUC: 0.80 [0.79, 0.81]); for critical illness (E/O: 1.00 [0.93, 1.06], CS: 0.98 [0.96, 0.99], AUC: 0.82 [0.80, 0.83]); and for death (E/O: 1.00 [0.84, 1.21], CS: not calculated due to small sample size; AUC: 0.87 [0.83, 0.91]). Performance generalized to the prospective validation cohort, with similar results for hospitalization (E/O: 1.01 [0.96, 1.07], CS: 0.99 [0.98, 1.00], AUC 0.76 [ 0.73, 0.78]); for critical illness (E/O: 1.03 [0.89, 1.20], CS: 0.98 [0.94, 1.00], AUC: 0.79 [0.75, 0.82]); and for death (E/O: 1.63 [1.03, 3.25], CS: not calculated due to small sample size; AUC: 0.93 [0.86, 0.98]). Additional performance metrics are reported in Table 2.
Table 2. Prediction performance.
Adverse event | Hospitalization, ICU, Intubation, or death | ICU, Intubation, or death | Death |
---|---|---|---|
Concurrent validation (based on the development cohort but cross-validated) | |||
Number of patients (n, %) | 3344 (35.6%) | 782 (8.3%) | 103 (1.1%) |
Calibration (E/O) | 1.00 [0.98, 1.02] | 1.00 [0.93, 1.06] | 1.00 [0.84, 1.21] |
Calibration slope (CS) | 0.99 [0.98, 0.99] | 0.97 [0.96, 0.99] | n.c |
AUC | 0.80 [0.80, 0.81] | 0.82 [0.80, 0.83] | 0.87 [0.83, 0.91] |
Spec. at 90% Sens. | 0.56 [0.53, 0.59] | 0.44 [0.41, 0.48] | 0.35 [0.24, 0.45] |
PPV at 90% Sens. | 0.47 [0.45, 0.49] | 0.16 [0.14, 0.17] | 0.03 [0.02, 0.04] |
NPV at 90% Sens. | 0.89 [0.88, 0.90] | 0.98 [0.98, 0.99] | 1.00 [1.00, 1.00] |
Prospective validation | |||
Number of patients (n, %) | 726 (32.9%) | 151 (6.8%) | 12 (0.54%) |
Calibration (E/O) | 1.01 [0.96, 1.07] | 1.03 [0.89, 1.20] | 1.63 [1.03, 3.25] |
Calibration slope (CS) | 0.99 [0.98, 1.00] | 0.98 [0.96, 0.99] | n.c |
AUC | 0.76 [0.73, 0.78] | 0.79 [0.75, 0.82] | 0.93 [0.86, 0.98] |
Spec. at 90% Sens. | 0.66 [0.62, 0.70] | 0.53 [0.43, 0.63] | 0.29 [0.01, 0.31] |
PPV at 90% Sens. | 0.40 [0.38, 0.43] | 0.11 [0.09, 0.14] | 0.017 [0.01, 0.22] |
NPV at 90% Sens. | 0.87 [0.86, 0.89] | 0.98 [0.98, 0.99] | 1.00 [1.00, 1.00] |
+ 95% confidence interval
PPV = positive predictive value, NPV = negative predictive value, Spec = specificity, Sens = sensitivity, ICU = intensive care unit, E/O = ratio of Expected to number of Observed adverse events, n.c. = not calculated due to small sample size.
Properties of COVA Score
The proportion of patients with adverse events at 7 days increases with higher CoVA scores, rising from 18% with CoVA scores in the 0–20 range, to 88% for those with scores in the 80–100 range. The proportion underwent critical illness or death also rises, from 2% for scores between 0–20, to 32% with scores between 80–100 (Figure 1A).
We investigated the time course of adverse events as a function of CoVA score over 4 weeks (28 days) following initial presentation to the RIC or ED. We limited this investigation to the development cohort, because 28 days have not passed for the prospective validation cohort at the time of writing. By 28 days, 3399 (36%) of patients experienced hospitalization, critical illness, or death. Of these, 3214 (95%) occurred within 1 day, and 3344 (98%) had occurred by 7 days. Critical illness or death occurred in 861 (9%) of patients within 28 days of presentation. Of these, 537 (62%) occurred within 1 day, and 782 (91%) occurred within 7 days. These numbers support our choice of 7 days as a clinically meaningful event prediction horizon. Curves for the cumulative incidence of adverse events over time for different levels of CoVA scores are shown in Figure 1B.
Predictors of Adverse Events
Thirty predictors were selected by the data-driven model training procedure (Table 3). All but four predictors increase the predicted probability of adverse events when present. SpO2, diastolic and systolic blood pressure and low BMI inversely correlated with the probability of adverse events. Predictors from CXR reports included in the model were multifocal patterns (diffuse opacities, ground glass) and patterns typical for COVID.
Table 3.
ID | Predictor | coefficient | ID | Predictor | coefficient |
---|---|---|---|---|---|
1 | Age | 0.7353 | 16 | Renal cancer* | 0.0681 |
2 | Diastolic blood pressure | −0.4724 | 17 | Pancreatitis* | 0.0608 |
3 | SpO2 | −0.3776 | 18 | Cystic fibrosis* | 0.0492 |
4 | Ever COVID+ up to event | 0.2750 | 19 | Cardiac Arrest* | 0.0491 |
5 | Respiratory Rate | 0.2746 | 20 | Seizure disorder | 0.0437 |
6 | Acute ischemic stroke* | 0.1746 | 21 | Amyolateral sclerosis* | 0.0405 |
7 | CXR: Multifocal | 0.1293 | 22 | Metabolic acidosis* | 0.0385 |
8 | Heart rate | 0.1215 | 23 | Myasthenia gravis* | 0.0374 |
9 | Body temperature | 0.1206 | 24 | Pneumothorax* | 0.0300 |
10 | Systolic blood pressure | −0.1151 | 25 | Spinal muscular atrophy* | 0.0241 |
11 | Weighted Charlson Score | 0.1142 | 26 | Pericarditis* | 0.0144 |
12 | Intracranial hemorrhage* | 0.1087 | 27 | High BMI (>35kg/m2) | 0.0028 |
13 | Subarachnoid hemorrhage* | 0.0919 | 28 | CXR: Typical for COVID | 0.0001 |
14 | Male sex | 0.0808 | 29 | Low BMI (<18.5kg/m2) | −0.0001 |
15 | Hematologic malignancy* | 0.0765 | 30 | ARDS* | 0.0001 |
Abbreviations: CXR = chest X-ray; ARDS = Acute Respiratory Distress Syndrome; BMI = Body Mass Index. Diagnoses are based on past medical history, and are coded as present (e.g. Pneumothorax = 1) if recorded in the electronic medical record at any time before the date of presentation for COVID-19 screening. Disorders marked by ‘*’ refer to pre-existing conditions documented in the electronic medical record.
Effect of Chest X-Ray Findings and COVID Status on the Probability of Adverse Events
CXRs and testing for COVID-19 were not universal, and testing rates evolved over time. One potential use of CoVA is to help determine whether to perform these tests. We therefore investigated the impact of CXR findings on the predicted probability of critical illness or death. As shown in Supplemental Figure S4, positive CXR findings are most informative when the pre-CXR probability is 30% (critical illness or death), in which case they increase the predicted probability of an adverse event by 4%. Negative CXR findings are maximally informative when the pretest probability is near 34% (critical illness or death), in which case they decrease the probability by 4%.
We also examined the effect of COVID-19 testing results on the predicted probability of adverse events (critical illness or death). The largest effect of a positive test result occurs when pretest probability is 28%, in which case the post-test probability increases by 8%. A negative result has the largest impact when pretest probability is 36%, in which case the posttest probability decreases by 8%.
DISCUSSION
Identifying outpatients presenting for COVID-19 screening at high risk or low risk for adverse events is important for medical decisions regarding testing, hospitalization, and follow-up. We developed and prospectively validated the COVID-19 Acuity Score (CoVA) as an automatable outpatient screening score that can be implemented in the EMR. The model exhibits excellent calibration, discrimination, and negative predictive value both in concurrent validation (n = 9381, March 7-May 2) and in large-scale prospective validation (n = 2205, May 3–14). While several COVID-19 risk prediction models have been proposed for the inpatient setting, CoVA fills an unmet need for a prospectively validated risk designed for outpatient screening.
Several predictors selected into the model have been identified in prior studies, including advanced age[4,5,9,19]; pre-existing pulmonary[4,5], kidney[4], and cardiovascular disease[4,9,20]; obesity[21]; and increased respiratory or heart rate or hypoxia[4]. We found that other pre-existing medical conditions also increased risk for adverse outcomes, including hematologic malignancy, cancer and pancreatitis. Hypertension and diabetes mellitus did not emerge as predictors in the CoVA model, despite being identified in prior studies[4, 5, 9]. These comorbidities are correlated with outcomes in univariate analysis (Supplemental Table S7) and correlation analysis shows that both are strongly associated with older age, higher CCI, high BMI, and other comorbidities already selected as predictors, helping to explain their not being selected into the model (Supplemental Figure S5). HIV/AIDS was not significantly associated with the outcome, but this may have been due to low numbers (85/9381<1%, less than 1% of developmental cohort).
Several studies have documented neurological manifestation of COVID-19[22–24]. Nevertheless, we were surprised to find a variety of neurological diseases surfaced as robust predictors of adverse outcomes in COVID-19 infection, including ischemic stroke, intracranial hemorrhage, subarachnoid hemorrhage, epilepsy, amyotrophic lateral sclerosis, myasthenia gravis, and spinal muscular atrophy. It is unclear if these neurological diseases are merely markers of health or if the worsened outcomes are due to the interaction of COVID-19 with neurological disorders that amplifies the pathology.
Prior work on predicting outcomes in COVID-19 patients is summarized in Supplemental Table S2. Like our study, most have attempted to predict critical illness or death. However, most are based on small cohorts (median: n = 189, range n = 26 to 577); focus on inpatients; and utilize laboratory values, which were rarely available for our outpatient cohort. Only 3 included prospective or external validation. By contrast, CoVA is designed for the outpatient setting. In this setting, the availability of COVID-19 test results are variably available (for 60% in the development and 80% in the prospective cohort) and other laboratory results were rarely available. To ensure generalizability, we used a large development cohort of 9381 patients, and trained the model using a rigorous approach. We ensured clinical interpretability by utilizing a linear model with positivity constraints on predictors expected to increase risk. Finally, we validated our model on a large (n = 2205) prospectively collected patient cohort, providing an unbiased assessment of model generalizability.
Strengths of this study include its large sample size, careful EMR phenotyping, rigorous statistical approach, and the feature that all variables required by the model are available and automatically extractable within most electronic health record systems. The study also has limitations. First, our study is from a single center, with patient demographics specific to MGH, Boston. Nevertheless, important biological parameters in the predictive model are universal and increase the likelihood that the model will generalize. Second, although RIC clinics were established for patients with suspicion for COVID-19, patients seen in the ED were seen for a variety of reasons. Nevertheless COVID-19 was and for now remains a universal concern for patients seen in the ED, and we excluded patients prior to the onset of COVID-19 in Boston (March 7, 2020), therefore our model is relevant for screening during times of high alert for COVID-19. Finally, we did not include lab test values, since they were typically not available. Among the few studies of inpatients, inclusion of lab tests appears to improve model performance.
In conclusion, the COVID-19 Acuity Score (CoVA) represents a well-calibrated, discriminative, prospectively validated, and interpretable score for assessing the risk for adverse events for outpatients presenting with possible COVID-19 infection.
Supplementary Material
Acknowledgments.
The authors thank all of the physicians and nurses of Massachusetts General Hospital for their exemplary patient care. The authors also gratefully acknowledge all of the physicians of the Respiratory Infection Clinics in the MassGeneral Brigham hospital system, and the MGH Emergency Departments for their constructive feedback on this work.
Financial support. HS and ML were supported by a Developmental Award from the Harvard University Center for AIDS Research (HU CFAR NIH/NIAID fund 5P30AI060354-16). MBW was supported by the Glenn Foundation for Medical Research and American Federation for Aging Research through a Breakthroughs in Gerontology Grant; the American Academy of Sleep Medicine through an AASM Foundation Strategic Research Award; the Department of Defense through a subcontract from Moberg ICU Solutions, Inc, and by grants from the NIH (1R01NS102190, 1R01NS102574, 1R01NS107291, 1RF1AG064312). Dr. Westover is co-founder of Beacon Biosignals. S.M. was supported by the National Institute of Mental Health at the National Institutes of Health (K23MH115812) and the Harvard University Eleanor and Miles Shore Fellowship Program. JBM was supported for this work by the MGH Division of Clinical Research. BF was supported by the BRAIN Initiative Cell Census Network grant U01MH117023, the NIBIB (P41EB015896, 1R01EB023281, R01EB006758, R21EB018907, R01EB019956), NIA (1R56AG064027, 1R01AG064027, 5R01AG008122, R01AG016495), NIHM and NIMDKD (1-R21-DK-108277-01), NINDS (R01NS0525851, R21NS072652, R01NS070963, R01NS083534, 5U01NS086625,5U24NS10059103, R01NS105820), and Shared Instrumentation Grants 1S10RR023401, 1S10RR019307, and 1S10RR023043. BF received additional support from the NIH Blueprint for Neuroscience Research (5U01-MH093765), part of the Human Connectome Project. In addition, BF has a financial interest in CorticoMetrics, a company whose medical pursuits focus on brain imaging and measurement technologies. BF’s interests were reviewed and are managed by Massachusetts General Hospital and Partners HealthCare in accordance with their conflict of interest policies.
Footnotes
Potential conflicts of interest. All authors report no potential conflicts of interest.
REFERENCES
- 1.S R, Js H, M N, et al. Presenting Characteristics, Comorbidities, and Outcomes Among 5700 Patients Hospitalized With COVID-19 in the New York City Area. JAMA, 2020. Available at: https://pubmed.ncbi.nlm.nih.gov/32320003/?from_term=covid+new+york&from_pos=1. Accessed 5 June 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.First Case of Covid-19 in the United States | NEJM. Available at: https://www-nejm-org.ezp-prod1.hul.harvard.edu/doi/full/10.1056/NEJMc2004794. Accessed 5 June 2020.
- 3.Covid-19 — Navigating the Uncharted | NEJM. N Engl J Med Available at: http://www.nejm.org/doi/10.1056/NEJMe2002387. Accessed 5 June 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Zhou F, Yu T, Du R, et al. Clinical course and risk factors for mortality of adult inpatients with COVID-19 in Wuhan, China: a retrospective cohort study. The Lancet 2020; 395:1054–1062. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Wu Z, McGoogan JM. Characteristics of and Important Lessons From the Coronavirus Disease 2019 (COVID-19) Outbreak in China: Summary of a Report of 72 314 Cases From the Chinese Center for Disease Control and Prevention. JAMA 2020; [DOI] [PubMed] [Google Scholar]
- 6.CDC COVID-19 Response Team. Severe Outcomes Among Patients with Coronavirus Disease 2019 (COVID-19) - United States, February 12-March 16, 2020. MMWR Morb Mortal Wkly Rep 2020; 69:343–346. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.He X, Lau EHY, Wu P, et al. Temporal dynamics in viral shedding and transmissibility of COVID-19. Nat Med 2020; 26:672–675. [DOI] [PubMed] [Google Scholar]
- 8.Lauer SA, Grantz KH, Bi Q, et al. The Incubation Period of Coronavirus Disease 2019 (COVID-19) From Publicly Reported Confirmed Cases: Estimation and Application. Ann Intern Med 2020; 172:577–582. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.D W, B H, C H, et al. Clinical Characteristics of 138 Hospitalized Patients With 2019 Novel Coronavirus-Infected Pneumonia in Wuhan, China. JAMA, 2020. Available at: https://pubmed.ncbi.nlm.nih.gov/32031570/?from_term=first+case+wuhan&from_pos=10. Accessed 5 June 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Holshue ML, DeBolt C, Lindquist S, et al. First Case of 2019 Novel Coronavirus in the United States. N Engl J Med 2020; 382:929–936. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.The Covid-19 Tracker. STAT. 2020; Available at: https://www.statnews.com/feature/coronavirus/covid-19-tracker/. Accessed 5 June 2020. [Google Scholar]
- 12.Staff JSG, April 17 U, 2020, Comments127 6:26 p m Email to a Friend Share on Facebook Share on TwitterPrint this Article View. Nearly a third of 200 blood samples taken in Chelsea show exposure to coronavirus - The Boston Globe. Available at: https://www.bostonglobe.com/2020/04/17/business/nearly-third-200-blood-samples-taken-chelsea-show-exposure-coronavirus/. Accessed 9 June 2020. [Google Scholar]
- 13.Results released for antibody and COVID-19 testing of Boston residents. 2020. Available at: https://www.boston.gov/news/results-released-antibody-and-covid-19-testing-boston-residents. Accessed 9 June 2020.
- 14.Gasparini A. comorbidity: An R package for computing comorbidity scores. J Open Source Softw 2018; 3:648. [Google Scholar]
- 15.Batista GE, Monard MC. A Study of K-Nearest Neighbour as an Imputation Method. His 2002; 87:48. [Google Scholar]
- 16.Pedregosa F, Varoquaux G, Gramfort A, et al. Scikit-learn: Machine learning in Python. J Mach Learn Res 2011; 12:2825–2830. [Google Scholar]
- 17.Liu T-Y. Learning to rank for information retrieval. Found Trends Inf Retr 2009; 3:225–331. [Google Scholar]
- 18.Vergouwe Y, Steyerberg EW, Eijkemans MJC, Habbema JDF. Validity of prognostic models: when is a model clinically useful? Semin Urol Oncol 2002; 20:96–107. [DOI] [PubMed] [Google Scholar]
- 19.Guan W-J, Ni Z-Y, Hu Y, et al. Clinical Characteristics of Coronavirus Disease 2019 in China. N Engl J Med 2020; 382:1708–1720. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Ruan Q, Yang K, Wang W, Jiang L, Song J. Clinical predictors of mortality due to COVID-19 based on an analysis of data of 150 patients from Wuhan, China. Intensive Care Med 2020; 46:846–848. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Cai Q, Chen F, Wang T, et al. Obesity and COVID-19 Severity in a Designated Hospital in Shenzhen, China. Diabetes Care 2020; [DOI] [PubMed] [Google Scholar]
- 22.Filatov A, Sharma P, Hindi F, Espinosa PS. Neurological Complications of Coronavirus Disease (COVID-19): Encephalopathy. Cureus 12 Available at: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7170017/. Accessed 7 June 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Mao L, Jin H, Wang M, et al. Neurologic Manifestations of Hospitalized Patients With Coronavirus Disease 2019 in Wuhan, China. JAMA Neurol 2020; [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Wu Y, Xu X, Chen Z, et al. Nervous system involvement after infection with COVID-19 and other coronaviruses. Brain Behav Immun 2020; Available at: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7146689/. Accessed 7 June 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.