Derivation and validation of a clinical severity score for acutely ill adults with suspected COVID-19: The PRIEST observational cohort study

Steve Goodacre; Ben Thomas; Laura Sutton; Matthew Burnsall; Ellen Lee; Mike Bradburn; Amanda Loban; Simon Waterhouse; Richard Simmonds; Katie Biggs; Carl Marincowitz; Jose Schutter; Sarah Connelly; Elena Sheldon; Jamie Hall; Emma Young; Andrew Bentley; Kirsty Challen; Chris Fitzsimmons; Tim Harris; Fiona Lecky; Andrew Lee; Ian Maconochie; Darren Walter

doi:10.1371/journal.pone.0245840

. 2021 Jan 22;16(1):e0245840. doi: 10.1371/journal.pone.0245840

Derivation and validation of a clinical severity score for acutely ill adults with suspected COVID-19: The PRIEST observational cohort study

Steve Goodacre ^1,^*, Ben Thomas ¹, Laura Sutton ¹, Matthew Burnsall ¹, Ellen Lee ¹, Mike Bradburn ¹, Amanda Loban ¹, Simon Waterhouse ¹, Richard Simmonds ¹, Katie Biggs ¹, Carl Marincowitz ¹, Jose Schutter ¹, Sarah Connelly ¹, Elena Sheldon ¹, Jamie Hall ¹, Emma Young ¹, Andrew Bentley ², Kirsty Challen ³, Chris Fitzsimmons ⁴, Tim Harris ⁵, Fiona Lecky ¹, Andrew Lee ¹, Ian Maconochie ⁶, Darren Walter ⁷

Editor: Itamar Ashkenazi⁸

¹School of Health and Related Research (ScHARR), University of Sheffield, Sheffield, United Kingdom

²Intensive Care, Manchester University NHS Foundation Trust, Wythenshawe Hospital, Manchester, United Kingdom

³Emergency Department, Lancashire Teaching Hospitals NHS Foundation Trust, Preston, United Kingdom

⁴Emergency Department, Sheffield Children's NHS Foundation Trust, Sheffield, United Kingdom

⁵Emergency Department, Barts Health NHS Trust, London, United Kingdom

⁶Emergency Department, Imperial College Healthcare NHS Trust, London, United Kingdom

⁷Emergency Department, Manchester University NHS Foundation Trust, Wythenshawe Hospital, Manchester, United Kingdom

⁸Technion - Israel Institute of Technology, ISRAEL

Competing Interests: All authors have completed the ICMJE uniform disclosure form at www.icmje.org/coi_disclosure.pdf and declare: grant funding to their employing institutions from the National Institute for Health Research; no financial relationships with any organisations that might have an interest in the submitted work in the previous three years; no other relationships or activities that could appear to have influenced the submitted work. This does not alter our adherence to PLOS ONE policies on sharing data and materials.

^✉

* E-mail: s.goodacre@sheffield.ac.uk

Roles

Steve Goodacre: Conceptualization, Formal analysis, Funding acquisition, Writing – original draft

Ben Thomas: Data curation, Formal analysis, Project administration, Writing – review & editing

Laura Sutton: Data curation, Formal analysis, Writing – review & editing

Matthew Burnsall: Formal analysis, Validation, Writing – review & editing

Ellen Lee: Formal analysis, Validation, Writing – review & editing

Mike Bradburn: Methodology, Writing – review & editing

Amanda Loban: Data curation, Writing – review & editing

Simon Waterhouse: Data curation, Writing – review & editing

Richard Simmonds: Data curation, Writing – review & editing

Katie Biggs: Project administration, Writing – review & editing

Carl Marincowitz: Investigation, Writing – review & editing

Jose Schutter: Data curation, Writing – review & editing

Sarah Connelly: Data curation, Writing – review & editing

Elena Sheldon: Data curation, Writing – review & editing

Jamie Hall: Data curation, Writing – review & editing

Emma Young: Data curation, Writing – review & editing

Andrew Bentley: Conceptualization, Funding acquisition, Writing – review & editing

Kirsty Challen: Conceptualization, Funding acquisition, Writing – review & editing

Chris Fitzsimmons: Conceptualization, Funding acquisition, Writing – review & editing

Tim Harris: Conceptualization, Funding acquisition, Writing – review & editing

Fiona Lecky: Conceptualization, Funding acquisition, Writing – review & editing

Andrew Lee: Conceptualization, Funding acquisition, Writing – review & editing

Ian Maconochie: Conceptualization, Funding acquisition, Writing – review & editing

Darren Walter: Conceptualization, Funding acquisition, Writing – review & editing

Itamar Ashkenazi: Editor

PMCID: PMC7822515 PMID: 33481930

Abstract

Objectives

We aimed to derive and validate a triage tool, based on clinical assessment alone, for predicting adverse outcome in acutely ill adults with suspected COVID-19 infection.

Methods

We undertook a mixed prospective and retrospective observational cohort study in 70 emergency departments across the United Kingdom (UK). We collected presenting data from 22445 people attending with suspected COVID-19 between 26 March 2020 and 28 May 2020. The primary outcome was death or organ support (respiratory, cardiovascular, or renal) by record review at 30 days. We split the cohort into derivation and validation sets, developed a clinical score based on the coefficients from multivariable analysis using the derivation set, and the estimated discriminant performance using the validation set.

Results

We analysed 11773 derivation and 9118 validation cases. Multivariable analysis identified that age, sex, respiratory rate, systolic blood pressure, oxygen saturation/inspired oxygen ratio, performance status, consciousness, history of renal impairment, and respiratory distress were retained in analyses restricted to the ten or fewer predictors. We used findings from multivariable analysis and clinical judgement to develop a score based on the NEWS2 score, age, sex, and performance status. This had a c-statistic of 0.80 (95% confidence interval 0.79–0.81) in the validation cohort and predicted adverse outcome with sensitivity 0.98 (0.97–0.98) and specificity 0.34 (0.34–0.35) for scores above four points.

Conclusion

A clinical score based on NEWS2, age, sex, and performance status predicts adverse outcome with good discrimination in adults with suspected COVID-19 and can be used to support decision-making in emergency care.

Registration

ISRCTN registry, ISRCTN28342533, http://www.isrctn.com/ISRCTN28342533

Introduction

The initial management of acutely ill people with suspected COVID-19 involves assessing the risk of adverse outcome and the need for life-saving intervention, to then determine decisions around hospital admission and inpatient referral [1–5]. Triage tools can assist decision-making by combining information from clinical assessment in a structured manner to predict the risk of adverse outcome. They can take the form of a score that increases with the predicted risk of adverse outcome or a rule that categorises patients into groups according to their risk or their intended management. Inclusion of laboratory and radiological information can improve prediction but requires hospital attendance, increases emergency department (ED) length of stay, and increases the infection risk related to repeated patient contacts. Triage tools also need to be applied prospectively to the relevant patient group, using the information available at the time of presentation. The limited availability of rapid tests with sufficient sensitivity to rule out COVID-19 at initial assessment means that the relevant population is suspected rather than confirmed COVID-19. An appropriate triage tool for COVID-19 therefore needs to be based on clinical assessment alone and applicable to people with suspected COVID-19.

We designed the Pandemic Influenza Triage in the Emergency Department (PAINTED) study following the 2009 H1N1 influenza pandemic to develop and evaluate triage tools in any future influenza pandemic [6]. We changed PAINTED to the Pandemic Respiratory Infection Emergency System Triage (PRIEST) study in January 2020 to address any pandemic respiratory infection, including COVID-19. The United Kingdom (UK) Department of Health and Social Care activated PRIEST on 20 March 2020 to develop and evaluate triage tools in the COVID-19 pandemic. Initial descriptive analysis of the PRIEST data showed that adults presenting to the ED with suspected COVID-19 have much higher rates of COVID-19 positivity, hospital admission and adverse outcome than children [7]. We therefore decided to undertake separate studies in adults and children, and only develop a new triage tools in adults, which we present here.

Evaluation of existing triage tools using the PRIEST study data suggested that CURB-65 [8], the National Early Warning Score version 2 (NEWS2) [9] and the Pandemic Modified Early Warning Score (PMEWS) [10] provide reasonable prediction for adverse outcome in suspected COVID-19 (c-statistics 0.75 to 0.77) [11]. Scope therefore existed to develop a specific triage tool for COVID-19 with better prediction for adverse outcome.

We aimed to derive and validate a triage tool in the form of an illness severity score, based on clinical assessment alone, for predicting adverse outcome in acutely ill adults with suspected COVID-19 infection.

Materials and methods

We designed PRIEST as an observational study to collect standardised predictor variables recorded in the ED, which we would then use to derive and validate new tools for predicting adverse outcome up to 30 days after initial hospital presentation. The study did not involve any change to patient care. Hospital admission and discharge decisions were made according to usual practice, informed by local and national guidance.

We identified consecutive patients presenting to the ED of participating hospitals with suspected COVID-19 infection. Patients were eligible if they met the clinical diagnostic criteria [12] of fever (≥37.8°C) and acute onset of persistent cough (with or without sputum), hoarseness, nasal discharge or congestion, shortness of breath, sore throat, wheezing, or sneezing. This was determined on the basis of the assessing clinician recording that the patient had suspected COVID-19 or completing a standardised assessment form designed for suspected pandemic respiratory infection [6]. During the study period COVID-19 testing was only recommended for those admitted to hospital, so it was recorded as a descriptive variable but not used to select patients or in the analysis.

For this study we planned to develop a triage tool in the form of an illness severity score based on clinical assessment and routine observations that any health care professional could use to rapidly estimate the risk of adverse outcome. The score would be based on a number of categorised variables, with points allocated to each category of each variable, which would then be summed to give a total score reflecting the predicted risk of adverse outcome. To enhance usability, we planned to (a) use a restricted number of variables, rather than all potentially predictive variables, and (b) categorise variables in accordance with currently used scores, unless there was clear evidence that these categories provided suboptimal prediction.

Data collection was both prospective and retrospective. Participating EDs were provided with a standardised data collection form (S1 Appendix) that included variables used in existing triage tools or considered to be potentially useful predictors of adverse outcome. Participating sites could adapt the form to their local circumstances, including integrating it into electronic or paper clinical records to facilitate prospective data collection, or using it as a template for research staff to retrospectively extract data from clinical records. We did not seek consent to collect data but information about the study was provided in the ED and patients could withdraw their data at their request. Patients with multiple presentations to hospital were only included once, using data from the first presentation identified by research staff.

Research staff at participating hospitals reviewed patient records at 30 days after initial attendance and recorded outcomes using the follow-up form in S2 Appendix. The primary outcome was death or major organ support (respiratory, cardiovascular, or renal) up to 30 days after initial attendance. Death and major organ support were also analysed separately as secondary outcomes. Our primary outcome definition reflected the need for triage tools to identify patients at risk of adverse outcome or requiring life-saving intervention to prevent adverse outcome. Respiratory support was defined as any intervention to protect the patient’s airway or assist their ventilation, including non-invasive ventilation or acute administration of continuous positive airway pressure. It did not include supplemental oxygen alone or nebulised bronchodilators. Cardiovascular support was defined as any intervention to maintain organ perfusion, such as inotropic drugs, or invasively monitor cardiovascular status, such as central venous pressure or pulmonary artery pressure monitoring, or arterial blood pressure monitoring. It did not include peripheral intravenous cannulation or fluid administration. Renal support was defined as any intervention to assist renal function, such as haemofiltration, haemodialysis, or peritoneal dialysis. It did not include intravenous fluid administration.

We randomly split the study population into derivation and validation cohorts by randomly allocating the participating sites to one or other cohort. We developed a score based on the prognostic value of predictor variables in multivariable analysis of the derivation cohort and expert judgements regarding clinical usability. Candidate predictors were combined in a multivariable regression with Least Absolute Shrinkage and Selection Operator (LASSO) using ten sample cross validation to select the model. The LASSO begins with a full model of candidate predictors and simultaneously performs predictor selection and penalisation during model development to avoid overfitting. The LASSO was performed twice: once where the number of predictors were unrestricted, and a second time when the LASSO was restricted to pick ten predictors. Fractional polynomials were used to model non-linear relationships for continuous variables.

We excluded cases from all analyses if age or outcome data were missing. We undertook three multivariable analyses, using different approaches to missing predictor variable data in the derivation cohort: (1) Complete case; (2) Multiple imputation using chained equations; (3) Deterministic imputation with missing predictor data assumed to be normal, where applicable. We did not consider any predictor with more than 50% missing data across the cohort for inclusion in the predictive model.

Clinical members of the research team reviewed the models and selected variables for inclusion in the triage tool, based on their prognostic value in the model, the clinical credibility of their association with adverse outcome, and their availability in routine clinical care. We categorised continuous variables, using recognised categories from existing scores where appropriate, while checking that categorisation reflected the relationship between the variable and adverse outcome in the derivation data. We then assigned integer values to each category of predictor variable, taking into account the points allocated to the category in existing scores, and the coefficient derived from a multivariable logistic regression model using categorised continuous predictors. This generated a composite clinical score in which risk of adverse outcome increased with the total score.

We applied the clinical score to the validation cohort, calculating diagnostic parameters at each threshold of the score, constructing a receiver-operating characteristic (ROC) curve, calculating the area under the ROC curve (c-statistic) and calculating the proportion with an adverse outcome at each level of the score. We used deterministic imputation to handle missing data in the validation cohort, assuming missing predictor variable data were normal, but excluding cases with more than a pre-specified number of predictor variables missing. We also undertook a complete case sensitivity analysis.

The sample size was dependent on the size and severity of the pandemic, but based on a previous study in the 2009 H1N1 influenza pandemic we estimated we would need to collect data from 20,000 patients across 40–50 hospitals to identify 200 (1%) with an adverse outcome, giving sufficient power for model derivation. In the event, the adverse outcome rate in adults was much higher in the COVID-19 pandemic (22%) [7], giving us adequate power to undertake derivation and validation of triage tools to predict all three outcomes.

Patient and public involvement

The Sheffield Emergency Care Forum (SECF) is a public representative group interested in emergency care research [13]. Members of SECF advised on the development of the PRIEST study and two members joined the Study Steering Committee. Patients were not involved in the recruitment to and conduct of the study. We are unable to disseminate the findings to study participants directly.

Ethical approval

The North West—Haydock Research Ethics Committee gave a favourable opinion on the PAINTED study on 25 June 2012 (reference 12/NW/0303) and on the updated PRIEST study on 23rd March 2020, including the analysis presented here. The Confidentiality Advisory Group of the Health Research Authority granted approval to collect data without patient consent in line with Section 251 of the National Health Service Act 2006.

Results

The PRIEST study recruited 22485 patients from 70 EDs across 53 sites between 26 March 2020 and 28 May 2020. We included 20889 in the analysis after excluding 39 who requested withdrawal of their data, 1530 children, 20 with missing outcome data, and seven with missing age. The derivation cohort included 11773 patients and the validation cohort 9118. Table 1 shows the characteristics of the derivation and validation cohorts. Around 31% of each cohort had COVID-19 confirmed, reflecting a combination of lack of testing in those discharged, suboptimal sensitivity of standard tests, and the difficulty of differentiating COVID-19 from similar presentations.

Table 1. Characteristics of the study population (derivation and validation cohorts).

Characteristic	Statistic/level	Derivation	Validation
Age (years)	N	11773	9118
	Mean (SD)	62.4 (19.9)	62.4 (19.5)
	Median (IQR)	64 (48,79)	64 (48,79)
Sex	Missing	137	56
	Male	5746 (49.4%)	4455 (49.2%)
	Female	5890 (50.6%)	4607 (50.8%)
Ethnicity	Missing/prefer not to say	1819	2379
	UK/Irish/other white	8376 (84.1%)	5867 (87.1%)
	Asian	699 (7%)	345 (5.1%)
	Black/African/Caribbean	368 (3.7%)	272 (4%)
	Mixed/multiple ethnic groups	178 (1.8%)	69 (1%)
	Other	333 (3.3%)	186 (2.8%)
Presenting features	Cough	7248 (61.6%)	5737 (62.9%)
	Shortness of breath	8570 (72.8%)	7000 (76.8%)
	Fever	5714 (48.5%)	4562 (50%)
Comorbidities	Hypertension	3627 (30.8%)	2807 (30.8%)
	Heart Disease	2512 (21.3%)	2188 (24%)
	Diabetes	2394 (20.3%)	1735 (19%)
	Asthma	1867 (15.9%)	1541 (16.9%)
	Other chronic lung disease	2047 (17.4%)	1717 (18.8%)
	Renal impairment	1074 (9.1%)	856 (9.4%)
	Active malignancy	577 (4.9%)	543 (6%)
	Immunosuppression	312 (2.7%)	319 (3.5%)
	Steroid therapy	303 (2.6%)	254 (2.8%)
	No chronic disease	3385 (28.8%)	2406 (26.4%)
Symptom duration (days)	N	10790	8087
	Mean (SD)	8.1 (9.1)	7.6 (8.6)
	Median (IQR)	5 (2,10)	5 (2,10)
Heart rate (beats/min)	N	11506	8954
	Mean (SD)	94.7 (21.5)	95.2 (21.7)
	Median (IQR)	93 (80,108)	94 (80,109)
Respiratory rate (breaths/min)	N	11438	8908
	Mean (SD)	23.1 (6.9)	23.4 (7.1)
	Median (IQR)	22 (18,26)	22 (18,26)
Systolic BP (mmHg)	N	11423	8875
	Mean (SD)	134.5 (24.9)	134.8 (25)
	Median (IQR)	133 (118,149)	133 (118,150)
Diastolic BP (mmHg)	N	11373	8839
	Mean (SD)	78.3 (15.8)	78.2 (16.5)
	Median (IQR)	78 (68,88)	78 (68,88)
Temperature (°C)	N	11307	8924
	Mean (SD)	37.1 (1.1)	37.2 (1.1)
	Median (IQR)	37 (36.4,37.8)	37 (36.5,37.9)
Oxygen saturation (%)	N	11658	8974
	Mean (SD)	94.9 (6.2)	94.4 (7.5)
	Median (IQR)	96 (94,98)	96 (94,98)
Air or supplementary oxygen	Missing	4113	4735
	On air	5243 (68.4%)	2544 (58%)
	On supplementary oxygen	2417 (31.6%)	1839 (42%)
Supplementary inspired oxygen (%)	N	2417	1839
	Mean (SD)	45.9 (21.9)	48.6 (22.5)
	Median (IQR)	36 (28,60)	36 (28,80)
Glasgow Coma Scale	N	8627	6801
	Mean (SD)	14.6 (1.4)	14.6 (1.4)
	Median (IQR)	15 (15,15)	15 (15,15)
Consciousness	Missing	1515	872
	Alert	9774 (95.3%)	7794 (94.5%)
	Verbal	333 (3.2%)	307 (3.7%)
	Pain	101 (1%)	82 (1%)
	Unresponsive	50 (0.5%)	63 (0.8%)
Performance status	Missing	620	458
	1. Unrestricted normal activity	5989 (53.7%)	4547 (52.5%)
	2. Limited strenuous activity, can do light activity	1315 (11.8%)	1056 (12.2%)
	3. Limited activity, can self-care	1565 (14%)	1211 (14%)
	4. Limited self-care	1494 (13.4%)	1155 (13.3%)
	5. Bed/chair bound, no self-care	790 (7.1%)	691 (8%)
Admitted at initial assessment	Missing	7	21
	No	3744 (31.8%)	3122 (34.3%)
	Yes	8022 (68.2%)	5975 (65.7%)
Location of first admission^†	Missing	173	159
	Ward	7238 (92.2%)	5409 (93%)
	ITU	479 (6.1%)	311 (5.3%)
	HDU	132 (1.7%)	96 (1.7%)
Respiratory pathogen	COVID-19	3660 (31.1%)	2861 (31.4%)
	Influenza	2 (0%)	25 (0.3%)
	Other	912 (7.7%)	809 (8.9%)
	None identified	7199 (61.1%)	5423 (59.5%)
Mortality status	Missing	0	3
	Alive	10002 (85%)	7640 (83.8%)
	Dead	1771 (15%)	1475 (16.2%)
	Death with organ support^*	326 (18.4%)	367 (24.9%)
	Death with no organ support^*	1445 (81.6%)	1108 (75.1%)
Organ support	Respiratory	939 (8%)	1005 (11%)
	Cardiovascular	316 (2.7%)	201 (2.2%)
	Renal	104 (0.9%)	114 (1.3%)
	Any	999 (8.5%)	1059 (11.6%)

Open in a new tab

* Denominator total deaths in category

† Denominator admitted patients

Table 2 shows summary statistics for each predictor variable in those with and without adverse outcome in the derivation sample, and univariate odds ratios for prediction of adverse outcome. Physiological variables were categorised to reflect their expected relationships with adverse outcome.

Table 2. Univariate analysis of predictor variables for adverse outcome (derivation cohort).

Predictor Variable	Category (categorical variables)	n (outcome)		Odds ratio	p-value	95% CI
		Adverse	Non adverse
Age (n = 11773)				1.04	0.00	(1.04, 1.04)
Sex (n = 11636)	Ref = Female	1008	4882
	Male	1413	4333	1.58	0.000	(1.44, 1.73)
Ethnicity Category (n = 9954)	Ref = UK/Irish/other white	1767	6609
	Asian	138	561	0.92	0.399	(0.76, 1.12)
	Black/African/Caribbean	72	296	0.91	0.481	(0.70, 1.18)
	Mixed/multiple ethnic groups	28	150	0.70	0.084	(0.46, 1.05)
	Other	39	294	0.50	0.000	(0.35, 0.70)
Shortness of breath (n = 11746)	Ref = No	536	2640
	Yes	1896	6674	1.40	0.000	(1.26, 1.56)
Cough (n = 11746)	Ref = No	1065	3433
	Yes	1367	5881	0.75	0.000	(0.68, 0.82)
Fever (n = 11746)	Ref = No	1274	4758
	Yes	1158	4556	0.95	0.253	(0.87, 1.04)
Hypertension (n = 11732)	Ref = No	1445	6660
	Yes	995	2632	1.74	0.000	(1.59, 1.91)
Heart Disease (n = 11732)	Ref = No	1680	7540
	Yes	760	1752	1.95	0.000	(1.76, 2.15)
Diabetes (n = 11732)	Ref = No	1733	7605
	Yes	707	1687	1.84	0.000	(1.66, 2.04)
Asthma (n = 11732)	Ref = No	2143	7722
	Yes	297	1570	0.68	0.000	(0.60, 0.78)
Other chronic lung disease (n = 11732)	Ref = No	1919	7766
	Yes	521	1526	1.38	0.000	(1.24, 1.54)
Renal impairment (n = 11732)	Ref = No	2051	8607
	Yes	389	685	2.38	0.000	(2.09, 2.72)
Active malignancy (n = 11732)	Ref = No	2248	8907
	Yes	192	385	1.98	0.000	(1.65, 2.36)
Immunosuppression (n = 11732)	Ref = No	2360	9060
	Yes	80	232	1.32	0.033	(1.02, 1.71)
Steroid therapy (n = 11732)	Ref = No	2363	9066
	Yes	77	226	1.31	0.046	(1.01, 1.70)
Symptom duration (n = 10790)				0.97	0.000	(0.96, 0.98)
Number current medications (n = 11183)				1.09	0.00	(1.08, 1.10)
Respiratory rate (n = 11773)	Ref = 12–20 or missing	644	5061
	<9	3	5	4.72	0.034	(1.12, 19.78)
	9–11	3	8	2.95	0.111	(0.78, 11.14)
	21–24	581	2191	2.08	0.000	(1.84, 2.36)
	>24	1213	2064	4.62	0.000	(4.14, 5.15)
Systolic Blood Pressure (n = 11773)	Ref = 111–219 or missing	1860	8093
	101–110	269	745	1.57	0.000	(1.35, 1.82)
	91–100	170	320	2.31	0.000	(1.91, 2.80)
	<91	137	143	4.17	0.000	(3.28, 5.30)
	>219	8	28	1.24	0.588	(0.57, 2.73)
Heart rate (n = 11773)	Ref = 51–90 or missing	1007	4353
	<41	15	42	1.54	0.152	(0.85, 2.79)
	41–50	12	42	1.24	0.521	(0.65, 2.35)
	91–110	776	3112	1.08	0.159	(0.97, 1.20)
	111–130	450	1367	1.42	0.000	(1.25, 1.62)
	>130	184	413	1.93	0.000	(1.60, 2.32)
Temperature (n = 11773)	Ref = 36.1–38.0 or missing	1498	6747
	35.1–36	245	958	1.15	0.067	(0.99, 1.34)
	38.1–39	446	1137	1.77	0.000	(1.56, 2.00)
	>39.0	166	386	1.94	0.000	(1.60, 2.34)
	<35.1	89	101	3.97	0.000	(2.97, 5.31)
GCS Total (n = 8627)	Ref = Mild (13–15)	1551	6618
	Moderate (9–12)	187	150	5.32	0.000	(4.26, 6.64)
	Severe (< = 8)	73	48	6.49	0.000	(4.49, 9.38)
AVPU (n = 10258)	Ref = Alert	1756	8018
	Verbal	176	157	5.12	0.000	(4.10, 6.39)
	Pain	62	39	7.26	0.000	(4.85, 10.87)
	Unresponsive	32	18	8.12	0.000	(4.55, 14.49)
Performance status (n = 11153)	Ref = Unrestricted normal activity	709	5280
	Limited strenuous activity, can do light activity	268	1047	1.91	0.000	(1.63, 2.23)
	Limited activity, can self care	430	1135	2.82	0.000	(2.46, 3.23)
	Limited self care	560	934	4.47	0.000	(3.92, 5.09)
	Bed/chair bound, no self care	334	456	5.45	0.000	(4.64, 6.41)
Severe respiratory distress (n = 11773)	Ref = No	2250	9184
	Yes	194	145	5.46	0.000	(4.38, 6.81)
Respiratory exhaustion (n = 11773)	Ref = No	2360	9227
	Yes	84	102	3.22	0.000	(2.40, 4.31)
Severe dehydration (n = 11773)	Ref = No	2373	9240
	Yes	71	89	3.11	0.000	(2.27, 4.26)
Previous attendance (n = 11773)	Ref = No	2160	8429
	Yes	284	900	1.23	0.004	(1.07, 1.42)
Known contact with Covid-19 case (n = 1177	Ref = No	2175	8474
	Yes	269	855	1.23	0.006	(1.06, 1.42)
Central capillary refill (n = 2935)	Ref = Normal	486	2179
	Abnormal	101	169	2.68	0.000	(2.05, 3.49)

Open in a new tab

S1–S3 Tables show the results of multivariable analysis using complete case analysis, multiple imputation and deterministic imputation. Unrestricted LASSO on multiply imputed data included more predictors, with a higher c-statistic for the model (0.85, 95% CI 0.84 to 0.86), than the LASSO on deterministically imputed data or complete cases (c-statistics both 0.83, 95% CI 0.82 to 0.84). When restricted, there were nine predictors that were retained by LASSO in all three analyses (age, sex, respiratory rate, systolic BP, oxygen saturation/inspired oxygen ratio, history of renal impairment, performance status, consciousness and respiratory distress). C-statistics for the restricted models using deterministic imputation and complete case analysis (0.82, 95% CI 0.81 to 0.83) were slightly lower than c-statistics for the respective unrestricted models.

We developed a score through the following steps:

Clinical review judged that the nine predictors are clinically credible; that age, sex, respiratory rate, systolic BP, consciousness, oxygen saturation and inspired oxygen are routinely recorded in administrative systems and early warning scores (although the ratio of oxygen saturation to inspired oxygen is not routinely recorded); and that many EDs routinely record a measure of performance status for suspected COVID-19 cases that could be mapped onto our scale.
We decided to include temperature and heart rate, as these are routinely recorded alongside other physiological variables in early warning scores, and added prognostic value in the full models.
We created categories for age based on the observed multivariate association between age and outcome in our data, and categories for respiratory rate, heart rate, oxygen saturation, inspired oxygen, systolic BP, consciousness and temperature based on those used in the NEWS2 early warning score. NEWS2 is outlined in S3 Appendix.
We created a multivariable logistic regression model using categorised predictor variables (S4 Table) and compared the coefficients for each category of predictor variable in the NEWS2 score to the points allocated in the NEWS2 score. We judged that the inconsistencies between the coefficients and the points used in NEWS2 were insufficient to justify allocating alternative points in our score. We allocated points to categories of age, sex, performance status, renal history, and respiratory distress, based on the coefficients in the model.
We removed renal history and respiratory distress from the multivariable model (S5 Table), noted that this made no meaningful difference to the c-statistic (0.82 in both models) and, given concerns about subjectivity and lack of routine recording, decided not to include them in the score.

Fig 1 provides a summary of the derivation process. The developed score is shown in Fig 2. We applied the score to the validation cohort. Fig 3 shows the ROC curve, with a c-statistic of 0.80 (95% CI 0.79 to 0.81) for the score. Sensitivity analysis using only complete cases gave a c-statistic of 0.79 (95% CI 0.77 to 0.80). S1 and S2 Figs show the calibration plots for the unrestricted and restricted LASSO models applied to the validation cohort. The c-statistics (0.82 and 0.81 respectively, compared with 0.80 for the score) indicate the effect of restricting the number of variables and then developing a score had upon discrimination. Fig 4 shows the probability of adverse outcome for each value of the score. Table 3 shows the sensitivity and specificity for predicting outcome at each threshold of the triage tool.

Table 3. Sensitivity, specificity, PPV, NPV and proportion with a positive score at each score threshold for predicting the primary outcome of death or organ support, validation cohort.

Score threshold	Proportion with positive score	Sensitivity (95% CI)	Specificity (95% CI)	Positive predictive value (95% CI)	Negative predictive value (95% CI)
>0	0.97	1.00 (1.00, 1.00)	0.04 (0.03,0.04)	0.25 (0.24, 0.25)	0.99 (0.98, 1.00)
>1	0.92	1.00 (1.00, 1.00)	0.10 (0.10, 0.10)	0.26 (0.25, 0.26)	0.99 (0.99, 0.99)
>2	0.87	0.99 (0.99, 0.99)	0.17 (0.17, 0.18)	0.27 (0.27, 0.28)	0.99 (0.98, 0.99)
>3	0.80	0.99 (0.98, 0.99)	0.26 (0.26, 0.27)	0.30 (0.29, 0.30)	0.98 (0.98, 0.99)
>4	0.73	0.98 (0.97, 0.98)	0.34 (0.34, 0.35)	0.32 (0.31, 0.32)	0.98 (0.98, 0.98)
>5	0.66	0.95 (0.95, 0.95)	0.43 (0.43, 0.43)	0.34 (0.34, 0.35)	0.96 (0.96, 0.97)
>6	0.59	0.91 (0.91, 0.91)	0.50 (0.50, 0.51)	0.37 (0.36, 0.37)	0.95 (0.94, 0.95)
>7	0.53	0.86 (0.85, 0.86)	0.58 (0.57, 0.58)	0.39 (0.39, 0.40)	0.93 (0.93, 0.93)
>8	0.46	0.80 (0.79, 0.80)	0.65 (0.64, 0.65)	0.42 (0.41, 0.42)	0.91 (0.91, 0.91)
>9	0.40	0.73 (0.72, 0.74)	0.71 (0.71, 0.71)	0.44 (0.44, 0.45)	0.89 (0.89, 0.90)
>10	0.33	0.65 (0.64, 0.66)	0.77 (0.77, 0.77)	0.47 (0.46, 0.48)	0.87 (0.87, 0.88)
>11	0.27	0.57 (0.56, 0.58)	0.82 (0.82, 0.82)	0.50 (0.49, 0.50)	0.86 (0.86, 0.86)
>12	0.21	0.47 (0.46, 0.48)	0.87 (0.87, 0.87)	0.53 (0.53, 0.54)	0.84 (0.84, 0.84)
>13	0.16	0.37 (0.37, 0.38)	0.91 (0.91, 0.91)	0.56 (0.55, 0.57)	0.82 (0.82, 0.82)
>14	0.12	0.29 (0.28, 0.30)	0.94 (0.93, 0.94)	0.59 (0.58, 0.60)	0.81 (0.80, 0.81)
>15	0.09	0.23 (0.22, 0.23)	0.96 (0.95, 0.96)	0.62 (0.61, 0.64)	0.80 (0.79, 0.80)
>16	0.06	0.17 (0.17, 0.18)	0.97 (0.97, 0.97)	0.65 (0.64, 0.67)	0.79 (0.79, 0.79)
>17	0.04	0.12 (0.11, 0.12)	0.98 (0.98, 0.98)	0.68 (0.66, 0.70)	0.78 (0.78, 0.78)
>18	0.03	0.08 (0.07, 0.08)	0.99 (0.99, 0.99)	0.67 (0.65, 0.69)	0.77 (0.77, 0.78)
>19	0.02	0.05 (0.04, 0.05)	0.99 (0.99, 1.00)	0.73 (0.70, 0.76)	0.77 (0.77, 0.77)
>20	0.01	0.03 (0.03, 0.03)	1.00 (1.00, 1.00)	0.76 (0.72, 0.79)	0.77 (0.76, 0.77)
>21	0.01	0.02 (0.02, 0.02)	1.00 (1.00, 1.00)	0.81 (0.76, 0.85)	0.76 (0.76, 0.77)
>22	0.00	0.01 (0.01, 0.01)	1.00 (1.00, 1.00)	0.84 (0.76, 0.90)	0.76 (0.76, 0.77)
>23	0.00	0.01 (0.00, 0.01)	1.00 (1.00, 1.00)	0.87 (0.76, 0.94)	0.76 (0.76, 0.76)
>24	0.00	0.00 (0.00, 0.00)	1.00 (1.00, 1.00)	0.86 (0.66, 0.96)	0.76 (0.76, 0.76)

Open in a new tab

S3 and S4 Figs show the ROC curves, and S6 and S7 Tables show the predictive performance of the score when applied to the secondary outcomes of organ support and death without organ support in the validation cohort. The score provided better prognostic discrimination for death without organ support (c-statistic 0.83, 95% CI 0.82 to 0.84) than for organ support (0.68, 95% CI 0.67 to 0.69).

Discussion

We have developed a clinical illness severity score for acutely ill patients presenting to the ED with suspected COVID-19 that combines the NEWS2 score, age, sex, and performance status to predict the risk of death or receipt of organ support in the following 30 days. The score ranges from zero to 29 points, with a score greater than four predicting adverse outcome with high sensitivity and low specificity. In developing the score, we tried to optimise usability without compromising performance. Usability was optimised by basing the score on the existing NEWS2 score and only adding easily available information. The c-statistic of the score on the validation cohort was 0.80, compared with 0.82 and 0.81 when the unrestricted and restricted models were applied to the validation cohort, suggesting that simplifying the tool did not excessively compromise prediction.

Our score has a number of features that sets it apart from other scores. Derivation and validation were rigorously undertaken, following an independently peer-reviewed protocol set up in advance of the pandemic, using data from a very large and representative cohort presenting to EDs across the UK, and analysed using a pre-specified statistical analysis plan. Our choice of adverse outcome ensured that the score predicts need for life-saving intervention, not just mortality. Our patient selection criteria ensure that the score is applicable to the clinically relevant population of suspected COVID-19 rather than a confirmed cohort, which would typically be assembled retrospectively and exclude those with diagnostic uncertainty at presentation. We also included patients who were discharged after ED assessment, which is essential if the score is to be used to support decision-making around admission or discharge.

Our score improves upon those recommended in existing guidelines for the initial assessment of suspected acute COVID-19, with a c-statistic of 0.8 compared to 0.75 for CURB-65 and 0.77 for NEWS2 and PMEWS [11]. The practical implications of improved prediction can be appreciated by considering how the addition of age, sex and performance status to NEWS2 might improve decision-making around admission. NEWS2 would suggest that a young person with unlimited performance status and an elderly person with limited performance but the same NEWS2 score should have the same admission decision, whereas our score recognises that safe discharge is much more likely to be achieved in the younger patient. Our score shares similarities with PMEWS, which was developed for the H1N1 influenza pandemic, but achieves better prediction by using more detailed age, sex and performance status data.

Since the start of the pandemic numerous studies have sought to develop and evaluate prediction scores for COVID-19. A living systematic review [14] has identified 50 prognostic models for adverse outcome in people with diagnosed COVID-19. C-statistics ranged from 0.68 to 0.99, and the most frequently used predictor variables were age, sex, comorbidities, temperature, lymphocyte count, C reactive protein, creatinine, and imaging features. Recently the ISARIC WHO Clinical Characterisation Protocol developed and validated the 4C Mortality Score [15] that predicts the mortality risk for people admitted with COVID-19 with better discriminant performance than 15 pre-existing risk stratification scores (c-statistic 0.77 versus 0.61–0.76).

These scores have important limitations as triage tools, which we have attempted to address in developing our triage tool. Many were developed to predict mortality, whereas triage tools need to predict need for life-saving treatment. Most were developed on admitted populations, whereas the relevant population for an initial assessment tool needs to include those discharged after assessment. This is because the decision to be admit is likely to be based upon the same predictor variables that are used in the tool, so excluding discharged patients will underestimate the predictive value of these variables. For example, oxygen saturation is an important predictor of adverse outcome and is also an important criterion for determining hospital admission. Developing a triage tool on a population selected on the basis of oxygen saturation will underestimate the value of oxygen saturation as a predictor. This may explain why many scores developed on admitted patients do not include well-recognised clinical predictors. Finally, inclusion of laboratory data as predictor variables prolongs ED stay and prevents the triage tool being used for rapid assessment.

Rapid clinical scores have been proposed or evaluated in several studies. Liao et al [16] proposed adding age>65 years to the NEWS2 score to aid decision-making, based on early experience of the pandemic in China. Myrstad et al [17] reported a c-statistic of 0.822 (95% CI 0.690 to 0.953) for NEWS2 predicting death or severe disease in a small study (N = 66) of people hospitalised with confirmed COVID-19. Hu et al [18] reported c-statistics of 0.833 (0.737 to 0.928) for the Rapid Emergency Medicine Score (REMS) and 0.677 (0.541 to 0.813) for the Modified Emergency Medicine Score (MEWS) for predicting mortality in critically ill patients with COVID-19. Haimovich et al [19] developed the quick COVID-19 severity index, consisting of respiratory rate, oxygen saturation, and oxygen flow rate, which predicted respiratory failure within 24 hours in adults admitted with COVID-19 requiring supplemental oxygen with a c-statistic of 0.81 (0.73 to 0.89). These studies are limited by small numbers (producing imprecise estimates of accuracy), single-centre design (limiting generalisability) and only including admitted patients.

An important limitation of our study is that retrospective data collection resulted in some missing and may have resulted in some inaccuracy of predictor variable recording. Recording of inspired oxygen concentration was subject to a particularly high rate of missing data. We anticipated this problem and pre-specified analyses involving multiple imputation, deterministic imputation, and complete case analysis to explore the impact of missing data. There was reasonable concordance between the models. Another potential limitation is that our definition of adverse outcome did not include events occurring after 30 days or requirements for hospital admission (such as oxygen therapy or intravenous fluids) that fell short of our definition of organ support. We may also have missed adverse outcomes if patients attended a different hospital after initial hospital discharge. This is arguably less likely in the context of a pandemic, in which movements between regions were curtailed, but cannot be discounted. The 5-point scale we used for determining performance status has not been widely used or evaluated, although the 9-point clinical frailty index maps onto it reasonably well. Finally, although our triage tool can be used in the prehospital or community setting, we recommend caution in extrapolating our findings to settings where there is likely to be a lower prevalence of adverse outcome.

Our clinical score could be used to support ED decision-making around hospital admission and inpatient referral. Scores of four or less could identify a proportion of patients at low risk of adverse outcome who would be suitable for discharge home, while a higher threshold could be used to select patients for critical care. However, triage tools should only support and not replace clinical decision-making. The clinical context, patient preferences, and available resources must be considered. This may be illustrated by older patients (especially male) with limited performance status who score greater than four with little or no physiological abnormalities. These patients would not necessarily be at high risk of adverse outcome if they were managing their symptoms at home but the clinical context is presentation to a hospital ED. Our data show that if these patients needed ED assessment then they were at significant risk of adverse outcome even if there was little physiological abnormality. In terms of decision-making, patient preference should be taken into account, since these patients may accept discharge with a significant risk of adverse outcome if hospital admission provides no clear benefit.

Our triage tool could also be used to support prehospital and community decision-making around decisions to refer for hospital assessment. However, the importance of developing scores in an appropriate population needs to be considered. A score developed on an ED population may be inappropriate for supporting decisions to transport to the ED in the same way as scores developed on the inpatient population may be inappropriate for supporting admission decisions in an ED population. Further validation is required to determine the performance of the tool in these settings. Further ED validation in subsequent waves of the pandemic or other ED settings would also be helpful to determine whether changes or differences in the pandemic population or outcomes lead to changes in outcome prediction.

In summary, we have developed a clinical score that can provide a rapid and accurate assessment of the risk of adverse outcome in adults who are acutely ill with suspected COVID-19.

Supporting information

S1 Fig. Calibration plot for unrestricted LASSO model performance, validation cohort.

(TIF)

Click here for additional data file.^{(526.8KB, tif)}

S2 Fig. Calibration plot for restricted LASSO model performance, validation cohort.

(TIF)

Click here for additional data file.^{(528.7KB, tif)}

S3 Fig. ROC curve for the tool predicting the secondary outcome of organ support, validation cohort.

(TIF)

Click here for additional data file.^{(77.9KB, tif)}

S4 Fig. ROC curve for tool predicting the secondary outcome of death without organ support, validation cohort.

(TIF)

Click here for additional data file.^{(77.7KB, tif)}

S1 Table. Multivariable analysis, complete case (N = 5988).

(DOCX)

Click here for additional data file.^{(28.2KB, docx)}

S2 Table. Multivariable analysis, using multiple imputation (50 imputations; N = 11636).

(DOCX)

Click here for additional data file.^{(31.9KB, docx)}

S3 Table. Multivariable analysis, using deterministic imputation (N = 9891).

(DOCX)

Click here for additional data file.^{(28.5KB, docx)}

S4 Table. Logistic regression model based on selected categorised predictor variables.

(DOCX)

Click here for additional data file.^{(31.3KB, docx)}

S5 Table. Logistic regression model based on selected categorised predictor variables, excluding respiratory distress and history of renal impairment.

(DOCX)

Click here for additional data file.^{(29.9KB, docx)}

S6 Table. Sensitivity, specificity, PPV, NPV at each score threshold for predicting the secondary outcome of organ support, validation cohort.

(DOCX)

Click here for additional data file.^{(25.1KB, docx)}

S7 Table. Sensitivity, specificity, PPV, NPV at each score threshold for predicting the secondary outcome of death without organ support, validation cohort.

(DOCX)

Click here for additional data file.^{(24.4KB, docx)}

S1 Appendix. Standardised data collection form.

(PDF)

Click here for additional data file.^{(130.1KB, pdf)}

S2 Appendix. Follow-up form.

(PDF)

Click here for additional data file.^{(150.8KB, pdf)}

S3 Appendix. The NEWS2 score.

(DOCX)

Click here for additional data file.^{(23.2KB, docx)}

S4 Appendix. Study steering committee.

(DOCX)

Click here for additional data file.^{(15.5KB, docx)}

S5 Appendix. Site research staff.

(DOCX)

Click here for additional data file.^{(15.9KB, docx)}

S6 Appendix. Supporting research staff.

(DOCX)

Click here for additional data file.^{(12.3KB, docx)}

Acknowledgments

We thank Katie Ridsdale for clerical assistance with the study, Erica Wallis (Sponsor representative, all members of the Study Steering Committee (S4 Appendix) and the site research teams who delivered the data for the study (S5 Appendix), and the research team at the University of Sheffield past and present (S6 Appendix).

Data Availability

Data are available in the in the ORDA data repository: http://doi.org/10.15131/shef.data.13194845.

Funding Statement

The PRIEST study was funded by the United Kingdom National Institute for Health Research Health Technology Assessment (HTA) programme (project reference 11/46/07). The funder played no role in the study design; in the collection, analysis, and interpretation of data; in the writing of the report; and in the decision to submit the article for publication. The views expressed are those of the authors and not necessarily those of the NHS, the NIHR or the Department of Health and Social Care.

References

1.World Health Organisation. Clinical care of severe acute respiratory infections–Tool kit. https://www.who.int/publications-detail/clinical-care-of-severe-acute-respiratory-infections-tool-kit (accessed 28/04/2020)
2.International Federation for Emergency Medicine. Global Response to COVID-19 for Emergency Healthcare Systems and Providers: From the IFEM Task Force on ED Crowding and Access Block. https://www.ifem.cc/coronavirus-2019-information/ (accessed 15/06/2020)
3.NHS. Clinical guide for the management of emergency department patients during the coronavirus pandemic. 17 March 2020 Version 1 https://www.england.nhs.uk/coronavirus/secondary-care/other-resources/specialty-guides/#ae (accessed 15/06/2020)
4.National Institute for Health and Care Excellence. COVID-19 rapid guideline: managing suspected or confirmed pneumonia in adults in the community. Published: 3 April 2020. www.nice.org.uk/guidance/ng165 (accessed 28/04/2020) [PubMed]
5.American College of Emergency Physicians. Guide to Coronavirus Disease (COVID-19) https://www.acep.org/corona/covid-19-field-guide/cover-page/
6.Goodacre S, Irving A, Wilson R, Beever D, Challen K. The PAndemic INfluenza Triage in the Emergency Department (PAINTED) pilot cohort study. Health Technol Assess 2015;19(3):1–70. 10.3310/hta19030 [DOI] [PMC free article] [PubMed] [Google Scholar]
7.Goodacre S, Thomas B, Lee E, Sutton L, Loban A, Waterhouse S, et al. (2020) Characterisation of 22445 patients attending UK emergency departments with suspected COVID-19 infection: Observational cohort study. PLoS ONE 15(11): e0240206 10.1371/journal.pone.0240206 [DOI] [PMC free article] [PubMed] [Google Scholar]
8.Lim W, van der Eerden MM, Laing R, Boersma W, Karalus N, Town G, et al. Defining community acquired pneumonia severity on presentation to hospital: an international derivation and validation study. Thorax 2003; 58(5): 377–382. 10.1136/thorax.58.5.377 [DOI] [PMC free article] [PubMed] [Google Scholar]
9.Royal College of Physicians. (2017). National Early Warning Score (NEWS) 2: Standardising the assessment of acute-illness severity in the NHS. Updated report of a working party. London: RCP. [Google Scholar]
10.Challen K, Bright J, Bentley A, Walter D. Physiological-social score (PMEWS) vs. CURB-65 to triage pandemic influenza: a comparative validation study using community-acquired pneumonia as a proxy. BMC Health Serv Res. 2007; 7: 33 10.1186/1472-6963-7-33 [DOI] [PMC free article] [PubMed] [Google Scholar]
11.Thomas B, Biggs K, Goodacre S, et al. Prognostic accuracy of emergency department triage tools for adults with suspected COVID-19: The PRIEST observational cohort study. [Prerpint] medRxiv 2020. 09.01.20185793; https://www.medrxiv.org/content/10.1101/2020.09.02.20185892v1 [DOI] [PMC free article] [PubMed] [Google Scholar]
12.Public Health England. COVID-19: investigation and initial clinical management of possible cases. https://www.gov.uk/government/publications/wuhan-novel-coronavirus-initial-investigation-of-possible-cases/investigation-and-initial-clinical-management-of-possible-cases-of-wuhan-novel-coronavirus-wn-cov-infection#criteria (accessed 27/04/2020)
13.Hirst E, Irving A, Goodacre S. Patient and public involvement in emergency care research. Emerg Med J 2016;33:665–670. 10.1136/emermed-2016-205700 [DOI] [PubMed] [Google Scholar]
14.Wynants Laure, Van Calster Ben, Collins Gary S, Riley Richard D, Heinze Georg, Schuit Ewoud et al. Prediction models for diagnosis and prognosis of covid-19: systematic review and critical appraisal BMJ 2020; 369: m1328 10.1136/bmj.m1328 [DOI] [PMC free article] [PubMed] [Google Scholar]
15.Knight Stephen R, Ho Antonia, Pius Riinu, Buchan Iain, Carson Gail, Drake Thomas M et al. Risk stratification of patients admitted to hospital with covid-19 using the ISARIC WHO Clinical Characterisation Protocol: development and validation of the 4C Mortality Score BMJ 2020; 370: m3339 10.1136/bmj.m3339 [DOI] [PMC free article] [PubMed] [Google Scholar]
16.Liao X, Wang B, Kang Y. Novel coronavirus infection during the 2019–2020 epidemic: preparing intensive care units—the experience in Sichuan Province, China. Intensive Care Med 46, 357–360 (2020). 10.1007/s00134-020-05954-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
17.Myrstad M, Ihle-Hansen H, Tveita AA, Andersen EL, Nygård S, Tveit A, et al. National Early Warning Score 2 (NEWS2) on admission predicts severe disease and inhospital mortality from Covid-19 –a prospective cohort study. Scandinavian Journal of Trauma, Resuscitation and Emergency Medicine 2020; 28:66 10.1186/s13049-020-00764-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
18.Hu H, Yao N, Qiu Y. Comparing Rapid Scoring Systems in Mortality Prediction of Critically Ill Patients With Novel Coronavirus Disease. Acad Emerg Med. 2020;27(6):461–468. 10.1111/acem.13992 [DOI] [PMC free article] [PubMed] [Google Scholar]
19.Haimovich A, Ravindra NG, Stoytchev S, Young HP, PerryWilson F, van Dijk D, et al. , Development and validation of the quick COVID-19 severity index (qCSI): a prognostic tool for early clinical decompensation Annals of Emergency Medicine (2020), 10.1016/j.annemergmed.2020.07.022. [DOI] [PMC free article] [PubMed] [Google Scholar]

PLoS One. doi: 10.1371/journal.pone.0245840.r001

Decision Letter 0

Itamar Ashkenazi

Transfer Alert

This paper was transferred from another journal. As a result, its full editorial history (including decision letters, peer reviews and author responses) may not be present.

11 Dec 2020

PONE-D-20-34978

Derivation and validation of a clinical severity score for acutely ill adults with suspected COVID-19: The PRIEST observational cohort study

PLOS ONE

Dear Dr. Goodacre,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

I do wish to emphasize one point, the authors need to emphasize their uniqueness among various other scores recently published.

Please submit your revised manuscript by Jan 25 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.
A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.
An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols

We look forward to receiving your revised manuscript.

Kind regards,

Itamar Ashkenazi

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2.Please amend the manuscript submission data (via Edit Submission) to include author Ben Thomas, Laura Sutton, Matthew Burnsall, Ellen Lee, Mike Bradburn, Amanda Loban, Simon Waterhouse, Richard Simmonds, Katie Biggs, Carl Marincowitz, Jose Schutter, Sarah Connelly, Elena Sheldon, Jamie Hall, Emma Young, Andrew Bentley, Kirsty Challen, Chris Fitzsimmons, Tim Harris, Fiona Lecky, Andrew Lee, Ian Maconochie, Darren

Walter.

3.Thank you for stating the following in the Competing Interests section:

[All authors have completed the ICMJE uniform disclosure form at www.icmje.org/coi_disclosure.pdf and declare: grant funding to their employing institutions from the National Institute for Health Research; no financial relationships with any organisations that might have an interest in the submitted work in the previous three years; no other relationships or activities that could appear to have influenced the submitted work.].

Please confirm that this does not alter your adherence to all PLOS ONE policies on sharing data and materials, by including the following statement: "This does not alter our adherence to PLOS ONE policies on sharing data and materials.” (as detailed online in our guide for authors http://journals.plos.org/plosone/s/competing-interests). If there are restrictions on sharing of data and/or materials, please state these. Please note that we cannot proceed with consideration of your article until this information has been declared.

Please include your updated Competing Interests statement in your cover letter; we will change the online submission form on your behalf.

Please know it is PLOS ONE policy for corresponding authors to declare, on behalf of all authors, all potential competing interests for the purposes of transparency. PLOS defines a competing interest as anything that interferes with, or could reasonably be perceived as interfering with, the full and objective presentation, peer review, editorial decision-making, or publication of research or non-research articles submitted to one of the journals. Competing interests can be financial or non-financial, professional, or personal. Competing interests can arise in relationship to an organization or another person. Please follow this link to our website for more details on competing interests: http://journals.plos.org/plosone/s/competing-interests

4.Thank you for submitting the above manuscript to PLOS ONE. During our internal evaluation of the manuscript, we found significant text overlap between your submission and the following previously published works:

- https://www.medrxiv.org/content/10.1101/2020.09.02.20185892v1

- http://eprints.whiterose.ac.uk/165084/1/2020.08.10.20171496v1.full.pdf

We would like to make you aware that copying extracts from previous publications, especially outside the methods section, word-for-word is unacceptable. In addition, the reproduction of text from published reports has implications for the copyright that may apply to the publications.

Please revise the manuscript and tables to rephrase or remove the duplicated text, cite your sources, and provide details as to how the current manuscript advances on previous work. Please note that further consideration is dependent on the submission of a manuscript that addresses these concerns about the overlap in text with published work.

We will carefully review your manuscript upon resubmission, so please ensure that your revision is thorough.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Partly

Reviewer #2: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: The authors have derived a severity of illness / probability of adverse outcome score for patients with suspected COVID-19, who presented to hospital. Their data was derived from 70 centres in the UK and included roughly 22,500 individual patient initial assessments. The primary outcome was death or ongoing organ support at 30 days. 15% of the patients had this outcome. Their score combines NEWS2, age , sex and performance status.

General comments

1. The patient data was derived during the first wave of COVID-19 in the UK. The second wave in the UK, and elsewhere, appears to have some different characteristics. Could the authors comment on this and / or speculate on the need to repeat their data collection in other countries / during the current (second) wave, to establish whether it c statistic is affected?

2. Might the authors data have a significant selection bias in that the location of data collection was emergency departments? Patients will have presented to primary care, Acute Medical Units etc etc hence how certain can we be that this result is generalisable?

3. Do the authors have any data to suggest what the short-term outcome of their patients was - e.g. how many were receiving level 1, level 2 and level 3 care at 72 hours post presentation? If not, might this secondary outcome be useful in future studies?

4. Could the authors please include the data for patient sex in Table 2?

5. Could the authors comment on the fact that ~67% of patients were admitted but only 31% were SARS-CoV-2 positive?

6. Could the authors comment further on the very high proportion of missing data related to supplemental oxygen, and the much larger proportion in the validation cohort?

7. Could the authors please provide more detail in figure 3 , in particular, the number of patients with each score and the probabilities for each score 17-29?

8. Given that the authors cut off for risk of a "bad outcome" is stated as a score >4 / probability of >9%, which can be achieved by being a 50 year old male with a temperature of 38.1 and a heart rate of 91 seems poorly calibrated to real life?

9. As currently written, I am unconvinced that the authors addition of age, sex and performance status adds anything useful to NEWS2 as a triage tool, especially in the absence of knowing whether the patient is positive for SARS-CoV-2. Perhaps they could clarify their justification for ading these variables to their real-world, pre-diagnosis target population?

Reviewer #2: In this study, the authors aimed to develop and validate a triage tool, based on clinical assessment alone, for predicting death or organ support at Day-30 in acutely ill adults with suspected COVID-19 infection. In the first part of the study (derivation cohort of 11773 patients), using multivariable analysis, the authors identified a restricted number of variables with the best prognostic value, clinical relevance and availability and then assigned integer values to each, resulting in a composite score in which the higher the value, the poorer the prognosis. In the second part of the study (validation cohort of 9118 patients), the authors tested their score and its ability to predict adverse outcomes. They found that a score including the NEWS2 score, age, sex and performance status could predict Day-30 adverse outcomes with high sensitivity but low specificity. The study is well-written and easy to read. Methods are well described and explained and the statistical analysis is appropriate. Results are interesting and the score could be useful in clinical practice in the event of a third pandemic wave. The two main strengths of the study are the large sample size and the appropriate method for building the score. One of the main limitations is the added-value of this new score, compared to existing triage scores for patients admitted to emergency department. I have some additional concerns that need to be discussed.

1. Why did you consider patients with suspected and not confirmed COVID-19? In fact, this score could be used for all patients admitted to the emergency department, regardless of their COVID-19 status and is therefore not very different from existing triage scores. It would have been more interesting to develop a specific score for COVID-19 patients. Please clarify this point. In this regard, it would be interesting to repeat the analysis only in patients with confirmed COVID-19 if you are convinced of the need to develop specific predictive scores for COVID-19 patients.

2. What was the proportion of patients in whom COVID-19 was confirmed? This is a very important point before discussing your results, since the rationale for your study is based on the need to develop scores for COVID-19. Your results cannot be considered in the same way according to the proportion of confirmed COVID-19 patients.

3. You state in the opening of the discussion that you included the NEWS2 score in your own score in addition to age, sex and performance status. However, you never described the NEWS2 score in the manuscript and in the Figure 1 (which is a Table), which is very confusing for readers. From my point of view, it would be clearer to simply indicate all the variable you included in your score rather than talking about the News2 score. Please clearly explain what the NEWS2 score is and clarify this point.

4. In figure 2, since your score ranged from 0 to 29, why did you censor the data after a score of 16 by merging all scores > 17? Please also indicate the probability of adverse outcomes for each score > 17.

5. It is of importance to provide in the manuscript a table and/or a figure summarizing the key findings of the first part of the study (derivation cohort).

6. Please further discuss your score in the light of the existing literature, not only by considering its ability to predict adverse outcomes (c-statistic) but also by considering its potential added-value (relevance of variables, ease of use…). In addition, the ability of your score to predict adverse outcomes should be also further discussed especially the sensitivity and specificity you found.

7. It would have been very interesting to compare your score to the results of the PAINTED study and to the scores developed for influenza pandemic. Please discuss this specific point.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: Yes: Jonathan Ball

Reviewer #2: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2021 Jan 22;16(1):e0245840. doi: 10.1371/journal.pone.0245840.r002

Author response to Decision Letter 0

22 Dec 2020

Thank you for considering our paper and providing the reviewer’s thoughtful comments. We have addressed them in a revised version of our paper. Our responses to each specific comment are as follows:

Editor:

I do wish to emphasize one point, the authors need to emphasize their uniqueness among various other scores recently published.

RESPONSE: We have extensively rewritten the discussion to address this point. The second paragraph has been added to describe the methodological features, including the study population selection criteria and choice of outcome, that ensure we have developed an optimal score. The third paragraph describes how our score improves upon previously recommended scores, specifically NEWS2 and PMEWS. We have rewritten paragraph five, critiquing other prediction scores, and we have added to paragraph six on rapid clinical scores to highlight their limitations compared to ours.

Reviewer #1:

RESPONSE: We have added a sentence to the last paragraph of the discussion suggesting this as a future research priority.

RESPONSE: We agree that our data are most applicable to patients presenting to the ED and less applicable to patients presenting elsewhere. We have added consideration of this to the last paragraph of the discussion, along with a research recommendation.

RESPONSE: We have added initial location of admission (ICU, HDU or ward) to Table 1.

4. Could the authors please include the data for patient sex in Table 2?

RESPONSE: We have added patient sex to Table 2.

5. Could the authors comment on the fact that ~67% of patients were admitted but only 31% were SARS-CoV-2 positive?

RESPONSE: We have added a sentence to the methods clarify that only admitted patients were tested at the participating sites during the first wave and noting in the results that the 31% positivity rate reflects a combination of lack of testing in those discharged, suboptimal sensitivity of standard tests, and the difficulty of differentiating COVID-19 from similar presentations. We feel this represents an expected rate for a clinically relevant cohort with suspected rather than confirmed COVID-19 (see response to reviewer #2, Q1).

6. Could the authors comment further on the very high proportion of missing data related to supplemental oxygen, and the much larger proportion in the validation cohort?

RESPONSE: We recognised that missing data could be a problem, given that oxygen supplementation is often poorly recorded in clinical notes, and planned multiple analyses using different approaches to handling missing data. We identify this as a limitation in the discussion – “Recording of inspired oxygen concentration was subject to a particularly high rate of missing data. We anticipated this problem and pre-specified analyses involving multiple imputation, deterministic imputation, and complete case analysis to explore the impact of missing data. There was reasonable concordance between the models”. We randomly allocated sites to the derivation and validation cohorts, so the higher rate of missing data in the validation cohort reflects random allocation of sites with higher missing data to the validation cohort.

7. Could the authors please provide more detail in figure 3, in particular, the number of patients with each score and the probabilities for each score 17-29?

RESPONSE: We have added the number of patients with each score and the probabilities for each score 17-29.

RESPONSE: We have added to paragraph eight of the discussion to address this important point. The key issue is that the score was developed upon, and therefore applies to, people who have attended the ED with suspected COVID-19. A patient with the characteristics described who did not need to attend the ED would probably not have a high risk of adverse outcome. However, our findings show that in patients attending the ED with a high baseline risks even people with a little or no physiological abnormality have a significant risk of adverse outcome. This does not necessarily mean they have to be admitted, but they should not be discharged with false reassurance that they are at low risk.

9. As currently written, I am unconvinced that the authors addition of age, sex and performance status adds anything useful to NEWS2 as a triage tool, especially in the absence of knowing whether the patient is positive for SARS-CoV-2. Perhaps they could clarify their justification for adding these variables to their real-world, pre-diagnosis target population?

RESPONSE: We have added the third paragraph of the discussion to address this point. Adding these variables improves the prediction, as measured by the c-statistic. The practical impact of this is to ensure that pre-morbid risk factors are taken into account when using NEWS2. Thus, a young person with unrestricted performance status is identified as being at low risk of adverse outcome, even if their NEWS2 score is 3-4, whereas an older person is identified as being at higher risk of adverse outcome, even if their NEWS2 score is 0-2. Decision-making on the basis of NEWS2 alone is likely to result in over-triage of younger, healthy people to unnecessary admission, and under-triage of older people with restricted performance status to inappropriate discharge.

Reviewer #2:

The study is well-written and easy to read. Methods are well described and explained and the statistical analysis is appropriate. Results are interesting and the score could be useful in clinical practice in the event of a third pandemic wave. The two main strengths of the study are the large sample size and the appropriate method for building the score. One of the main limitations is the added-value of this new score, compared to existing triage scores for patients admitted to emergency department. I have some additional concerns that need to be discussed.

RESPONSE: We have extensively rewritten the discussion, with the addition of paragraphs two and three, to describe the added value of the new score.

RESPONSE: We have added to the first paragraph of the introduction to explain why we selected those with suspected COVID-19 and added to the second paragraph of the discussion why we feel this is a strength of our study. ED triage tools need to be used prospectively, when the diagnosis of COVID-19 is still only suspected in most cases, rather than retrospectively, when it has been confirmed. Rapid tests may be used to confirm COVID-19 in the ED, but limited sensitivity means COVID-19 cannot be ruled out in cases with a strong clinical suspicion, so COVID-19 still needs to be suspected in the absence of positive testing. Furthermore, despite much talk of a roll-out of rapid tests, we are now well into the second wave of the pandemic and use of rapid tests remains limited.

It would be interesting to repeat the analysis only in those with confirmed COVID-19 but the lack of testing on those discharged would mean that this effectively limits the cohort to those admitted, thus incurring the issues associated with limiting the cohort to patients whose selection for admission is likely to be based on key predictor variables.

RESPONSE: Table 1 shows that COVID-19 was confirmed in 31.1% of the derivation cohort and 31.4% of the validation cohort. We have added a sentence to the methods to clarify that only admitted patients were tested and noting in the results that the 31% positivity rate reflects a combination of lack of testing in those discharged, suboptimal sensitivity of standard tests, and the difficulty of differentiating COVID-19 from similar presentations. We feel this is an expected rate for a clinically-relevant cohort with suspected rather than confirmed COVID-19.

RESPONSE: We have added Appendix 3 to describe the NEWS2 score.

RESPONSE: We collapsed scores above 17 into one column because the numbers in each strata above 17 were relatively small, and thus subject to greater random variation. We have amended the figure to include separate scores above 17.

5. It is of importance to provide in the manuscript a table and/or a figure summarizing the key findings of the first part of the study (derivation cohort).

RESPONSE: We have added a new figure (Figure 1) summarising the derivation part of the study and have renumbered subsequent figures.

RESPONSE: We have added paragraph three of the discussion and extended paragraph five to discuss our score in the light of existing literature and consider its potential added value, and have extended paragraph eight to discuss how it could be used to predict adverse outcome.

7. It would have been very interesting to compare your score to the results of the PAINTED study and to the scores developed for influenza pandemic. Please discuss this specific point.

RESPONSE: We have added paragraph three of the discussion to address this point. We used data from our previous analysis of existing scores in the PRIEST data to compare our score to those developed for the influenza pandemic, since this provides much greater precision than the PAINTED data and a direct comparison using the same cohort.

Attachment

Submitted filename: Response to review 1.docx

Click here for additional data file.^{(18.4KB, docx)}

PLoS One. doi: 10.1371/journal.pone.0245840.r003

Decision Letter 1

Itamar Ashkenazi

4 Jan 2021

PONE-D-20-34978R1

Derivation and validation of a clinical severity score for acutely ill adults with suspected COVID-19: The PRIEST observational cohort study

PLOS ONE

Dear Dr. Goodacre,

Thank you for submitting your manuscript to PLOS ONE. Before making the last decision, I wish you would address the issues raised by one of the reviewer's whose comments are attached below.

Please submit your revised manuscript by Feb 18 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.
A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.
An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

We look forward to receiving your revised manuscript.

Kind regards,

Itamar Ashkenazi

Academic Editor

PLOS ONE

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: (No Response)

Reviewer #2: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

Reviewer #1: No

Reviewer #2: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

Reviewer #1: Yes

Reviewer #2: Yes

**********

6. Review Comments to the Author

Reviewer #1: Thank you for considering the points raised by both reviewers and making related changes to the manuscript.

I have the following residual concerns.

1. The primary outcome has been chosen for pragmatic reasons however predicting death or the need for organ support in the 30 days following presentation to ED ignores at least 2 very important groups, namely those who were admitted but did not required organ support, and those that did not receive organ support but did die after 30 days.

2. A minor point related to the penultimate paragraph on page 6, please include the total number of deaths observed - all that is stated is that the number was >200 [being 1% of the 20,000 used in the undocumented power calculation].

3. The proportion of missing data and the large difference between the derivation and validation cohorts in the "air or supplemental oxygen" question are problematic.

4. The first sentence of the first paragraph in the discussion is missing several vital caveats as follows:

"We have developed a clinical illness severity score for acutely ill patients, WHO PRESENTED TO EMERGENCY DEPARTMENTS IN THE UK BETWEEN MARCH AND JUNE[??] with suspected COVID-19" the clinical context, geography and calendar are critical.

" . . . to predict the risk of death or receipt of organ support IN THE FOLLOWING 30 DAYS"

". . . predicting adverse outcome with A high sensitivity BUT A VERY LOW SPECIFICITY."

5. The authors appear to be be simultaneously proposing their score be used to inform triage decisions in ED with a clear bias towards overtriage i.e. admitting some patients who at very low risk of the adverse outcome they define; and yet give a clinical example in which "patient preference" rather than clinical judgement should be a relevant factor. The latter assumes that the triage decision is not being made in the clinical context of a stressed / overwhelmed hospital system.

6. Similarly, the authors make the case that their score is superior to certain others because it includes acute physiology plus age and performance status but seem to believe that clinicians do not take these chronic factors into account when using acute scores to assist in objectively grading severity of illness. Furthermore, do the authors really believe that their c-statistic of 0.80 is clinically rather than statistically better than the 0.75 and 0.77 for CURB-65 and NEWS2?

7. The exclusion of any discriminating laboratory / diagnostic tests from a population who have attended a hospital appears to be an effort to provide a means of turning patients away at first triage. The authors have not demonstrated that the addition of any such variables fails to improve the discrimination, especially the specificity of their score. Given that accurate and rapid point-of-care testing is now widely available, is turnaround time in ED too critical to have considered the potential value of such tests?

8. The authors should comment on the QCOVID score - https://pubmed.ncbi.nlm.nih.gov/33082154/ - and consider the accompanying editorial.

Reviewer #2: Dear authors, you have taken into account all my comments and suggestions and provided a detailed point-by-point response. The manuscript has been significantly improved and I have no additional comments. Best regards.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: Yes: Jonathan Ball

Reviewer #2: No

PLoS One. 2021 Jan 22;16(1):e0245840. doi: 10.1371/journal.pone.0245840.r004

Author response to Decision Letter 1

4 Jan 2021

Thank you for considering our paper and providing the additional reviewer comments. We have revised the paper to address them and outline our responses to each point below.

Reviewer #1: Thank you for considering the points raised by both reviewers and making related changes to the manuscript.

I have the following residual concerns.

RESPONSE: We have added this to the limitations section of the discussion.

RESPONSE: We have added the adverse outcome rate in our cohort to this paragraph (not the death rate, since the sample size estimate was based on 200 adverse events, rather than deaths).

3. The proportion of missing data and the large difference between the derivation and validation cohorts in the "air or supplemental oxygen" question are problematic.

RESPONSE: We acknowledge in the discussion that the high rate of missing data for "air or supplemental oxygen" is an important limitation and describe the approaches we took to handling missing data. We randomly allocated sites to derivation and validation cohorts, so the difference between the cohorts is unlikely to be systematic.

4. The first sentence of the first paragraph in the discussion is missing several vital caveats as follows:

" . . . to predict the risk of death or receipt of organ support IN THE FOLLOWING 30 DAYS"

". . . predicting adverse outcome with A high sensitivity BUT A VERY LOW SPECIFICITY."

RESPONSE: We have added these caveats, with the exception of the location and timing of data collection. Science inevitably involves generalizing findings beyond the analysed data. In the discussion we consider potential limitations on generalizability and the need to validate in other settings and waves of the pandemic.

5. The authors appear to be simultaneously proposing their score be used to inform triage decisions in ED with a clear bias towards overtriage i.e. admitting some patients who at very low risk of the adverse outcome they define; and yet give a clinical example in which "patient preference" rather than clinical judgement should be a relevant factor. The latter assumes that the triage decision is not being made in the clinical context of a stressed / overwhelmed hospital system.

RESPONSE: We address this issue in the discussion and have added availability of resources to the factors that need to be considered alongside risk of adverse outcome in clinical decision-making.

RESPONSE: We believe that it is up to clinicians how they use our score, or any alternative, to predict adverse outcome and support decision-making. Our analysis presents estimates of prognostic accuracy to inform clinicians in making their choices. We believe that a c-statistic of 0.80 is clinically significantly better than a c-statistic of 0.75 or 0.77, but our belief is not important. What is important is that we have presented our findings in a transparent manner that allows readers to judge for themselves whether the additional complexity of our score compared to NEWS2 or CURB-65 is worth the improved prognostic value.

RESPONSE: We have described the reasons why we think that a clinical score (without laboratory or radiological tests) is most useful and of greatest interest to clinicians working in the emergency department. This is why we have made this the principal output of our analysis. We have data available to explore whether laboratory and radiological information can improve prediction, but for the reasons outlined in our paper, we have not made this a priority in our plans for analysis and dissemination.

8. The authors should comment on the QCOVID score - https://pubmed.ncbi.nlm.nih.gov/33082154/ - and consider the accompanying editorial.

RESPONSE: The QCOVID score was developed in a population cohort study to predict the risk of death from COVID-19 in the general population. It is clearly very useful for guiding public health interventions, such as shielding for high-risk individuals and targeting vaccination, but has a very different purpose to our score, which is aimed at predicting the risk of adverse outcome among people who are acutely ill with COVID-19.

RESPONSE: Thank you

Attachment

Submitted filename: Response to review 2.docx

Click here for additional data file.^{(14.6KB, docx)}

PLoS One. doi: 10.1371/journal.pone.0245840.r005

Decision Letter 2

Itamar Ashkenazi

11 Jan 2021

Derivation and validation of a clinical severity score for acutely ill adults with suspected COVID-19: The PRIEST observational cohort study

PONE-D-20-34978R2

Dear Dr. Goodacre,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Itamar Ashkenazi

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

PLoS One. doi: 10.1371/journal.pone.0245840.r006

Acceptance letter

Itamar Ashkenazi

15 Jan 2021

PONE-D-20-34978R2

Derivation and validation of a clinical severity score for acutely ill adults with suspected COVID-19: The PRIEST observational cohort study

Dear Dr. Goodacre:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Itamar Ashkenazi

Academic Editor

PLOS ONE

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

S1 Fig. Calibration plot for unrestricted LASSO model performance, validation cohort.

(TIF)

Click here for additional data file.^{(526.8KB, tif)}

S2 Fig. Calibration plot for restricted LASSO model performance, validation cohort.

(TIF)

Click here for additional data file.^{(528.7KB, tif)}

S3 Fig. ROC curve for the tool predicting the secondary outcome of organ support, validation cohort.

(TIF)

Click here for additional data file.^{(77.9KB, tif)}

S4 Fig. ROC curve for tool predicting the secondary outcome of death without organ support, validation cohort.

(TIF)

Click here for additional data file.^{(77.7KB, tif)}

S1 Table. Multivariable analysis, complete case (N = 5988).

(DOCX)

Click here for additional data file.^{(28.2KB, docx)}

S2 Table. Multivariable analysis, using multiple imputation (50 imputations; N = 11636).

(DOCX)

Click here for additional data file.^{(31.9KB, docx)}

S3 Table. Multivariable analysis, using deterministic imputation (N = 9891).

(DOCX)

Click here for additional data file.^{(28.5KB, docx)}

S4 Table. Logistic regression model based on selected categorised predictor variables.

(DOCX)

Click here for additional data file.^{(31.3KB, docx)}

S5 Table. Logistic regression model based on selected categorised predictor variables, excluding respiratory distress and history of renal impairment.

(DOCX)

Click here for additional data file.^{(29.9KB, docx)}

S6 Table. Sensitivity, specificity, PPV, NPV at each score threshold for predicting the secondary outcome of organ support, validation cohort.

(DOCX)

Click here for additional data file.^{(25.1KB, docx)}

S7 Table. Sensitivity, specificity, PPV, NPV at each score threshold for predicting the secondary outcome of death without organ support, validation cohort.

(DOCX)

Click here for additional data file.^{(24.4KB, docx)}

S1 Appendix. Standardised data collection form.

(PDF)

Click here for additional data file.^{(130.1KB, pdf)}

S2 Appendix. Follow-up form.

(PDF)

Click here for additional data file.^{(150.8KB, pdf)}

S3 Appendix. The NEWS2 score.

(DOCX)

Click here for additional data file.^{(23.2KB, docx)}

S4 Appendix. Study steering committee.

(DOCX)

Click here for additional data file.^{(15.5KB, docx)}

S5 Appendix. Site research staff.

(DOCX)

Click here for additional data file.^{(15.9KB, docx)}

S6 Appendix. Supporting research staff.

(DOCX)

Click here for additional data file.^{(12.3KB, docx)}

Attachment

Submitted filename: Response to review 1.docx

Click here for additional data file.^{(18.4KB, docx)}

Attachment

Submitted filename: Response to review 2.docx

Click here for additional data file.^{(14.6KB, docx)}

Data Availability Statement

Data are available in the in the ORDA data repository: http://doi.org/10.15131/shef.data.13194845.

[pone.0245840.ref001] 1.World Health Organisation. Clinical care of severe acute respiratory infections–Tool kit. https://www.who.int/publications-detail/clinical-care-of-severe-acute-respiratory-infections-tool-kit (accessed 28/04/2020)

[pone.0245840.ref002] 2.International Federation for Emergency Medicine. Global Response to COVID-19 for Emergency Healthcare Systems and Providers: From the IFEM Task Force on ED Crowding and Access Block. https://www.ifem.cc/coronavirus-2019-information/ (accessed 15/06/2020)

[pone.0245840.ref003] 3.NHS. Clinical guide for the management of emergency department patients during the coronavirus pandemic. 17 March 2020 Version 1 https://www.england.nhs.uk/coronavirus/secondary-care/other-resources/specialty-guides/#ae (accessed 15/06/2020)

[pone.0245840.ref004] 4.National Institute for Health and Care Excellence. COVID-19 rapid guideline: managing suspected or confirmed pneumonia in adults in the community. Published: 3 April 2020. www.nice.org.uk/guidance/ng165 (accessed 28/04/2020) [PubMed]

[pone.0245840.ref005] 5.American College of Emergency Physicians. Guide to Coronavirus Disease (COVID-19) https://www.acep.org/corona/covid-19-field-guide/cover-page/

[pone.0245840.ref006] 6.Goodacre S, Irving A, Wilson R, Beever D, Challen K. The PAndemic INfluenza Triage in the Emergency Department (PAINTED) pilot cohort study. Health Technol Assess 2015;19(3):1–70. 10.3310/hta19030 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0245840.ref007] 7.Goodacre S, Thomas B, Lee E, Sutton L, Loban A, Waterhouse S, et al. (2020) Characterisation of 22445 patients attending UK emergency departments with suspected COVID-19 infection: Observational cohort study. PLoS ONE 15(11): e0240206 10.1371/journal.pone.0240206 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0245840.ref008] 8.Lim W, van der Eerden MM, Laing R, Boersma W, Karalus N, Town G, et al. Defining community acquired pneumonia severity on presentation to hospital: an international derivation and validation study. Thorax 2003; 58(5): 377–382. 10.1136/thorax.58.5.377 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0245840.ref009] 9.Royal College of Physicians. (2017). National Early Warning Score (NEWS) 2: Standardising the assessment of acute-illness severity in the NHS. Updated report of a working party. London: RCP. [Google Scholar]

[pone.0245840.ref010] 10.Challen K, Bright J, Bentley A, Walter D. Physiological-social score (PMEWS) vs. CURB-65 to triage pandemic influenza: a comparative validation study using community-acquired pneumonia as a proxy. BMC Health Serv Res. 2007; 7: 33 10.1186/1472-6963-7-33 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0245840.ref011] 11.Thomas B, Biggs K, Goodacre S, et al. Prognostic accuracy of emergency department triage tools for adults with suspected COVID-19: The PRIEST observational cohort study. [Prerpint] medRxiv 2020. 09.01.20185793; https://www.medrxiv.org/content/10.1101/2020.09.02.20185892v1 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0245840.ref012] 12.Public Health England. COVID-19: investigation and initial clinical management of possible cases. https://www.gov.uk/government/publications/wuhan-novel-coronavirus-initial-investigation-of-possible-cases/investigation-and-initial-clinical-management-of-possible-cases-of-wuhan-novel-coronavirus-wn-cov-infection#criteria (accessed 27/04/2020)

[pone.0245840.ref013] 13.Hirst E, Irving A, Goodacre S. Patient and public involvement in emergency care research. Emerg Med J 2016;33:665–670. 10.1136/emermed-2016-205700 [DOI] [PubMed] [Google Scholar]

[pone.0245840.ref014] 14.Wynants Laure, Van Calster Ben, Collins Gary S, Riley Richard D, Heinze Georg, Schuit Ewoud et al. Prediction models for diagnosis and prognosis of covid-19: systematic review and critical appraisal BMJ 2020; 369: m1328 10.1136/bmj.m1328 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0245840.ref015] 15.Knight Stephen R, Ho Antonia, Pius Riinu, Buchan Iain, Carson Gail, Drake Thomas M et al. Risk stratification of patients admitted to hospital with covid-19 using the ISARIC WHO Clinical Characterisation Protocol: development and validation of the 4C Mortality Score BMJ 2020; 370: m3339 10.1136/bmj.m3339 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0245840.ref016] 16.Liao X, Wang B, Kang Y. Novel coronavirus infection during the 2019–2020 epidemic: preparing intensive care units—the experience in Sichuan Province, China. Intensive Care Med 46, 357–360 (2020). 10.1007/s00134-020-05954-2 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0245840.ref017] 17.Myrstad M, Ihle-Hansen H, Tveita AA, Andersen EL, Nygård S, Tveit A, et al. National Early Warning Score 2 (NEWS2) on admission predicts severe disease and inhospital mortality from Covid-19 –a prospective cohort study. Scandinavian Journal of Trauma, Resuscitation and Emergency Medicine 2020; 28:66 10.1186/s13049-020-00764-3 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0245840.ref018] 18.Hu H, Yao N, Qiu Y. Comparing Rapid Scoring Systems in Mortality Prediction of Critically Ill Patients With Novel Coronavirus Disease. Acad Emerg Med. 2020;27(6):461–468. 10.1111/acem.13992 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0245840.ref019] 19.Haimovich A, Ravindra NG, Stoytchev S, Young HP, PerryWilson F, van Dijk D, et al. , Development and validation of the quick COVID-19 severity index (qCSI): a prognostic tool for early clinical decompensation Annals of Emergency Medicine (2020), 10.1016/j.annemergmed.2020.07.022. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Derivation and validation of a clinical severity score for acutely ill adults with suspected COVID-19: The PRIEST observational cohort study

Steve Goodacre

Ben Thomas

Laura Sutton

Matthew Burnsall

Ellen Lee

Mike Bradburn

Amanda Loban

Simon Waterhouse

Richard Simmonds

Katie Biggs

Carl Marincowitz

Jose Schutter

Sarah Connelly

Elena Sheldon

Jamie Hall

Emma Young

Andrew Bentley

Kirsty Challen

Chris Fitzsimmons

Tim Harris

Fiona Lecky

Andrew Lee

Ian Maconochie

Darren Walter

Roles

Abstract

Objectives

Methods

Results

Conclusion

Registration

Introduction

Materials and methods

Patient and public involvement

Ethical approval

Results

Table 1. Characteristics of the study population (derivation and validation cohorts).

Table 2. Univariate analysis of predictor variables for adverse outcome (derivation cohort).

Fig 1. Summary of the derivation process.

Fig 2. The PRIEST COVID-19 clinical severity score.

Fig 3. ROC curve for the tool predicting the primary outcome of death or organ support, validation cohort.

Fig 4. Probability of adverse outcome for each value of the score, validation cohort.

Table 3. Sensitivity, specificity, PPV, NPV and proportion with a positive score at each score threshold for predicting the primary outcome of death or organ support, validation cohort.

Discussion

Supporting information

Acknowledgments

Data Availability

Funding Statement

References

Decision Letter 0

Itamar Ashkenazi

Roles

Transfer Alert

Author response to Decision Letter 0

Decision Letter 1

Itamar Ashkenazi

Roles

Author response to Decision Letter 1

Decision Letter 2

Itamar Ashkenazi

Roles

Acceptance letter

Itamar Ashkenazi

Roles

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases