Skip to main content
PLOS ONE logoLink to PLOS ONE
. 2021 Mar 4;16(3):e0247676. doi: 10.1371/journal.pone.0247676

Development and validation of a predictive model of in-hospital mortality in COVID-19 patients

Diego Velasco-Rodríguez 1, Juan-Manuel Alonso-Dominguez 1,*, Rosa Vidal Laso 1, Daniel Lainez-González 1, Aránzazu García-Raso 1, Sara Martín-Herrero 1, Antonio Herrero 2, Inés Martínez Alfonzo 1, Juana Serrano-López 1, Elena Jiménez-Barral 1, Sara Nistal 3, Manuel Pérez Márquez 4, Elham Askari 1, Jorge Castillo Álvarez 5, Antonio Núñez 6, Ángel Jiménez Rodríguez 7, Sarah Heili-Frades 8, César Pérez-Calvo 4, Miguel Górgolas 5, Raquel Barba 3, Pilar Llamas-Sillero 1
Editor: Aleksandar R Zivkovic9
PMCID: PMC7932507  PMID: 33661939

Abstract

We retrospectively evaluated 2879 hospitalized COVID-19 patients from four hospitals to evaluate the ability of demographic data, medical history, and on-admission laboratory parameters to predict in-hospital mortality. Association of previously published risk factors (age, gender, arterial hypertension, diabetes mellitus, smoking habit, obesity, renal failure, cardiovascular/ pulmonary diseases, serum ferritin, lymphocyte count, APTT, PT, fibrinogen, D-dimer, and platelet count) with death was tested by a multivariate logistic regression, and a predictive model was created, with further validation in an independent sample. A total of 2070 hospitalized COVID-19 patients were finally included in the multivariable analysis. Age 61–70 years (p<0.001; OR: 7.69; 95%CI: 2.93 to 20.14), age 71–80 years (p<0.001; OR: 14.99; 95%CI: 5.88 to 38.22), age >80 years (p<0.001; OR: 36.78; 95%CI: 14.42 to 93.85), male gender (p<0.001; OR: 1.84; 95%CI: 1.31 to 2.58), D-dimer levels >2 ULN (p = 0.003; OR: 1.79; 95%CI: 1.22 to 2.62), and prolonged PT (p<0.001; OR: 2.18; 95%CI: 1.49 to 3.18) were independently associated with increased in-hospital mortality. A predictive model performed with these parameters showed an AUC of 0.81 in the development cohort (n = 1270) [sensitivity of 95.83%, specificity of 41.46%, negative predictive value of 98.01%, and positive predictive value of 24.85%]. These results were then validated in an independent data sample (n = 800). Our predictive model of in-hospital mortality of COVID-19 patients has been developed, calibrated and validated. The model (MRS-COVID) included age, male gender, and on-admission coagulopathy markers as positively correlated factors with fatal outcome.

Introduction

Initial symptoms of the disease produced by SARS-CoV-2, 2019-nCoV (COVID-19), are similar to other viral syndromes, but COVID-19 has the potential to develop a systemic inflammatory response syndrome, acute respiratory distress syndrome (ARDS), multi-organ failure and shock, especially in older patients with comorbidities [14]. The COVID-19 pandemic has spread to the whole world [2], causing over 1.5 million deaths to date.

Several factors have been correlated with higher mortality in these patients: older age [57], male gender [6, 8], arterial hypertension [8], diabetes [5, 8], smoking [8], obesity [7], cardiac and pulmonary pathology [5, 8], and lymphopenia [9].

Several studies have described that severe COVID-19 disease is frequently complicated with coagulopathy [1013]. However, COVID-19 associated coagulopathy behaves predominantly as a pro-thrombotic status rather than a bleeding disorder [11, 14]. High fibrinogen levels and normal or slightly low platelet counts are usually found [11, 14], unlike “classical” overt DIC [15]. These patients show not only high venous thromboembolism (VTE) rates, up to 16–27%, even despite having received adequate VTE prophylaxis [16, 17], but also cardiovascular complications [18]. Pathological evidence of pulmonary microthrombosis in severe cases has also been provided [19]. Among coagulation parameters, elevated D-dimer levels show a strong correlation with mortality [3, 13, 20].

Early and effective predictive models of clinical outcomes are necessary for risk stratification of hospitalized COVID-19 patients, especially if there is a high volume of patients consulting in the emergency departments [11]. Clinicians need better predictors of mortality and tools capable to detect which patients are prone to deteriorate rapidly. Our aim was to evaluate the ability of demographic data, medical history, and on-admission laboratory parameters to predict mortality in hospitalized COVID-19 patients.

Materials and methods

Patients and sample handling

Two thousand eight hundred and seventy nine consecutive hospitalized adult patients with confirmed moderate or severe COVID-19 from four hospitals [Hospital General de Villalba (Collado Villalba, Madrid), Hospital Infanta Elena (Valdemoro, Madrid), Hospital Universitario Rey Juan Carlos (Móstoles, Madrid) and Hospital Universitario Fundación Jiménez Díaz in Madrid] from February 27 to April 17, 2020, were retrospectively evaluated. COVID-19 was considered at least moderate and required hospitalization if any of these criteria was met: CURB-65 score >2 or FINE>II, peripheral capillary oxygen saturation (SpO2) <93% or respiratory rate >20 breaths per minute or PaO2 <65 mmHg, bilateral infiltrates in chest X-ray, ARDS or sepsis/septic shock. All patients received protocolized pharmacological and supportive treatment after admission, and VTE prophylaxis with low molecular weight heparin. Demographic data and medical history of arterial hypertension, diabetes mellitus, smoking habit, obesity [body mass index (BMI) ≥30 kg/m2], renal failure [estimated glomerular filtration rate (eGFR) by CKD-EPI <60 ml/min/1.73m2], cardiovascular diseases and pulmonary diseases were obtained. Cardiovascular diseases included arrhythmia, congestive heart failure, ischemic heart disease, valvulopathy and hypertensive cardiomyopathy. Pulmonary diseases included chronic obstructive pulmonary disease, asthma, obstructive sleep apnea and pulmonary tuberculosis. Patients were considered to have thrombocytopenia when platelet count was lower than 140 x109/l, prolonged PT when PT was higher than 14 seconds, and elevated ferritin when serum ferritin levels were higher than 400 ng/ml.

Data were obtained from a big data research using extract transform load (ETL) tools and natural language processing (NLP) with our Huawei (Huawei Technologies Co., Ltd., Shenzhen, China) platform and the collaboration of Indizen-Scalian (Madrid, Spain). The clinical outcomes were monitored up to April 17, 2020. Only those patients that had been discharged from hospital or those who had died were finally recruited. Exclusion criteria: patients who remained hospitalized at the time of analysis and patients on chronic anticoagulant treatment before hospitalization. A flow diagram of the sample selection and study design is shown in Fig 1. The diagnosis of COVID-19 was made according to World Health Organization interim guidance [21] and confirmed by RNA detection of the 2019-nCoV in the clinical laboratory of Hospital Universitario Fundación Jiménez Díaz.

Fig 1. Flow diagram of the sample selection and study design.

Fig 1

Laboratory tests

D-dimer levels were determined on ACL Top 700 analyzer (Instrumentation Laboratory, Bedford, MA, USA) using a highly sensitive assay (IL D-dimer HS 500). Prothrombin time (PT), activated partial thromboplastin time (APTT), and fibrinogen were also determined on ACL Top 700 analyzer. Complete blood count was determined on Sysmex XN-1000 analyzer (Sysmex, Kobe, Japan). Serum ferritin levels were determined on Roche Cobas 6000 (Roche Diagnostics, Mannheim, Germany).

Ethics statement

This observational study followed the ethical principles of the Helsinki Declaration and was previously approved by the Ethics Committee for Clinical Research of the Hospital Universitario Fundación Jiménez Díaz on April 14, 2020. Medical records of all the patients included were accessed from April 1 to May 15, 2020. All data were fully anonymized before we accessed them. Due to the retrospective nature of our study, the ethics committee waived the requirement for informed consent.

Statistical analysis

All the laboratory results analyzed (serum ferritin, lymphocyte count, APTT, PT, fibrinogen levels, D-dimer levels, and platelet count) were the first determination of each parameter, which had been performed either in the emergency department or within 3 days from admission to ward. Age, gender and chronic comorbidities (arterial hypertension, diabetes mellitus, obesity, smoking habit, renal failure, cardiovascular disease and pulmonary disease) were also analyzed. Statistical comparisons of survivors and non-survivors were calculated using the chi-square test for categorical variables and Student’s t test for continuous variables. The results were expressed as mean ± standard deviation if normal distributed, and as median (25–75 percentiles) if skewed, and numbers (percentage). Two-sided p values less than 0.05 were considered statistically significant.

In order to simplify the score and increase its reproducibility and applicability in other hospitals and countries, the statistically significant quantitative variables were categorized. Age was splitted into 5 subgroups (≤50, 51–60, 61–70, 71–80, and >80 years-old) since it is the most important prognostic factor [5]. We applied a previously published cut-off for D-dimer levels ≤1000 μg/l [two-fold increase of upper limit of normality (ULN)] or >2 ULN [3], whereas the other two variables were categorized into two subgroups according to their normality range: PT ≤14 or >14 seconds and platelet count ≤140 x109/l or >140 x109/l.

Statistically significant variables in the categorical analysis were included in a logistic regression model, performed in a randomly selected training cohort including around 60% of the total amount of patients. Missing data were estimated by multiple imputation with 50 different estimations performed [22]. Enter method was employed with Wald P values. In order to achieve a better adjustment of the model, once significant variables were identified, a new model including only these variables was estimated. Logistic regression coefficients and P values shown were obtained from the pooled analysis. Brier score analysis was calculated and odds predicted by the model were analyzed by using a ROC analysis. Prognostic features of the model in both cohorts were calculated by using a complete case analysis. A cut-off was selected based on its sensitivity and specificity. The logistic regression coefficients and the cut-off selected were validated in a different cohort composed of around 40% of total patients. Sensitivity, specificity, and predictive values in both cohorts were assessed and two-sided confidence intervals (CI) were calculated by the Wilson method. This was carried out using the Domenech Macro! DTfor SPSS (http://www.metodo.uab.cat/macros.htm). All statistical tests were performed in SPSS version 19.0 statistics package.

We adhered to the transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD) statement for reporting [23].

Results

A total of 2879 moderate to severe COVID-19 hospitalized patients were initially evaluated for inclusion in the development cohort. Of these, 809 were excluded: 515 remained hospitalized at the time of analysis and 294 were on chronic anticoagulant treatment before hospitalization. The final sample consisted of 2070 patients (884 females and 1186 males) with definite outcomes: 1677 (81.01%) patients had been discharged (survivors) and 393 (18.99%) patients had died (non-survivors).

The laboratory parameters and clinical characteristics of the patients at baseline are presented in Table 1 for all patients, survivors and non-survivors; data on some variables were missing for some patients. The mean age at disease onset was 65.68 years (range, 20–104). The proportion of male patients was higher in non-survivors (20.92% vs. 16.29%, p = 0.008). The mean length of hospital stay was 6.87 days (range, 0–41) in survivors and 6.51 days (range, 0–35) in non-survivors.

Table 1. Baseline characteristics of survivors and non-survivors, and univariate analysis.

Parameters Total (n = 2070) Survivors (n = 1677) Non-survivors (n = 393) P values Odds ratio (95% CI)
Age at diagnosis (years) 67 (54–79) 63 (51–75) 81 (72–87) <0.001
Age categorized
≤50 403 (19.48) 392 (97.27) 11 (2.73) <0.001
51–60 379 (18.32) 360 (95.00) 19 (5.00) <0.001 1.88 (0.88–4.01)
61–70 413 (19.96) 363 (89.89) 50 (12.11) <0.001 4.91 (2.52–9.57)
71–80 437 (21.12) 325 (74.37) 112 (25.63) <0.001 12.28 (6.49–23.21)
>80 437 (21.12) 237 (54.23) 200 (45.77) <0.001 30.07 (16.05–56.35)
Gender
Female 884 (42.71) 740 (83.71) 144 (16.29) 0.008
Male 1186 (57.29) 938 (79.08) 248 (20.92) 0.008 1.36 (1.08–1.70)
Comorbidities- n (%)
No 752 (36.32) 686 (91.20) 66 (8.80) <0.001
Yes 1318 (63.68) 992 (75.30) 326 (24.70) <0.001 3.42 (2.58–4.53)
Arterial hypertension- n (%)
No 1140 (55.37) 1006 (88.24) 134 (11.76) <0.001
Yes 919 (44.63) 664 (72.25) 255 (27.75) <0.001 2.88 (2.29–3.63)
Diabetes mellitus- n (%)
No 1650 (80.13) 1380 (83.63) 270 (16.37) <0.001
Yes 409 (19.87) 290 (70.90) 119 (29.10) <0.001 2.09 (1.63–2.69)
Obesity- n (%)
No 707 (70.56) 588 (83.17) 119 (16.83) 0.664
Yes 295 (29.44) 242 (82.03) 53 (17.97) 0.664 1.08 (0.76–1.55)
Smoking- n (%)
No 1971 (95.86) 1599 (81.13) 372 (18.87) 0.795
Yes 85 (4.14) 68 (80.00) 17 (20.00) 0.795 1.07 (0.62–1.85)
Cardiovascular disease- n (%)
No 1732 (84.24) 1446 (83.49) 286 (16.51) <0.001
Yes 324 (15.76) 222 (68.51) 102 (31.49) <0.001 2.32 (1.78–3.03)
Pulmonary disease- n (%)
No 1716 (83.42) 1406 (81.93) 310 (18.07) 0.028
Yes 341 (16.58) 262 (76.83) 79 (23.17) 0.028 1.37 (1.03–1.81)
Chronic kidney disease- n (%)
No 1665 (95.91) 1439 (86.42) 226 (13.58) <0.001
Yes 71 (4.09) 42 (59.15) 29 (40.85) <0.001 4.39 (2.68–7.20)
Lymphocyte count (x109/l) (NR 1.2–5) 1.00 (0.70–1.30) 1.00 (0.70–1.40) 0.80 (0.52–1.10) 0.539
Platelet count (x109/l) (NR 140–450) 209.00 (161.00–274.00) 213.00 (104.00–277.25) 195.50 (145.25–256.75) <0.001
Thrombocytopenia
No 1701 (84.62) 1411 (83.00) 290 (17.00) <0.001
Yes 309 (15.38) 223 (72.16) 86 (27.84) <0.001 1.87 (1.42–2.48)
PT (seconds) (NR 10–14) 12.80 (12.10–13.80) 12.80 (12.10–13.70) 13.20 (12.40–14.30) 0.001
Prolonged PT
No 1517 (80.35) 1264 (83.32) 253 (16.68) <0.001
Yes 371 (19.65) 273 (73.58) 98 (26.42) <0.001 1.79 (1.37–2.34)
APTT (seconds) (NR 26–36) 30.40 (28.10–32.70) 30.50 (28.20–32.70) 29.80 (27.40–33.00) 0.549
Fibrinogen (mg/dl) (NR 200–400) 677.00 (568.00–801.00) 679.00 (570.00–801.00) 677.00 (564.00–805.00) 0.220
D-dimer levels (μg/l) (NR 70–500) 623.50 (336.00–1106.50) 562.00 (315.04–995.00) 1046.00 (568.35–1976.00) 0.001
Elevated D-dimer (>2 ULN)
No 1185 (70.62) 1052 (88.77) 133 (11.23) <0.001
Yes 493 (29.38) 343 (69.57) 150 (30.43) <0.001 3.46 (2.66–4.50)
Ferritin levels (ng/ml) (NR 30–400) 613.00 (314.00–1294.50) 626.50 (354.50–1313.50) 501.50 (241.00–1145.00) 0.028
Ferritin elevated
No 373 (28.15) 313 (83.91) 60 (16.09) 0.883
Yes 952 (71.85) 802 (84.24) 150 (15.76) 0.883 0.97 (0.70–1.35)

APTT = activated partial Thromboplastin time; CI = confidence interval; dl = decilitre; l = litre; mg = miligrams; ml = mililitre; ng = nanograms; NR = normal range; PT = prothrombin time; ULN = upper limit of normal range. Missing data: age (1), lymphocyte count (60), platelet count (60), PT (182), APTT (176), fibrinogen (216), D-dimer levels (392), ferritin levels (1742).

Compared with survivors, non-survivors showed higher D-dimer levels on admission, prolonged PT and lower platelet count (Table 1). No significant differences were found in smoking, obesity, lymphocyte count, APTT and fibrinogen levels. Additionally, there were no differences in the duration of hospitalization between survivors and non-survivors (6.87 ± 5.86 days vs. 6.51 ± 5.25 days, p = 0.232).

Significant differences in their in-hospital mortality were observed in categorized quantitative variables. In-hospital mortality was 2.73% in patients younger than 50 years old (used as the reference category) (p<0.001), 5% in those between 51 and 60 years [Odds ratio (OR) 1.88; 95% CI, 0.88 to 4.01], 12.11% in those between 61 and 70 years (OR 4.91; 95% CI, 2.52 to 9.57), 25.63% in the 71–80 group (OR 12.28; 95% CI, 6.49 to 23.21), and 45.77% in those older than 80 years old (OR 30.07; 95% CI, 16.05 to 56.35). The proportion of non-survivors was significantly higher in COVID-19 patients with on-admission D-dimer levels >2 ULN (p<0.001; OR 3.46; 95% CI, 2.66 to 4.50), prolonged PT (p<0.001; OR 1.79; 95% CI, 1.37 to 2.34), and thrombocytopenia (p<0.001; OR 1.87; 95% CI, 1.42 to 2.48). Additionally, OR of in-hospital mortality was higher in patients with arterial hypertension, cardiovascular diseases, pulmonary diseases, and renal failure. Although non-survivors had slightly lower serum ferritin levels, when categorized according to elevated ferritin (yes/no), no differences were found between both groups.

A total of ten parameters showed statistically significant differences between survivors and non-survivors. They were then examined in a multivariate logistic regression model including 1270 patients to identify independent prognostic factors of moderate/severe COVID-19 in-hospital mortality (Table 2). The following features were identified as independent predictors of poor outcome on multivariable analysis: age 61–70 years (p<0.001; OR: 7.69; 95%CI: 2.93 to 20.14), age 71–80 years (p<0.001; OR: 14.99; 95%CI: 5.88 to 38.22), age >80 years (p<0.001; OR: 36.78; 95%CI: 14.42 to 93.85), male gender (p<0.001; OR: 1.84; 95%CI: 1.31 to 2.58), D-dimer levels >2 ULN (p = 0.003; OR: 1.79; 95%CI: 1.22 to 1.62), and prolonged PT (p<0.001; OR: 2.18; 95%CI: 1.49 to 3.18) (Table 2). Arterial hypertension, diabetes mellitus, cardiovascular disease, pulmonary disease, renal failure, and thrombocytopenia lost their significance and were not included in the final model. D-dimer levels >1000 μg/l (2 ULN), prolonged PT, male gender, and age showed an increase in the probabilities of death. The model showed no overdispersion. The formula of MRS-COVID-19 (Mortality Risk prognostic Score for hospitalized COVID-19 patients) is:

MRSCOVID19=EXP{4.585+(0.610ifmalegender)+(0.581ifDdimer>1000)+(0.778ifprolongedPT)+[(0.855ifage5160)OR(2.04ifage6170)OR(2.708ifage7180)OR(3.605ifage>80)]}.

Table 2. Multivariate analysis.

Parameters P values Odds ratio (95% CI)
Age categorized (years) *
51–60 0.115 2.35 (0.81–6.82)
61–70 <0.001 7.69 (2.93–20.14)
71–80 <0.001 14.99 (5.88–38.22)
>80 <0.001 36.78 (14.42–93.85)
Gender (male) <0.001 1.84 (1.31–2.58)
Arterial hypertension 0.705 0.93 (0.64–1.35)
Diabetes mellitus 0.251 1.24 (0.86–1.78)
Pulmonary disease 0.303 1.24 (0.83–1.85)
Cardiovascular disease 0.945 1.01 (0.68–1.52)
Chronic kidney disease 0.455 1.31 (0.65–2.64)
Thrombocytopenia 0.215 1.29 (0.86–1.93)
Prolonged PT <0.001 2.18 (1.49–3.18)
D-dimer elevated (>2 ULN) 0.003 1.79 (1.22–2.62)
Constant <0.001 0.01 (0.00–0.03)

*Age ≤50 years was used as the reference category.

†Variable not included in the final model.

A cut-off of 0.076 was arbitrarily selected, in order to maximize sensitivity and negative predictive value. In the first cohort (n = 1270; missing data = 270), an AUC of 0.81 was obtained, with a sensitivity of 95.83% (95% CI, 91.65 to 97.97), a specificity of 41.46% (95% CI, 38.16 to 44.85), negative predictive value (NPV) of 98.01% (95% CI, 95.95 to 99.03), and a positive predictive value (PPV) of 24.85% (95% CI, 21.67 to 28.31). Mortality rate in this cohort was 16.81%. In the validation cohort (n = 800; missing data = 185), an AUC of 0.80 was obtained, with a sensitivity of 92.52% (95% CI, 85.94 to 96.16), a specificity of 41.34% (95% CI, 37.14 to 45.67), NPV of 96.33% (95% CI, 92.93 to 98.13), and a PPV of 24.94% (95% CI, 20.93 to 29.42) (Table 3). Mortality rate in the validation cohort was 17.39%, comparable to first cohort´s. Brier score was 0.11 and 0.12 in the development and validation cohorts, respectively.

Table 3. Comparison of the distribution of parameters, AUC, sensitivity, specificity, PPV and NPV in the development and validation cohorts of MRS-COVID-19 score.

Development cohort n = 1270 Validation cohort n = 800
Patients with missing data 270 185
AUC 0.81 0.80
Sensitivity -% (95% CI) 95.83 (91.65–97.97) 92.52 (85.94–96.16)
Specificity -% (95% CI) 41.46 (38.16–44.85) 41.34 (37.14–45.67)
PPV -% (95% CI) 24.85 (21.67–28.31) 24.94 (20.93–29.42)
NPV -% (95% CI) 98.01 (95.95–99.03) 96.33 (92.93–98.13)
Mortality (%) 16.81 17.39
Age categorized- n (%)
≤50 244 (19.23) 159 (19.88)
51–60 245 (19.32) 134 (16.75)
61–70 238 (18.75) 175 (21.87)
71–80 273 (21.51) 164 (20.50)
>80 269 (21.19) 168 (21.00)
Gender- n (%)
Male 729 (57.40) 457 (57.12)
Female 541 (42.60) 343 (42.88)
Arterial hypertension- n (%)
No 694 (54.86) 446 (56.17)
Yes 571 (45.14) 348 (43.83)
Diabetes mellitus- n (%)
No 999 (78.97) 651 (81.98)
Yes 266 (21.03) 143 (18.02)
Pulmonary disease- n (%)
No 1049 (83.05) 667 (84.01)
Yes 214 (16.95) 127 (15.99)
Cardiovascular disease- n (%)
No 1061 (84.07) 671 (84.51)
Yes 201 (15.93) 123 (15.49)
Renal Failure- n (%)
No 1011 (96.19) 654 (95.47)
Yes 40 (3.81) 31 (3.53)
Thrombocytopenia- n (%)
No 1048 (84.17) 565 (79.58)
Yes 197 (15.83) 137 (20.42)
Prolonged PT- n (%)
No 952 (80.81) 565 (79.58)
Yes 226 (19.19) 145 (20.42)
D-dimer elevated (>2 ULN)- n (%)
No 728 (70.47) 457 (70.85)
Yes 305 (29.53) 188 (29.15)

AUC = area under the curve; NPV = negative predictive value; PPV = positive predictive value; ULN = upper limit of normality.

An interactive risk calculator for the application of individual combinations of the five parameters is provided at GooglePlay called MRS-COVID-19. This calculator allows for the classification of patients into low-risk or high-risk of in-hospital mortality and estimates OR values using young female without coagulopathy markers as the reference category.

Discussion

The main finding of our study was the development and validation of a predictive model of in-hospital mortality based on age, gender, and on-admission coagulopathy markers of COVID-19. The actual COVID-19 pandemic has become a huge challenge for the health care systems of many countries due to the massive number of infected subjects. Emergency departments have been overwhelmed due to insufficient medical personnel and resources and patient overcrowding [24]. The access to invasive ventilation and/or intensive care units has been limited or prioritized to patients developing severe hypoxemic respiratory failure. In order to address these shortages and their consequences, it is essential that health care systems develop efficient strategies and plans to effectively deal with them. A risk model or score capable of predicting on admission which COVID-19 patients will most probably survive would be a strategy of great interest, in order to avoid the collapse of acute care hospitals as far as possible. Thus, predictive models with high sensibility and, therefore, high negative predictive value would be desirable, since low-risk patients could either be discharged or derived to other support institutions that lack intensive care units.

Our study demonstrates that in-hospital mortality among moderate or severe hospitalized COVID-19 patients is predicted by the combination of age, gender, and coagulopathy markers (D-dimer and PT). The regression coefficients and cut-off selected were then validated in an independent data sample. Because our aim was to create a screening tool, we intentionally used a cut-off with high sensitivity and NPV, but low specificity and PPV. Therefore, the proportion of patients misclassified as high-risk will be elevated. However, patients classified as low-risk on admission could get either discharged early or derived to other centers without intensive care units with the certainty that their likelihood of dying is not as high as those classified as high risk, based on our arbitrarily selected and afterwards validated cut-off. Further external validation of our findings should be performed. Although COVID-19 mortality rates may be lower in future outbreaks due to improvements in its management and better access to medical infrastructures, the predictive capacity of our model should not be worse.

The model could be easily implemented in any laboratory information system (LIS), so that clinicians may automatically have the prognostic information. Additionally, in clinical trials that include adult COVID-19 patients of all ages, our model could be useful to ensure the comparability of included comparison groups.

Based on the logistic regression model coefficients, age was confirmed to be the strongest predictor of mortality in our cohort. Most of COVID-19 patients aged less than 50 years old (97.27%) or between 51 and 60 years old (95%) will be discharged within a few days regardless of their laboratory parameters on admission. On the other hand, near half of the patients over 80 years old died (45.77%), probably owing to a less rigorous immune response, thus suggesting that our predictive model seems to be less helpful in extreme age ranges. However, the addition of coagulopathy markers to age and gender may help clinicians refine the prognosis of hospitalized COVID-19 patients, especially those aged between 50 and 70 years.

Moderate or severe COVID cases are more likely to occur in older men with comorbidities [1]. A recent meta-analysis with aggregated data, including a total number of 3027 COVID-19 patients, confirmed that male, aged over 65 years, smoking and comorbidities such as hypertension, diabetes, cardiovascular disease, and respiratory diseases were risk factors for severe disease and mortality [2]. More than 60% of our 2070 cases were over 60 years old, and the likelihood of dying was higher in men compared to women. Non-survivors from our cohort were older, had more chronic pathologies (with the exception of obesity and smoking habit), and a showed a higher proportion of males. Our findings are in agreement with previous reports, since the outcome was significantly worse in male patients and those with chronic pathologies. However, the presence of all of these comorbidities was excluded from our final model.

Although the pathophysiology underlying severe COVID-19 remains poorly understood, a lung-centric coagulopathy is believed to play an important role [25]. COVID-19 associated coagulopathy correlates with illness severity and mortality, and may include increased D-dimer levels, mild PT prolongation and mild thrombocytopenia [10, 13, 26]. Thrombotic complications have emerged as an important issue in COVID-19 patients as a result of the inflammatory response to SARS-CoV-2. COVID-19 prothrombotic status seems to be multifactorial. The illness severity and hypoxia, hemostatic abnormalities, the severe inflammatory response, plus any other underlying thrombotic risk factors can lead to a thrombotic event [27].

Compared to survivors, the COVID-19 non-survivors from our cohort presented significantly higher D-dimer levels, prolonged PT and lower platelet counts. These results are in agreement with previously published data [1014].‬ Similar to our approach, Zhang et al retrospectively analyzed 343 COVID-19 hospitalized patients and reported that a four-fold increase of on-admission D-dimer levels could effectively predict their in-hospital mortality [20]. To our knowledge, there are two studies reporting predictive models of mortality in adult hospitalized COVID-19 patients based on baseline clinical and laboratory data [28, 29]. Their risk of bias is high, either because the sample size is small or because they are not validated.

Wang and colleagues developed (n = 296) and validated (n = 44) two models, both based on age: one clinical (including age, hypertension and coronary heart disease sensitivity), and one based on laboratory parameters [age, C-reactive protein, SpO2, neutrophil and lymphocyte count, D-dimer, aspartate aminotransferase (AST) and GFR] which had a significantly stronger discriminatory power than the clinical model [28]. The model from Chen and colleagues was developed from a bigger retrospective cohort (n = 1590), and included age, coronary heart disease, cerebrovascular disease, dyspnea, procalcitonin level >0.5 ng/mL, and AST) [29]. However, it has not been validated.

Although most of predictive models have been reported to be at high risk of bias [30], we adhered to the TRIPOD reporting guideline [23] to perform our model, and the Brier test results ensure its good calibration.

The strengths of our model include the study population size, the multicenter nature of data and the inclusion of a validation cohort. However, the model has some limitations. First of all, it is a retrospective analysis. Second, no data from possible hospital readmission of survivors were available, and it is possible that some initially recovered patients may have worsened a few days later. Finally, although we obtained dichotomized variables in order to simplify the model and increase its applicability, the use of continuous variables has the potential to provide more refined information.

In conclusion, we developed and validated a predictive model for in-hospital mortality of moderate or severe COVID-19 patients, which included D-dimer levels >2 ULN, prolonged PT, male gender and age as positively correlated factors with fatal outcome. Our findings, obtained and validated from a large series of hospitalized COVID-19 patients, support the use of this prognostic tool on admission to identify a low-risk group that may benefit from early discharge or derivation to support institutions, in order to prevent acute care hospitals getting overwhelmed. Prospective studies are needed to confirm our findings.

Supporting information

S1 Data

(XLSX)

Acknowledgments

We acknowledge all health-care workers involved in the diagnosis and treatment of COVID-19 patients in Hospital General de Villalba (Collado Villalba, Madrid), Hospital Infanta Elena (Valdemoro, Madrid), Hospital Universitario Rey Juan Carlos (Móstoles, Madrid) and Hospital Universitario Fundación Jiménez Díaz in Madrid.

Data Availability

All data will be uploaded as Supporting Information file.

Funding Statement

The authors received no specific funding for this work. Quironsalud provided support in the form of salaries for all authors, but did not have any additional role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript. The specific roles of these authors are articulated in the ‘author contributions’ section.

References

  • 1.Huang C, Wang Y, Li X, Ren L, Zhao J, Hu Y, et al. Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. Lancet. 2020;395: 497–506. 10.1016/S0140-6736(20)30183-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Zhu J, Ji P, Pang J, Zhong Z, Li H, He C, et al. Clinical characteristics of 3,062 COVID-19 patients: a meta-analysis. J Med Virol. 2020. 10.1002/jmv.25884 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Zhou F, Yu T, Du R, Fan G, Liu Y, Liu Z, et al. Clinical course and risk factors for mortality of adult inpatients with COVID-19 in Wuhan, China: a retrospective cohort study. Lancet. 2020;395: 1054–1062. 10.1016/S0140-6736(20)30566-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Sinclair AJ, Abdelhafiz AH. Age, frailty and diabetes—triple jeopardy for vulnerability to COVID-19 infection. EClinicalMedicine. 2020; 100343. 10.1016/j.eclinm.2020.100343 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Zhang J, Wang X, Jia X, Li J, Hu K, Chen G, et al. Risk factors for disease severity, unimprovement, and mortality in COVID-19 patients in Wuhan, China. Clin Microbiol Infect. 2020. 10.1016/j.cmi.2020.04.012 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Wu Z, McGoogan JM. Characteristics of and Important Lessons From the Coronavirus Disease 2019 (COVID-19) Outbreak in China: Summary of a Report of 72 314 Cases From the Chinese Center for Disease Control and Prevention. JAMA. 2020. 10.1001/jama.2020.2648 [DOI] [PubMed] [Google Scholar]
  • 7.Zheng KI, Gao F, Wang X-B, Sun Q-F, Pan K-H, Wang T-Y, et al. Obesity as a risk factor for greater severity of COVID-19 in patients with metabolic associated fatty liver disease. Metab Clin Exp. 2020;108: 154244. 10.1016/j.metabol.2020.154244 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Zheng Z, Peng F, Xu B, Zhao J, Liu H, Peng J, et al. Risk factors of critical & mortal COVID-19 cases: A systematic literature review and meta-analysis. J Infect. 2020. 10.1016/j.jinf.2020.04.021 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Liu Y, Du X, Chen J, Jin Y, Peng L, Wang HHX, et al. Neutrophil-to-lymphocyte ratio as an independent risk factor for mortality in hospitalized patients with COVID-19. J Infect. 2020. 10.1016/j.jinf.2020.04.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Tang N, Li D, Wang X, Sun Z. Abnormal coagulation parameters are associated with poor prognosis in patients with novel coronavirus pneumonia. J Thromb Haemost. 2020;18: 844–847. 10.1111/jth.14768 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Lillicrap D. Disseminated intravascular coagulation in patients with 2019-nCoV pneumonia. J Thromb Haemost. 2020;18: 786–787. 10.1111/jth.14781 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Thachil J, Tang N, Gando S, Falanga A, Cattaneo M, Levi M, et al. ISTH interim guidance on recognition and management of coagulopathy in COVID-19. J Thromb Haemost. 2020;18: 1023–1026. 10.1111/jth.14810 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Lippi G, Favaloro EJ. D-dimer is Associated with Severity of Coronavirus Disease 2019: A Pooled Analysis. Thromb Haemost. 2020;120: 876–878. 10.1055/s-0040-1709650 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Yin S, Huang M, Li D, Tang N. Difference of coagulation features between severe pneumonia induced by SARS-CoV2 and non-SARS-CoV2. J Thromb Thrombolysis. 2020. 10.1007/s11239-020-02105-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Levi M, van der Poll T. Coagulation and sepsis. Thromb Res. 2017;149: 38–44. 10.1016/j.thromres.2016.11.007 [DOI] [PubMed] [Google Scholar]
  • 16.Helms J, Tacquard C, Severac F, Leonard-Lorant I, Ohana M, Delabranche X, et al. High risk of thrombosis in patients with severe SARS-CoV-2 infection: a multicenter prospective cohort study. Intensive Care Med. 2020. 10.1007/s00134-020-06062-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Klok FA, Kruip MJHA, van der Meer NJM, Arbous MS, Gommers D a. MPJ, Kant KM, et al. Incidence of thrombotic complications in critically ill ICU patients with COVID-19. Thromb Res. 2020. 10.1016/j.thromres.2020.04.013 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Clerkin KJ, Fried JA, Raikhelkar J, Sayer G, Griffin JM, Masoumi A, et al. Coronavirus Disease 2019 (COVID-19) and Cardiovascular Disease. Circulation. 2020. 10.1161/CIRCULATIONAHA.120.046941 [DOI] [PubMed] [Google Scholar]
  • 19.Dolhnikoff M, Duarte-Neto AN, de Almeida Monteiro RA, Ferraz da Silva LF, Pierre de Oliveira E, Nascimento Saldiva PH, et al. Pathological evidence of pulmonary thrombotic phenomena in severe COVID-19. J Thromb Haemost. 2020. 10.1111/jth.14844 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Zhang L, Yan X, Fan Q, Liu H, Liu X, Liu Z, et al. D-dimer levels on admission to predict in-hospital mortality in patients with Covid-19. J Thromb Haemost. 2020. 10.1111/jth.14859 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.World Health Organization. Clinical management of severe acute respiratory infection when COVID-19 is suspected. [cited 14 May 2020]. Available: https://www.who.int/publications-detail/clinical-management-of-severe-acute-respiratory-infection-when-novel-coronavirus-(ncov)-infection-is-suspected
  • 22.Ware JH, Harrington D, Hunter DJ, D’Agostino RB. Missing Data. New England Journal of Medicine. 2012;367: 1353–1354. 10.1056/NEJMsm1210043 [DOI] [Google Scholar]
  • 23.Collins GS, Reitsma JB, Altman DG, Moons KGM. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): The TRIPOD statement. Br J Cancer. 2015;112: 251–259. 10.1038/bjc.2014.639 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Hu H, Yao N, Qiu Y. Comparing rapid scoring systems in mortality prediction of critical ill patients with novel coronavirus disease. Acad Emerg Med. 2020. 10.1111/acem.13992 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Connors JM, Levy JH. Thromboinflammation and the hypercoagulability of COVID-19. J Thromb Haemost. 2020. 10.1111/jth.14849 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Fogarty H, Townsend L, Ni Cheallaigh C, Bergin C, Martin-Loeches I, Browne P, et al. COVID-19 Coagulopathy in Caucasian patients. Br J Haematol. 2020. 10.1111/bjh.16749 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Bikdeli B, Madhavan MV, Jimenez D, Chuich T, Dreyfus I, Driggin E, et al. COVID-19 and Thrombotic or Thromboembolic Disease: Implications for Prevention, Antithrombotic Therapy, and Follow-up. J Am Coll Cardiol. 2020. 10.1016/j.jacc.2020.04.031 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Wang K, Zuo P, Liu Y, Zhang M, Zhao X, Xie S, et al. Clinical and laboratory predictors of in-hospital mortality in patients with COVID-19: a cohort study in Wuhan, China. Clin Infect Dis. 2020. 10.1093/cid/ciaa538 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Chen R, Liang W, Jiang M, Guan W, Zhan C, Wang T, et al. Risk Factors of Fatal Outcome in Hospitalized Subjects With Coronavirus Disease 2019 From a Nationwide Analysis in China. Chest. 2020. 10.1016/j.chest.2020.04.010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Wynants L, Van Calster B, Collins GS, Riley RD, Heinze G, Schuit E, et al. Prediction models for diagnosis and prognosis of covid-19: systematic review and critical appraisal. BMJ. 2020. June 3;369:m2204. 10.1136/bmj.m2204 [DOI] [PMC free article] [PubMed] [Google Scholar]

Decision Letter 0

Aleksandar R Zivkovic

21 Dec 2020

PONE-D-20-35733

DEVELOPMENT AND VALIDATION OF A PREDICTIVE MODEL OF IN-HOSPITAL MORTALITY IN COVID-19 PATIENTS

PLOS ONE

Dear Dr. Alonso-Dominguez,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by Feb 04 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols

We look forward to receiving your revised manuscript.

Kind regards,

Aleksandar R. Zivkovic

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. In the ethics statement in the manuscript and in the online submission form, please provide additional information about the patient records/samples used in your retrospective study, including the date range (month and year) during which patients' medical records/samples were accessed.

3. In your ethics statement in the manuscript and in the online submission form, please ensure that you have discussed whether all data/samples were fully anonymized before you accessed them and/or whether the IRB or ethics committee waived the requirement for informed consent. If patients provided informed written consent to have data/samples from their medical records used in research, please include this information.

4.In your Data Availability statement, you have not specified where the minimal data set underlying the results described in your manuscript can be found. PLOS defines a study's minimal data set as the underlying data used to reach the conclusions drawn in the manuscript and any additional data required to replicate the reported study findings in their entirety. All PLOS journals require that the minimal data set be made fully available. For more information about our data policy, please see http://journals.plos.org/plosone/s/data-availability.

Upon re-submitting your revised manuscript, please upload your study’s minimal underlying data set as either Supporting Information files or to a stable, public repository and include the relevant URLs, DOIs, or accession numbers within your revised cover letter. For a list of acceptable repositories, please see http://journals.plos.org/plosone/s/data-availability#loc-recommended-repositories. Any potentially identifying patient information must be fully anonymized.

Important: If there are ethical or legal restrictions to sharing your data publicly, please explain these restrictions in detail. Please see our guidelines for more information on what we consider unacceptable restrictions to publicly sharing data: http://journals.plos.org/plosone/s/data-availability#loc-unacceptable-data-access-restrictions. Note that it is not acceptable for the authors to be the sole named individuals responsible for ensuring data access.

We will update your Data Availability statement to reflect the information you provide in your cover letter.

5.Thank you for stating the following in the Financial Disclosure section:

"The author(s) received no specific funding for this work."

We note that one or more of the authors are employed by a commercial company: Quironsalud

  1. Please provide an amended Funding Statement declaring this commercial affiliation, as well as a statement regarding the Role of Funders in your study. If the funding organization did not play a role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript and only provided financial support in the form of authors' salaries and/or research materials, please review your statements relating to the author contributions, and ensure you have specifically and accurately indicated the role(s) that these authors had in your study. You can update author roles in the Author Contributions section of the online submission form.

Please also include the following statement within your amended Funding Statement.

“The funder provided support in the form of salaries for authors [insert relevant initials], but did not have any additional role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript. The specific roles of these authors are articulated in the ‘author contributions’ section.”

If your commercial affiliation did play a role in your study, please state and explain this role within your updated Funding Statement.

2. Please also provide an updated Competing Interests Statement declaring this commercial affiliation along with any other relevant declarations relating to employment, consultancy, patents, products in development, or marketed products, etc.  

Within your Competing Interests Statement, please confirm that this commercial affiliation does not alter your adherence to all PLOS ONE policies on sharing data and materials by including the following statement: "This does not alter our adherence to  PLOS ONE policies on sharing data and materials.” (as detailed online in our guide for authors http://journals.plos.org/plosone/s/competing-interests) . If this adherence statement is not accurate and  there are restrictions on sharing of data and/or materials, please state these. Please note that we cannot proceed with consideration of your article until this information has been declared.

Please include both an updated Funding Statement and Competing Interests Statement in your cover letter. We will change the online submission form on your behalf.

Please know it is PLOS ONE policy for corresponding authors to declare, on behalf of all authors, all potential competing interests for the purposes of transparency. PLOS defines a competing interest as anything that interferes with, or could reasonably be perceived as interfering with, the full and objective presentation, peer review, editorial decision-making, or publication of research or non-research articles submitted to one of the journals. Competing interests can be financial or non-financial, professional, or personal. Competing interests can arise in relationship to an organization or another person. Please follow this link to our website for more details on competing interests: http://journals.plos.org/plosone/s/competing-interests

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Partly

Reviewer #4: Partly

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: No

Reviewer #4: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

Reviewer #4: No

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

Reviewer #4: No

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: In this retrospective observational multicenter study, the authors developed and validated a predictive model for in-hospital mortality of moderate and severe COVID-19 patients, using demographic data, medical history and laboratory parameters. Overall, the study was well executed and the authors achieved its clearly defined objectives. The manuscript was concise, easy to understand and a good read. It contributes to the existing literature on prognostication of outcomes in COVID-19 patients. Following are listed some comments and points to be addressed:

Major issues:

---------------

1) The authors states (p. 19, l. 319) that a predictive model with a high negative predictive value is desirable, since this allows to identify low-risk patients which may be discharged at an earlier stage, and thereby relieve the health care system. However, since the data from the present study is based on patients (both survivors and non-survivors) receiving full treatment, I could fear that mortality will increase in the low-risk patients if they are discharged early solely based on their risk score. The authors should emphasize the need for prospective studies to back up this statement in the conclusions.

2) Was there a sample size analysis or was this a sample of convenience?

Minor issues:

---------------

1) The authors should either combine the first two sentences or rewrite the first sentence in the abstract (p. 2, l. 85-88).

2) In the statistical analysis section (p. 6, l. 190-192), the authors state: “The results were expressed as the mean ± standard deviation and range or number (percentage), wherever appropriate. P values less than 0.05 were considered statistically significant.” It would be more informative and logic to present data as mean ± standard deviation if normal distributed, and as median (25-75 percentiles) if skewed, and numbers (percentage). Even though the authors states that range is listed, I think the authors have instead presented minimum and maximum values, which is of less importance. Also, it should be stated if the P values used are one- or two-sided.

3) Table 1 is very long and difficult to overlook. I would suggest removing ranges, and instead list mean ± SD if normal distributed, median (25-75 percentiles) if skewed, and number (percentage). In the column listing OR (95% CI), please type reference group in each of the univariate analyses involving categorical data, starting with the reference group. It is not clear which variable is associated with a higher OR (e.g. going from male to female, or female to male). Missing data of the individual parameters should be listed as a foot note. Also, the category “hospital stay” is somehow relevant and informative but also misleading, since non-survivors are discharged to the morgue. The authors should remove this parameter in Table 1, and carefully mention the results in the Results section. Finally, “sex” is used in the table, but “gender” is used in the text. Please be consistent.

4) Results are listed in details both in the tables and in the Results section. Data should generally only be listed either in the tables or in the text. Please refine manuscript.

5) Serum ferritin have in several studies been found valuable to determine poor prognosis in COVID-19 patients. The authors state (p. 14, l. 259); “No significant differences were found in categorized ferritin”. Looking at Table 1, non-survivors had significantly higher serum ferritin levels, but when categorized according to elevated ferritin (yes/no), this is no longer significantly different between groups. This detail should be added to the text. Also, please define the term “elevated ferritin” in the Methods section.

6) The authors used area under the ROC curve to evaluate both the development model and the validation model. Even though this is a generally accepted method in medical literature, the AUC only describe the discriminative ability of the model, i.e. correctly allocate patients as survivor or non-survivors according the model. However, adding a statistical analysis to test the predictive ability, e.g. Brier scores, could provide additional strength to the study.

Reviewer #2: Dear Authors;

I have read the manuscript titled “DEVELOPMENT AND VALIDATION OF A PREDICTIVE MODEL OF IN-HOSPITAL MORTALITY IN COVID-19 PATIENTS”. This study retrospectively evaluated 2879 hospitalized COVID-19 patients from four hospitals. Although it is a nice designed study there are several limitations.

1. There are more than 50 studies investigating the in hospital or 7-14 days mortality in patients with COVID-19 infection. Therefore (unfortunately) this manuscript doesn’t add something new to the literature.

2. This new predictive model has a sensitivity of 95.83%, specificity of 41.46%, negative predictive value of 98.01%, and positive predictive value of 24.85%. The authors explain these results however for a predictive model specificity is quite low.

3. doi: https://doi.org/10.1136/bmj.m1328 is a nice review and summary of literature why these predictive models have bias.

4. One selection bias is the inclusion criteria which CURB-65 score is used. Literature showed that CALL or other scores may be used instead of CURB-65 for covid pneumonia.

5. The authors found no difference in lymphocyte count between deceased and survivors. An interesting finding because in most studies lymphocyte count is a predictive factor for mortality.

Thank you.

Reviewer #3: Thank you for the opportunity to review this manuscript, which focusses on an important topic and addresses the important issue of risk stratification for patients with suspected and confirmed Covid-19 infection. With regards to the model presented, I am encouraged to see a relatively large sample size and contemporary data from patients across several centres.

However, I have serious concerns about a number of methodological flaws with the development and internal validation of the model that make it impossible for me to recommend publication at this time:-

1. Inclusion of predictors. Predictors were initially selected to be included in the logistic regression model if a significant association was found between the predictor and the outcome on univariable analysis. This is recognised as an unsuitable approach and may lead to important predictors being excluded. [1] All predictors deemed to be of clinical significance should be included in the logistic regression analysis, regardless of the univariable statistical analysis.

2. Sample size. Although 2070 patients were included in the study, no formal statistical sample size analysis has been undertaken. [2] This is an important step to ensure that the results of the model development can be considered statistically robust.

3. Dichotomisation/categorisation of continuous variables. Transformation of continuous variables into non-continuous variables can reduce the power by the same amount as discarding approximately 1/3 of the data and should be avoided unless absolutely necessary. For variables with recognised cut-offs (i.e. haemoglobin values for anaemia) it can be an acceptable practice, but in this model, you have dichotomised four variables including age. I consider this a serious methodological flaw. [3]

4. Measures of model performance. Whilst you have provided information regarding the discrimination of the model, there are no reported measures of calibration. Whilst discrimination is important in terms of assessing how well the model discriminates between those with and without the outcome calibration is a measure of how closely model predicted outcomes match observed outcomes. Some measure of calibration (observed to expected ratio, calibration plot, calibration slope, calibration-in-the-large) must be included when presenting a model. [4]

5. Internal validation. Whilst an attempt to perform internal validation has been undertaken, the methodological approach is not appropriate. Using a split sample approach (splitting the population into a development and a validation set) is no longer recognised as an acceptable method of internal validation due to a number of drawbacks. Internal validation should be performed using cross-validation or bootstrapping

I think that the multicentre dataset and important nature of the question being asked is worthy of further investigation. However, I think that the entire methodological approach to model development needs to be re-assessed and a new analysis performed. A new paper should be submitted in the future if you are able to do this.

References

[1] Royston P, Moons KGM, Altman DG, Vergouwe Y. Prognosis and prognostic research: developing a prognostic model. BMJ 2009;338: b604.

[2] BMJ 2020;368:m441 doi: 10.1136/bmj.m441

[3] Royston P, Altman DG, Sauerbrei W. Dichotomizing continuous predictors in multiple regression: a bad idea. Stat Med 2006;25:127–41.

[4] Grant SW, Collins GS, Nashef SAM. Statistical Primer: developing and validating a risk prediction model. Eur J Cardiothorac Surg 2018;54:203–8

[5] Altman DG, Vergouwe Y, Royston P, Moons KGM. Prognosis and prog nostic research: validating a prognostic model. BMJ 2009;338:b605.

Reviewer #4: It is an important topic and interesstung as well. The authors provide a possible predictors of COVID-19 in-Hospital Mortality.

However the current work suffer many Major issues.

Good idea of your work. Relevant study. Introduction of your paper states not clearly the problem but underlines your study purpose.

Material and methods leave behind unanswered questions.

Results are not clearly structured and the table is way too long and confusing

In the discussion you go more into the background of your study (which could be partly mentioned in your introduction). You advice a new prediction tool for in hospital mortality. However

you did a validation in the same paper without background information of the idependent patient sample. ???? that is very much questionable for me….

Major issues:

1. what is the Definition of moderat to Severe COVID-19 Patients?

2. Lines 221 A total of 2879 moderate to severe COVID-19 hospitalized patients were initially

lines 222 evaluated for inclusion in the development cohort. Of these, 809 were excluded: 515

line 223 remained hospitalized at the time of analysis and 294 were on chronic anticoagulant

line 224 treatment before hospitalization. The final sample consisted of 2070 patients (884

line 225 females and 1186 males) with definite outcomes: 1677 (81.01%) patients had been

line 226 discharged (survivors) and 393 (18.99%) patients had died (non-survivors).

Exclusion of a large group of patients. Approximately 28,1 % of patient were excluded.

Follow up time is not clear. (or time to mortality.)?

3. it is not clear how many patients were mechanical ventilated?

4.lines 262- 264 " They were then examined in a multivariate logistic regression model including 1270 patients to identify independent prognostic factors of moderate/severe COVID-19 in-hospital mortality (Table 2)."

Not clear. Sub-analysis of 2070 patients? Why are the 800 patients excluded from this Analysis.

Is that the independent sample of patients about which we read in the abstract.? Not clear.

IF Yes: no characteristics of this patient population present.

5. Line 283 A cut-off of 0.076 was arbitrarily selected.

Please elaborate more why this cut off value and not another ?

6. If a major Revision is possible, the analysis must be checked by a statistician

Minor

1. The language need to be re-edited, there are many typos as well as weak sentences: lines 85-86 To evaluate the risk of demographic data, medical history, and on-admission laboratory parameters in hospitalized COVID-19 patients to predict mortality

(Not well written)

Lines 133-135 Clinicians need better predictors of which patients are prone to deteriorate rapidly or who may go on to die

(“predict Mortality” better) not go die!!!!

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: Yes: Sebastian Roed Rasmussen

Reviewer #2: No

Reviewer #3: No

Reviewer #4: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2021 Mar 4;16(3):e0247676. doi: 10.1371/journal.pone.0247676.r002

Author response to Decision Letter 0


8 Feb 2021

Dear Editor and reviewers,

Thank you very much for your comments. We truly appreciate them.

Here are our responses (in red).

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. We have modified the manuscript in order to meet PLOS ONE's style requirements.

2. In the ethics statement in the manuscript and in the online submission form, please provide additional information about the patient records/samples used in your retrospective study, including the date range (month and year) during which patients' medical records/samples were accessed.

3. In your ethics statement in the manuscript and in the online submission form, please ensure that you have discussed whether all data/samples were fully anonymized before you accessed them and/or whether the IRB or ethics committee waived the requirement for informed consent. If patients provided informed written consent to have data/samples from their medical records used in research, please include this information.

4. In your Data Availability statement, you have not specified where the minimal data set underlying the results described in your manuscript can be found. PLOS defines a study's minimal data set as the underlying data used to reach the conclusions drawn in the manuscript and any additional data required to replicate the reported study findings in their entirety. All PLOS journals require that the minimal data set be made fully available. For more information about our data policy, please see http://journals.plos.org/plosone/s/data-availability.

Upon re-submitting your revised manuscript, please upload your study’s minimal underlying data set as either Supporting Information files or to a stable, public repository and include the relevant URLs, DOIs, or accession numbers within your revised cover letter. For a list of acceptable repositories, please see http://journals.plos.org/plosone/s/data-availability#loc-recommended-repositories. Any potentially identifying patient information must be fully anonymized.

Data will be available from the corresponding author upon reasonable request.

5. Thank you for stating the following in the Financial Disclosure section:

"The author(s) received no specific funding for this work."

We have added this sentence in the manuscript.

We note that one or more of the authors are employed by a commercial company: Quironsalud

1. Please provide an amended Funding Statement declaring this commercial affiliation, as well as a statement regarding the Role of Funders in your study. If the funding organization did not play a role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript and only provided financial support in the form of authors' salaries and/or research materials, please review your statements relating to the author contributions, and ensure you have specifically and accurately indicated the role(s) that these authors had in your study. You can update author roles in the Author Contributions section of the online submission form.

Please also include the following statement within your amended Funding Statement.

“The funder provided support in the form of salaries for authors [insert relevant initials], but did not have any additional role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript. The specific roles of these authors are articulated in the ‘author contributions’ section.”

We have included the mentioned statement.

2. Please also provide an updated Competing Interests Statement declaring this commercial affiliation along with any other relevant declarations relating to employment, consultancy, patents, products in development, or marketed products, etc. An updated Competing Interests Statement was added.

Within your Competing Interests Statement, please confirm that this commercial affiliation does not alter your adherence to all PLOS ONE policies on sharing data and materials by including the following statement: "This does not alter our adherence to PLOS ONE policies on sharing data and materials.” (as detailed online in our guide for authors http://journals.plos.org/plosone/s/competing-interests) . If this adherence statement is not accurate and there are restrictions on sharing of data and/or materials, please state these. Please note that we cannot proceed with consideration of your article until this information has been declared.

The following statement was included: "This does not alter our adherence to PLOS ONE policies on sharing data and materials.”

Please include both an updated Funding Statement and Competing Interests Statement in your cover letter. We will change the online submission form on your behalf.

Both an updated Funding Statement and Competing Interests Statement were included in the cover letter.

To reviewers:

Reviewer #1: In this retrospective observational multicenter study, the authors developed and validated a predictive model for in-hospital mortality of moderate and severe COVID-19 patients, using demographic data, medical history and laboratory parameters. Overall, the study was well executed and the authors achieved its clearly defined objectives. The manuscript was concise, easy to understand and a good read. It contributes to the existing literature on prognostication of outcomes in COVID-19 patients. Thank you for your comments. Here are our answers.

Following are listed some comments and points to be addressed:

Major issues:

---------------

1) The authors states (p. 19, l. 319) that a predictive model with a high negative predictive value is desirable, since this allows to identify low-risk patients which may be discharged at an earlier stage, and thereby relieve the health care system. However, since the data from the present study is based on patients (both survivors and non-survivors) receiving full treatment, I could fear that mortality will increase in the low-risk patients if they are discharged early solely based on their risk score. The authors should emphasize the need for prospective studies to back up this statement in the conclusions. We do agree with your comment. To date, although some drugs like dexametasone have shown benefit, none of the applied treatments have dramatically changed the outcomes in COVID-19 patients. Nevertheless, supportive treatment received may have decreased mortality rate in hospitalized patients. We have emphasized the need for prospective studies to back up this statement in the conclusions.

2) Was there a sample size analysis or was this a sample of convenience? We used all the available data we had at the time of the analysis.

Minor issues:

---------------

1) The authors should either combine the first two sentences or rewrite the first sentence in the abstract (p. 2, l. 85-88). Amended.

2) In the statistical analysis section (p. 6, l. 190-192), the authors state: “The results were expressed as the mean ± standard deviation and range or number (percentage), wherever appropriate. P values less than 0.05 were considered statistically significant.” It would be more informative and logic to present data as mean ± standard deviation if normal distributed, and as median (25-75 percentiles) if skewed, and numbers (percentage). Even though the authors states that range is listed, I think the authors have instead presented minimum and maximum values, which is of less importance. Also, it should be stated if the P values used are one- or two-sided. We have now presented data as median (25-75 percentiles) since all of quantitative variables had skewed distribution, and numbers (percentage). P values are two-sided. We have modified the text. You are absolutely right about the definition of range. We have amended this issue in the manuscript.

3) Table 1 is very long and difficult to overlook. I would suggest removing ranges, and instead list mean ± SD if normal distributed, median (25-75 percentiles) if skewed, and number (percentage). In the column listing OR (95% CI), please type reference group in each of the univariate analyses involving categorical data, starting with the reference group. It is not clear which variable is associated with a higher OR (e.g. going from male to female, or female to male). Missing data of the individual parameters should be listed as a foot note. Also, the category “hospital stay” is somehow relevant and informative but also misleading, since non-survivors are discharged to the morgue. The authors should remove this parameter in Table 1, and carefully mention the results in the Results section. Finally, “sex” is used in the table, but “gender” is used in the text. Please be consistent. We have modified Table 1 following your suggestions. OR values have been deleted in the reference categories, and ranges have been removed. We have listed missing data as a footnote.

4) Results are listed in details both in the tables and in the Results section. Data should generally only be listed either in the tables or in the text. Please refine manuscript. We have refined the manuscript by removing numeric values of the parameters in the Results section.

5) Serum ferritin has in several studies been found valuable to determine poor prognosis in COVID-19 patients. The authors state (p. 14, l. 259); “No significant differences were found in categorized ferritin”. Looking at Table 1, non-survivors had significantly higher serum ferritin levels, but when categorized according to elevated ferritin (yes/no), this is no longer significantly different between groups. This detail should be added to the text. Also, please define the term “elevated ferritin” in the Methods section. We have added this detail to the text and defined the term “elevated ferritin” in the Methods section.

6) The authors used area under the ROC curve to evaluate both the development model and the validation model. Even though this is a generally accepted method in medical literature, the AUC only describe the discriminative ability of the model, i.e. correctly allocate patients as survivor or non-survivors according the model. However, adding a statistical analysis to test the predictive ability, e.g. Brier scores, could provide additional strength to the study. Following your recommendation, Brier score was performed to test the predictive ability of our model, showing a good calibration.

Reviewer #2: Dear Authors;

I have read the manuscript titled “DEVELOPMENT AND VALIDATION OF A PREDICTIVE MODEL OF IN-HOSPITAL MORTALITY IN COVID-19 PATIENTS”. This study retrospectively evaluated 2879 hospitalized COVID-19 patients from four hospitals. Although it is a nice designed study there are several limitations.

1. There are more than 50 studies investigating the in hospital or 7-14 days mortality in patients with COVID-19 infection. Therefore (unfortunately) this manuscript doesn’t add something new to the literature. Although there are numerous studies that have evaluated COVID-19 mortality, very few of them carried out an external validation, and calibration was rarely assessed. Our score is well calibrated, validated and very easy to apply. In the actual setting, cases are rapidly increasing again, and the feared “third wave” is becoming real. Tools like ours can be of great utility in order to prevent the collapse of emergency departments and acute care hospitals.

2. This new predictive model has a sensitivity of 95.83%, specificity of 41.46%, negative predictive value of 98.01%, and positive predictive value of 24.85%. The authors explain these results however for a predictive model specificity is quite low. We have chosen the threshold of the score in order to maximize the negative predictive value and sensitivity. Of course, our score is not perfect but we believe its applicable and useful.

3. doi: https://doi.org/10.1136/bmj.m1328 is a nice review and summary of literature why these predictive models have bias. We have included this reference and discussed it.

4. One selection bias is the inclusion criteria which CURB-65 score is used. Literature showed that CALL or other scores may be used instead of CURB-65 for covid pneumonia. CURB-65 was one of the hospitalization criteria for COVID-19 patients in our institution protocol. Although CALL score has been developed specifically for COVID-19 pneumonia, it was not available when we performed our study.

5. The authors found no difference in lymphocyte count between deceased and survivors. An interesting finding because in most studies lymphocyte count is a predictive factor for mortality. It may seem surprising since lymphopenia on admission has been associated with poor outcome in patients with COVID-19 in many studies. However, we found no differences between survivors and deceased.

Thank you.

Reviewer #3: Thank you for the opportunity to review this manuscript, which focusses on an important topic and addresses the important issue of risk stratification for patients with suspected and confirmed Covid-19 infection. With regards to the model presented, I am encouraged to see a relatively large sample size and contemporary data from patients across several centres.

However, I have serious concerns about a number of methodological flaws with the development and internal validation of the model that make it impossible for me to recommend publication at this time:-

Thank you very much for your comments.

1. Inclusion of predictors. Predictors were initially selected to be included in the logistic regression model if a significant association was found between the predictor and the outcome on univariable analysis. This is recognised as an unsuitable approach and may lead to important predictors being excluded. [1] All predictors deemed to be of clinical significance should be included in the logistic regression analysis, regardless of the univariable statistical analysis. You are completely right. However, we have to keep in mind that COVID-19 is a novel disease and there are no completely established predictive factors that we should have included in the multivariable analysis.

2. Sample size. Although 2070 patients were included in the study, no formal statistical sample size analysis has been undertaken. [2] This is an important step to ensure that the results of the model development can be considered statistically robust. You are completely right and sample size calculation is the optimal procedure. Nevertheless, our sample size seems big enough taking into account the “10 events per predictor variable” rule. We preferred to include all the data we had available.

3. Dichotomisation/categorisation of continuous variables. Transformation of continuous variables into non-continuous variables can reduce the power by the same amount as discarding approximately 1/3 of the data and should be avoided unless absolutely necessary. For variables with recognised cut-offs (i.e. haemoglobin values for anaemia) it can be an acceptable practice, but in this model, you have dichotomised four variables including age. I consider this a serious methodological flaw. [3]

We have performed the dichotomization of laboratory variables in order to maximixe reproducibility of the score. Laboratory measurements as d-dimer, protrombine time, platelet count and ferritin might not yield the same results among different laboratories. By dichotomizing in normal or elevated valued we sought to increase the reproducibility, which is, in our opinion, especially interesting in the actual pandemic setting.

We categorized age into several groups, as published in most studies including COVID-19 patients. Within these groups, the risk of mortality seems to be stable. We are aware that we can reduce the power of the score but by doing it in this way we can increase reproducibility of the score and easiness of use. We believe that both reproducibility and easiness are essential for a score to help to effectively deal with the shortage of resources during the actual COVID-19 pandemic, since the feared “third wave” is getting worse in Spain and many other countries.

4. Measures of model performance. Whilst you have provided information regarding the discrimination of the model, there are no reported measures of calibration. Whilst discrimination is important in terms of assessing how well the model discriminates between those with and without the outcome calibration is a measure of how closely model predicted outcomes match observed outcomes. Some measure of calibration (observed to expected ratio, calibration plot, calibration slope, calibration-in-the-large) must be included when presenting a model. [4] We agree with your comment. A predictive model needs to be calibrated. Brier score summarizes model calibration and discrimination. This test provides a measure of the agreement between the observed binary outcome and the predicted probability of that outcome. It was performed to evaluate how closely our model predicted outcomes match observed outcomes.

5. Internal validation. Whilst an attempt to perform internal validation has been undertaken, the methodological approach is not appropriate. Using a split sample approach (splitting the population into a development and a validation set) is no longer recognised as an acceptable method of internal validation due to a number of drawbacks. Internal validation should be performed using cross-validation or bootstrapping

Thank you for your comment. We guess you refer to recommendations included in reference 5. As it is stated in that manuscript “If the available data are limited, the model can be developed on the whole dataset and techniques of data re-use, such as cross validation and bootstrapping, applied to assess performance”. We consider our cohort of over 2000 patients is not limited. Therefore, applying bootstrapping or cross-validation techniques is not mandatory, and splitting the sample is a feasible approach.

I think that the multicentre dataset and important nature of the question being asked is worthy of further investigation. However, I think that the entire methodological approach to model development needs to be re-assessed and a new analysis performed. A new paper should be submitted in the future if you are able to do this.

References

[1] Royston P, Moons KGM, Altman DG, Vergouwe Y. Prognosis and prognostic research: developing a prognostic model. BMJ 2009;338: b604.

[2] BMJ 2020;368:m441 doi: 10.1136/bmj.m441

[3] Royston P, Altman DG, Sauerbrei W. Dichotomizing continuous predictors in multiple regression: a bad idea. Stat Med 2006;25:127–41.

[4] Grant SW, Collins GS, Nashef SAM. Statistical Primer: developing and validating a risk prediction model. Eur J Cardiothorac Surg 2018;54:203–8

[5] Altman DG, Vergouwe Y, Royston P, Moons KGM. Prognosis and prog nostic research: validating a prognostic model. BMJ 2009;338:b605.

Reviewer #4: It is an important topic and interesting as well. The authors provide possible predictors of COVID-19 in-Hospital Mortality.

However the current work suffers many Major issues.

Good idea of your work. Relevant study. Introduction of your paper states not clearly the problem but underlines your study purpose. Thank you for your suggestions. We have added the following sentence in the Introduction in order to clearly state the problem: “The COVID-19 pandemic has spread to the whole world, causing over 1.5 million deaths to date”.

Material and methods leave behind unanswered questions.

Results are not clearly structured and the table is way too long and confusing

In the discussion you go more into the background of your study (which could be partly mentioned in your introduction). You advice a new prediction tool for in hospital mortality. However you did a validation in the same paper without background information of the independent patient sample. ???? that is very much questionable for me….

Major issues:

1. what is the Definition of moderate to Severe COVID-19 Patients? Following your suggestion, we have included this information in the manuscript.

2. Lines 221 A total of 2879 moderate to severe COVID-19 hospitalized patients were initially

lines 222 evaluated for inclusion in the development cohort. Of these, 809 were excluded: 515

line 223 remained hospitalized at the time of analysis and 294 were on chronic anticoagulant

line 224 treatment before hospitalization. The final sample consisted of 2070 patients (884

line 225 females and 1186 males) with definite outcomes: 1677 (81.01%) patients had been

line 226 discharged (survivors) and 393 (18.99%) patients had died (non-survivors).

Exclusion of a large group of patients. Approximately 28,1 % of patient were excluded.

A total of 809 patients (approximately 28.1%) were excluded. The main outcome of the study (death/discharge) could not be evaluated in the 515 patients who were still hospitalized at the time of analysis. Thus, their exclusion was mandatory. Additionally, it is reasonable to exclude the 294 patients on chronic anticoagulant treatment before hospitalization. Otherwise, coagulopathy, which is a common complication of severe COVID-19 patients, could have been underestimated.

Follow up time is not clear. (or time to mortality.)? Since the aim of the study was to evaluate in-hospital mortality, the definition of follow up time is the number of days the patient has been hospitalized. As stated in the manuscript, “no data from possible hospital readmission of survivors were available, and it is possible that some initially recovered patients may have worsened a few days later”.

3. it is not clear how many patients were mechanical ventilated?

Due to the massive number of infected subjects, both the Emergency department and the Ward of our institution and of many other hospitals were overwhelmed. The access to invasive ventilation and/or intensive care units were limited or prioritized to young patients developing severe hypoxemic respiratory failure. Therefore, a lot of patients candidate to mechanical ventilation did not get it. That is the reason why we did not analyze it, since the results would have been biased.

4.lines 262- 264 " They were then examined in a multivariate logistic regression model including 1270 patients to identify independent prognostic factors of moderate/severe COVID-19 in-hospital mortality (Table 2)."

Not clear. Sub-analysis of 2070 patients? Why are the 800 patients excluded from this Analysis. As it is stated in the manuscript (Methods section; Statistical analysis): “Statistically significant variables in the categorical analysis were included in a logistic regression model, performed in a randomly selected training cohort including around 60% of the total amount of patients”.

Is that the independent sample of patients about which we read in the abstract? Not clear. Those 1270 patients are the development cohort (around 60% of the total population).

IF Yes: no characteristics of this patient population present.

Univariable analysis was carried out in the whole cohort, and their characteristics are summarized in Table 1.

5. Line 283 A cut-off of 0.076 was arbitrarily selected.

Please elaborate more why this cut off value and not another? This arbitrary cut-off was selected in order to maximize sensitivity and negative predictive value. In this way, patients classified as low-risk on admission could get either discharged early or derived to other centers without intensive care units with the certainty that their likelihood of dying is not as high as those classified as high risk.

6. If a major Revision is possible, the analysis must be checked by a statistician. One of the authors, who performed the statistical analysis, has a master in biomedical research and statistical analysis of public health data from Universidad Autónoma of Barcelona.

Minor issues

1. The language need to be re-edited, there are many typos as well as weak sentences: lines 85-86 To evaluate the risk of demographic data, medical history, and on-admission laboratory parameters in hospitalized COVID-19 patients to predict mortality

(Not well written)

We have modified this sentence.

Lines 133-135 Clinicians need better predictors of which patients are prone to deteriorate rapidly or who may go on to die

(“predict Mortality” better) not go die!!!!

We have modified this sentence.

I hope now you consider the manuscript acceptable for publication.

Yours sincerely,

Juan Manuel Alonso-Domínguez

Attachment

Submitted filename: Response to Reviewers Jan 25.docx

Decision Letter 1

Aleksandar R Zivkovic

11 Feb 2021

Development and validation of a predictive model of in-hospital mortality in COVID-19 patients

PONE-D-20-35733R1

Dear Dr. Alonso-Dominguez,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Aleksandar R. Zivkovic

Academic Editor

PLOS ONE

Acceptance letter

Aleksandar R Zivkovic

23 Feb 2021

PONE-D-20-35733R1

Development and validation of a predictive model of in-hospital mortality in COVID-19 patients

Dear Dr. Alonso-Dominguez:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Aleksandar R. Zivkovic

Academic Editor

PLOS ONE


Articles from PLoS ONE are provided here courtesy of PLOS

RESOURCES