Predicting Illness Severity and Short-Term Outcomes of COVID-19: A Retrospective Cohort Study in China

Chuming Chen; Haihui Wang; Zhichao Liang; Ling Peng; Fang Zhao; Liuqing Yang; Mengli Cao; Weibo Wu; Xiao Jiang; Peiyan Zhang; Yinfeng Li; Li Chen; Shiyan Feng; Jianming Li; Lingxiang Meng; Huishan Wu; Fuxiang Wang; Quanying Liu; Yingxia Liu

doi:10.1016/j.xinn.2020.04.007

. 2020 Apr 22;1(1):100007. doi: 10.1016/j.xinn.2020.04.007

Predicting Illness Severity and Short-Term Outcomes of COVID-19: A Retrospective Cohort Study in China

Chuming Chen ^1,³, Haihui Wang ^2,³, Zhichao Liang ^2,³, Ling Peng ^1,³, Fang Zhao ¹, Liuqing Yang ¹, Mengli Cao ¹, Weibo Wu ¹, Xiao Jiang ¹, Peiyan Zhang ¹, Yinfeng Li ¹, Li Chen ¹, Shiyan Feng ¹, Jianming Li ¹, Lingxiang Meng ¹, Huishan Wu ¹, Fuxiang Wang ¹, Quanying Liu ^2,^∗, Yingxia Liu ^1,^∗∗

PMCID: PMC7237911 PMID: 33554186

Among 417 COVID-19 patients in Shenzhen, demographic characteristics, clinical manifestations and baseline laboratory tests showed significant differences between mild-moderate cohort and severe-critical cohort.Based on these differences, a convenient mathematical model was established to predict the illness severity of COVID-19. The model includes four parameters: age, BMI, CD4⁺ lymphocytes and IL-6 levels. The AUC of the model is 0.911.The high risk factors for developing to severe COVID-19 are: age ≥ 55 years, BMI > 27 kg / m², IL-6 ≥ 20 pg / ml, CD4⁺ T cell ≤ 400 count / μ L.Among 249 discharged COVID-19 patients, those who recovered after 20 days had a lower count of platelet, a higher level of estimated glomerular filtration rate, and higher level of interleukin-6 and myoglobin than those who recovered within 20 days.

Keywords: COVID-19, prediction model, illness severity, CD4⁺ T lymphocyte

Graphical Abstract

Introduction

COVID-19, caused by SARS-CoV-2, is a highly contagious disease.¹ By April 8, 2020, more than 1,350,000 patients were diagnosed with COVID-19 globally, with more than 79,000 deaths worldwide attributable to the disease.² Recent clinical data reported that mild and critical patients manifested different symptoms. Most of the mild patients with COVID-19 had symptoms such as fever, cough, and mild pneumonia, whereas the critical cases presented dyspnea, respiratory failure, sepsis, organ dysfunction, and even eventual death.3, 4, 5 Therefore, we hypothesize that the clinical and laboratory characteristics may be associated with patients' severity and short-term outcomes. A large cohort study indicated that the total mortality rate of COVID-19 was 2.3%; however, the rate of severe cases was 8.1% while that of critical cases reached 14%.⁶ In particular, patients with complications, such as septic shock and organ dysfunction, had the highest mortality rate.⁴^,⁵ The median length of severe patients' aggravation period was 10 days.⁵ Without specific antiviral agents, an early intervention and supportive treatments are essential for improving clinical outcomes and reducing mortality in severe patients.

The ongoing outbreak of COVID-19 has been overloading medical systems worldwide. An accurate prediction of disease progression is conducive to informative clinical decision making and optimized allocations of hospital resources. Several modeling studies on COVID-19 have been reported to predict the spreading profile using the number of infected patients and transportation,⁷^,⁸ as well as to predict the illness severity and mortality based on the clinical features.⁹ In our previous clinical practice and research, we found that the individual immune status plays an important role in the progress and prognosis of the disease.¹⁰^,¹¹ In this study we aimed to build a model combining clinical characteristics and laboratory findings, including the immune status, to predict the progression and outcomes of COVID-19.

Results

Demographics and Baseline Characteristics

Our study included all 417 confirmed COVID-19 cases in Shenzhen by February 29, 2020, who were all admitted to the Third People's Hospital of Shenzhen. The patients were divided into two groups, comprising 325 (77.9%) mildly to moderately ill patients and 92 (22.1%) severely to critically ill patients. The sample consisted of 198 males (47.5%) and 219 females (52.5%), but the sex ratio was skewed toward males in the severe-critical group (64.1%). No patient had an exposure history (shopping) in the Huanan Seafood Market in Wuhan, although 309 (74.1%) had a traveling history in Hubei province. In addition, 279 (66.9%) patients had a fever before admission and 148 (25.5%) presented with cough. Other symptoms that were not common included expectoration (3.1%), headache (3.4%), myalgia (3.2%), chill (0.7%), nausea or vomiting (0.2%), and diarrhea (2.6%). Most symptom profiles were comparable between the mild-moderate group (n = 325) and the severe-critical group (n = 92) while fever and cough occurred in a significantly higher proportion among severely to critically ill patients (p < 0.001 for fever and p = 0.015 for cough). Hypertension (23 [5.5%]), chronic liver disease (23 [5.5%]), and diabetes (15 [10.1%]) were the most common co-existing conditions. Moreover, 367 (88%) patients had pneumonia first time on admission (Table 1).

Table 1.

Epidemiological and Baseline Clinical Features of 417 Confirmed COVID-19 Patients in the Third People's Hospital of Shenzhen from January 11 to February 18, 2020

Characteristics	COVID-19 Patients
Characteristics	Total (N = 417)	Mild-Moderate (n = 325)	Severe-Critical (n = 92)	p-value
Median age (IQR)	47 (34–60)	41 (31–56)	61 (52–65)	<0.001
Age subgroups
0–17 years	30 (7.2)	30 (9.2)	0	–
18–60 years	286 (68.6)	241 (74.2)	45 (49)	–
>60 years	101 (24.2)	54 (16.6)	47 (51)	–
Male (%)	198 (47.5)	139 (42.8)	59 (64.1)	<0.001
BMI (kg/m²)	23.14 (20.96–25.39)	22.68 (20.43–24.97)	24.52 (22.84–26.68)	<0.001
Initial symptoms
Fever	279 (66.9)	199 (61.2)	80 (87.0)	<0.001
Cough	148 (25.5)	105 (32.3)	43 (46.7)	0.015
Expectoration	13 (3.1)	9 (2.8)	4 (4.3)	0.50
Headache	14 (3.4)	12 (3.7)	2 (2.2)	0.74
Myalgia	13 (3.2)	10 (3.1)	3 (3.3)	>0.99
Chill	3 (0.7)	2 (0.6)	1 (1.1)	0.53
Nausea or vomiting	1 (0.2)	1 (0.3)	0	>0.99
Diarrhea	11 (2.6)	7 (2.2)	4 (4.3)	0.27
Co-existing chronic medical conditions
Chronic heart disease	9 (2.2)	2 (0.6)	7 (7.6)	<0.001
Chronic lung disease	3 (0.7)	2 (0.6)	1 (1.1)	0.53
Chronic renal disease	6 (1.4)	2 (0.6)	4 (4.3)	0.023
Chronic liver disease	23 (5.5)	15 (4.6)	8 (8.7)	0.21
Diabetes	15 (3.6)	7 (2.2)	8 (8.7)	<0.01
Hypertension	23 (5.5)	13 (4.0)	10 (10.9)	0.022
Cancer	1 (0.2)	0	1 (1.1)	0.22
Exposure history
Traveling history to Hubei	309 (74.1)	236 (72.6)	73 (79.3)	0.24
Interval (days), median (IQR)
Onset to admission	3 (1–6)	3 (1–6)	4 (2–7)	>0.99
Onset to diagnosis	2 (0–5)	1 (0–4)	2 (1–6)	<0.01
Bilateral pneumonia	295 (70.7)	211 (64.9)	84 (91.3)	<0.001
Unilateral pneumonia	72 (17.3)	67 (20.6)	5 (5.43)	0.001

Open in a new tab

Values are number (%) except where indicated otherwise. p values indicate differences between mild-moderate and severe-critical patients. p < 0.05 was considered statistically significant.

BMI, body mass index; IQR, interquartile range.

Laboratory Findings in 417 Confirmed COVID-19 Patients

Levels of white blood cells, hemoglobin, triglyceride, low-density lipoprotein, high-density lipoprotein, and cholesterol did not differ between the two groups. These measures were recorded on the day of admission for all patients. They were then divided into groups according to illness severity. There were numerous differences in laboratory findings between the two groups, including the level of lymphocytes (1.44 [1.09–1.96] versus 1.02 [0.83–1.28]), the level of neutrophils (2.55 [1.92–3.42] versus 3.03 [2.28–4.39]), the level of platelets (197 [158–241] versus 151 [129–178]), the cell count of CD4⁺ T lymphocytes (618 [450–805] versus 334.5 [201–458]), the cell count of CD8⁺ T lymphocytes (272 [417–571] versus 177.5 [112–289]), the level of D-dimers (0.33 [0.24–0.47] versus 0.53 [0.36–0.83]), the level of alanine aminotransferase (20 [13.45–28] versus 26.65 [20–38]), the level of aspartate aminotransferase (25 [20–33] versus 34[26-48])estimated glomerular filtration rate (108.53 [97.16–118.90] versus 92.50 [76.15–101.76]), the level of chromium (60 [50–74] versus 72 [57–93]), the level of myoglobin (33.02 [26.12–44.03] versus 56.19 [40.63–110.16]), the level of brain natriuretic peptide (1.74 [0.70–3.55] versus 7.55 [4.32–16.32]), the level of troponin I (0.005 [0.005–0.005] versus 0.009 [0.005–0.017]), the level of lactate dehydrogenase (209 [167–372] versus 336 [213–592]), the level of procalcitonin (0.034 [0.023–0.052] versus 0.0645 [0.049–0.088]), the level of C-reactive protein (6.0 [2.2–16.5] versus 27.5 [12.5–52.8]), the level of interleukin-6 (IL-6) (8.51 [3.82–16.03] versus 24.17 [16.26–43.02]), and erythrocyte sedimentation rate (23 [13–41] versus 41 [24–57]) (Table 2).

Table 2.

Laboratory Findings in 417 Confirmed COVID-19 Patients in the Third People's Hospital of Shenzhen from January 11 to February 18, 2020

Variables	COVID-19 Patients			p -value
Variables	Total	Mild-Moderate	Severe-Critical	p -value
WBC (×10⁹/L)	4.67 (3.79–5.83)	4.64 (3.76–5.72)	4.82 (3.85–5.95)	>0.99
LYM (×10⁹/L)	1.31 (0.99–1.82)	1.44 (1.09–1.96)	1.02 (0.83–1.28)	<0.001
NEU (×10⁹/L)	2.68 (2.01–3.59)	2.55 (1.92–3.42)	3.03 (2.28–4.39)	<0.01
HGB (g/L)	137 (127–146)	137 (126–146)	138 (127–151)	>0.99
PLT (×10⁹/L)	186 (149–231)	197 (158–241)	151 (129–178)	<0.001
CD4⁺ cell count	535 (380–742)	618 (450–805)	334.5 (201–458)	<0.001
CD8⁺ cell count	350 (202–522)	272 (417–571)	177.5 (112–289)	<0.001
CD4⁺/CD8⁺	1.62 (1.15–2.16)	1.55 (1.15–1.99)	1.83 (1.24–2.49)	0.33
D-DIC (μg/mL)	0.36 (0.25–0.53)	0.33 (0.24–0.47)	0.53 (0.36–0.83)	<0.001
ALT (U/L)	21 (15–31)	20 (13.45–28)	26.65 (20–38)	<0.001
AST (U/L)	26 (21–35)	25 (20–33)	34 (26–48)	<0.001
K (mmol/L)	3.89 (3.64–4.16)	3.90 (3.68–4.18)	3.79 (3.54–4.09)	0.30
Na (mmol/L)	138.5 (136.8–139.8)	138.7 (137.3–140.0)	136.8 (134.9–139.1)	<0.001
Cr (μmol/L)	62 (52–76)	60 (50–74)	72 (57–93)	<0.001
eGFR (mL/min)	105.18 (93.55–116.64)	108.53 (97.16–118.90)	92.50 (76.15–101.76)	<0.001
CK (U/L)	67 (51–100)	66 (49–93)	83 (56–170)	0.073
LDH (U/L)	224 (176–422)	209 (167–372)	336 (213–592)	<0.001
MYO (ng/mL)	37.21 (27.94–53.31)	33.02 (26.12–44.03)	56.19 (40.63–110.16)	<0.001
BNP (pg/mL)	2.91 (0.98–8.23)	1.74 (0.70–3.55)	7.55 (4.32–16.32)	<0.001
TnI (μg/L)	0.005 (0.005–0.005)	0.005 (0.005–0.005)	0.009 (0.005–0.017)	<0.001
TG (mmol/L)	0.95 (0.73–1.31)	0.91 (0.70–1.37)	1.02 (0.83–1.24)	>0.99
LDL (mmol/L)	2.32 (1.91–2.78)	2.44 (1.98–2.81)	2.11 (1.83–2.51)	0.96
HDL (mmol/L)	1.17 (0.97–1.39)	1.21 (1.00–1.43)	1.11 (0.90–1.26)	0.33
CHOL (mmol/L)	3.96 (3.43–4.61)	4.12 (3.57–4.65)	3.62 (3.28–4.21)	0.14
PCT (ng/mL)	0.039 (0.025–0.062)	0.034 (0.023–0.052)	0.0645 (0.049–0.088)	<0.001
CRP (mg/L)	8.7 (2.5–24.8)	6.0 (2.2–16.5)	27.5 (12.5–52.8)	<0.001
IL-6 (pg/mL)	11.88 (4.61–19.63)	8.51 (3.82–16.03)	24.17 (16.26–43.02)	<0.001
ESR (mm/h)	28 (14–45)	23 (13–41)	41 (24–57)	<0.001

Open in a new tab

p values indicate differences between mild-moderate and severe-critical patients. p < 0.05 was considered statistically significant.

WBC, white blood cells; LYM, lymphocytes; NEU, neutrophils; HGB, hemoglobin; PLT, platelets; CD4⁺, CD4⁺ T lymphocyte; CD8⁺, CD8⁺ T lymphocyte; D-DIC, D-dimer; ALT, alanine aminotransferase; AST, aspartate aminotransferase; K, potassium; Na, sodium; Cr, chromium; eGFR, estimated glomerular filtration rate; CK, creatine kinase; MYO, myoglobin; BNP, brain natriuretic peptide; TnI, troponin I; LDH, lactate dehydrogenase; TG, triglyceride; LDL, low-density lipoprotein; HDL, high-density lipoprotein; CHOL, cholesterol; PCT, procalcitonin; CRP, C-reactive protein; IL-6, interleukin-6; ESR, erythrocyte sedimentation rate.

Demographics and Baseline Characteristics of 249 Discharged Patients

The study included 249 patients discharged up until February 24, 2020. In this cohort, the median age was 44 years (interquartile range [IQR], 32–57) and 121 (48.6%) were males. Of these patients, 121 (48.6%) were discharged within 20 days after admission. In addition, 128 (51.4%) recovered after 20 days in hospital, among whom 91 (71.1%) were between 18 and 60 years old. The median duration from onset to hospital admission and from hospitalization to diagnosis was 3 days (IQR, 1–5) and 2 days (IQR, 0–4), respectively. The most common symptoms at the onset were fever (173 [69.5%]) and cough (95 [38.2%]). Less common symptoms included expectoration, headache, diarrhea, nausea, and vomiting. Hypertension (29 [7.6%]), chronic liver disease (21 [8.4%]), and diabetes (12 [4.8%]) were the most common co-existing conditions (Table 3).

Table 3.

Epidemiological and Baseline Clinical Features of 249 Discharged COVID-19 Patients in the Third People's Hospital of Shenzhen until February 24, 2020

Characteristics	COVID-19 Patients			p-alue
Characteristics	Total (N = 249)	≤20 days (n = 121)	>20 days (n = 128)	p-alue
Age (years), median (IQR)	44 (32–57)	36 (25–53)	50 (38–61)	<0.001
Age subgroups
0–17 years	21 (8.4)	17 (14.0)	4 (3.1)	–
18–60 years	175 (70.3)	84 (69.4)	91 (71.1)	–
>60 years	53 (21.3)	20 (16.5)	33 (25.8)	–
Male (%)	121 (48.6)	57 (47.1)	64 (50.0)	0.74
BMI (kg/m²)	23.26 (21.26–25.22)	23.25 (21.33–25.15)	23.26 (21.26–25.28)	>0.99
Initial symptoms
Fever	173 (69.5)	71 (58.7)	102 (79.7)	<0.001
Cough	95 (38.2)	43 (35.5)	52 (40.6)	0.49
Expectoration	3 (1.2)	2 (1.7)	1 (0.8)	0.61
Headache	9 (3.6)	3 (2.5)	6 (4.7)	0.50
Myalgia	8 (3.2)	3 (2.5)	5 (3.9)	0.72
Chill	1 (0.4)	0	1 (0.8)	>0.99
Nausea or vomiting	1 (0.4)	0	1 (0.8)	>0.99
Diarrhea	6 (2.4)	2 (1.7)	4 (3.1)	0.68
Co-existing chronic medical conditions
Chronic heart disease	7 (2.8)	4 (3.3)	3 (2.3)	0.72
Chronic lung disease	3 (1.2)	1 (0.8)	2 (1.6)	>0.99
Chronic renal disease	6 (2.4)	2 (1.7)	4 (3.1)	0.68
Chronic liver disease	21 (8.4)	10 (8.3)	11 (8.6)	>0.99
Diabetes	12 (4.8)	4 (3.3)	8 (6.3)	0.43
Hypertension	29 (7.6)	7 (5.8)	12 (9.4)	0.41
Cancer	1 (0.4)	0	1 (0.8)	>0.99
Exposure history
Traveling history to Hubei	197 (79.1)	91 (75.2)	106 (82.8)	0.19
Interval (days), median (IQR)
Onset to admission	3 (1–5)	2 (1–4)	4 (2–7)	<0.001
Onset to diagnosis	2 (0–4)	1 (0–3)	3 (1–6)	<0.001

Open in a new tab

Values are number (%) except where indicated otherwise. p values indicate differences between onset to discharged date ≤20 days and >20 days. p < 0.05 was considered statistically significant.

Compared with patients who recovered within 20 days, those who recovered after 20 days were significantly older (median age, 50 [IQR, 38–61] versus 36 [IQR, 25–53] years; p <.001) and were more likely to have a fever before admission (102 [79.7%] versus 71 [58.7%]) (Table 3).

Laboratory Findings of 249 Discharged Patients

On the day of admission, those of the 249 discharged patients who recovered after 20 days had a lower count of platelets, a higher estimated glomerular filtration rate, and higher level of IL-6 and myoglobin than those who recovered within 20 days; among 249 discharged patients, the median level of platelets, estimated glomerular filtration rate, level of IL-6, and level of myoglobin were 187 × 10⁹/L (IQR, 150–227), 106.58 mL/min (IQR, 94.43–118.12), 10.76 pg/mL (IQR, 4.54–19.07), and 36.85 ng/mL (IQR, 28.59–58.06), respectively (Table 4).

Table 4.

Laboratory Findings in 249 Discharged COVID-19 Patients in the Third People's Hospital of Shenzhen until February 24, 2020

Variables	COVID-19 Patients			p-value
Variables	Total (N = 249)	≤20 days (n = 121)	>20 days (n = 128)	p-value
WBC (×10⁹/L)	4.63 (3.79–5.72)	4.71 (4.06–5.83)	4.54 (3.58–5.71)	>0.99
LYM (×10⁹/L)	1.33 (1–1.77)	1.5 (1.12–1.95)	1.22 (0.98–1.63)	0.054
NEU (×10⁹/L)	2.58 (1.98–3.49)	2.58 (2.03–3.48)	2.58 (1.98–3.50)	>0.99
HGB (g/L)	137 (128–148)	137 (127–147)	138 (130–148)	>0.99
PLT (×10⁹/L)	187 (150–227)	197 (163–244)	174 (146–204)	0.012
CD4⁺ cell count	550 (414–746)	618 (453–841)	519 (377–658)	0.14
CD8⁺ cell count	368 (234–526)	429 (281–585)	335 (218–465)	0.078
CD4⁺/CD8⁺	1.55 (1.12–2.03)	1.55 (1.11–1.92)	1.54 (1.19–2.17)	>0.99
D-DIC (μg/mL)	0.34 (0.24–0.50)	0.31 (0.22–0.44)	0.36 (0.26–0.54)	0.96
ALT (U/L)	21.4 (15.1–30.625)	20 (14.85–32)	23 (16–30)	>0.99
AST (U/L)	27 (21–34)	26 (20–33)	28 (22–35)	>0.99
K (mmol/L)	3.90 (3.65–4.16)	3.94 (3.66–4.17)	3.89 (3.63–4.15)	>0.99
Na (mmol/L)	138.5 (136.8–139.7)	138.7 (137.225–139.7)	138 (136.1–139.7)	>0.99
Cr (μmol/L)	64 (52–77)	62 (49–75)	64 (54–79)	>0.99
eGFR (mL/min)	106.58 (94.43–118.12)	111.11 (99.43–122.05)	103.56 (92.53–115.10)	<0.01
CK (U/L)	67 (51–104)	66 (54–93)	76 (49–114)	>0.99
LDH (U/L)	223 (174–406)	207 (165–384)	234 (181–429)	>0.99
MYO (ng/mL)	36.85 (28.59–58.06)	31.67 (25.30–46.80)	42.73 (32.29–68.32)	0.024
BNP (pg/mL)	1.94 (0.86–8.28)	1.84 (1.1–3.078)	4.32 (0.59–12.39)	>0.99
TnI (μg/L)	<0.006	<0.006	<0.006	>0.99
TG (mmol/L)	0.85 (0.69–1.23)	0.77 (0.66–1.13)	1.0 (0.77–1.37)	0.87
LDL (mmol/L)	2.27 (1.94–2.82)	2.23 (1.89–2.77)	2.32 (1.99–2.86)	>0.99
HDL (mmol/L)	1.21 (1.00–1.41)	1.25 (1.00–1.49)	1.16 (1.01–1.35)	>0.99
CHOL (mmol/L)	3.96 (3.48–4.64)	3.87 (3.45–4.58)	3.96 (3.52–4.72)	>0.99
PCT (ng/mL)	0.038 (0.025–0.059)	0.037 (0.025–0.059)	0.039 (0.026–0.058)	>0.99
CRP (mg/L)	8.6 (2.5–21.6)	6.7 (2.5–18.2)	11.7 (3.2–25.7)	0.22
IL-6 (pg/mL)	10.76 (4.54–19.07)	7.76 (3.44–16.73)	13.54 (6.90–23.51)	0.044
ESR (mm/h)	25 (13–44)	23 (12–40)	27 (15–48)	>0.99

Open in a new tab

All laboratory values are in the form of medians(interquartile range). p values indicate differences between mild-moderate and severe-critical patients. p < 0.05 was considered statistically significant.

Performance of the Auxiliary Tool for Stratifying Risk of Patients

According to the mentioned methods and analysis, all significant variables were involved in a logistic regression model to predict whether a patient was of a mild-moderate type or a severe-critical type. Insignificant variables during model training were eliminated. This left four interpretable and significant variables, namely IL-6, CD4⁺ T lymphocyte, body mass index (BMI), and age. Among the 417 patients, 222 possessed complete data of these four parameters, which were thus collected. Therefore, there were four variables (five parameters with intercept) in our model. The sample size is around 44 (222/5) times larger than the free parameters, which is four times larger than the empirical experience. Thus, the sample size is sufficient to train our model. However, since individual differences will influence the modeling results, it is still a limitation that we did not collect completely clinical test results for each patient when hospitalized at the outset.

The logistic regression model was interpretable for clinical prediction. Relative weights were assigned according to each variable's regression coefficient. The coefficient values for each variable are 0.0695(IL6), -0.0051(CD4⁺ T cell), 0.2799(BMI) and 0.0691(Age) respectively. The odds ratios for each variable are 1.0720(IL6), 0.9949(CD4⁺ T cell), 1.323(BMI) and 1.0716(Age) respectively. All these 4 variables were statistics significant with p<0.05. The area under the curve (AUC) of the training set was 0.921 and that of the testing set was 0.911. Figure 1 shows the sensitivity and specificity of our model.

ROC Curve from the Testing Set

The sensitivity and specificity of the model are indicated in the text.

After model training, cut-off values of the four variables were selected based on statistical results and clinical experience. The median probability of progressing to severely to critically ill was significantly higher in patients with a BMI greater than 27 than in those with a BMI of or below 27 (53.18% versus 7.76%). This trend remained for the parameters of CD4⁺ lymphocytes, IL-6, and age. Patients with a CD4⁺ T lymphocyte level of ≤400/μL showed a lower median probability of progressing to severe-critical status (62.52%) than those with a level >400/μL. Patients with level of IL-6 ≥20 pg/mL were more likely to develop into a severely to critically ill case (median probability 70.01%) than those whose level was lower than 20 pg/mL (3.80%). Patients aged ≥55 years had a higher probability of developing into a severe-critical case (41.15%) than those younger than 55 (1.73%). The logistic regression model was then retrained with two separate ranges under these four variables. For model validation, we applied a repeated k-fold cross-validation method of 10-fold with 10 repeats. Values of R², root mean squared error (RMSE), and mean absolute error (MAE) were used for model performance evaluation. Results showed R² = 0.596, MAE = 0.21, and RMSE = 0.31. In the end, we predicted the probability for each of 16 (2⁴ = 16) types of initial test results, shown in Figure 2. These results represent potential indicators for early initial auxiliary prediction.

The Probability under 16 Types of Initial Test Results

Older age (≥55 years), high BMI (>27 kg/m²), low level of CD4⁺ T lymphocytes (≤400/μL), and high level of IL-6 (≥20 pg/mL) have the highest probability of progressing into severe-critical COVID-19. Index units: age (years), BMI (kg/m²), IL-6 (pg/mL), CD4⁺ T lymphocytes (count/μL).

Discussion

COVID-19 has been spreading worldwide,² putting a huge amount of pressure on medical systems in a short time. Accurate prediction of disease progression for COVID-19 patients is therefore urgent to facilitate appropriate clinical decision making and optimized medical resource allocations. Risk stratification management will help to alleviate burdens of insufficient medical resources and further reduce mortality.¹²

Our current research showed that there were significant differences between the mild-moderate group and the severe-critical group concerning several epidemiological and clinical features (age, gender, and fever). Results were consistent with previous reports that age, gender, and underlying diseases might be associated with disease severity.⁴^,⁹ In terms of laboratory data, 18 out of 27 tested baseline parameters, including levels of blood cells, indicators of myocardial zymogram, liver and renal function, indicators of coagulation function, and levels of infection-related biomarkers were significantly different between the two groups. Hence, we recommended four parameters—age, BMI, and levels of CD4⁺ T lymphocytes and IL-6—as predictors in future multivariate regression analysis. The elderly (≥55 years), a high level of BMI (>27 kg/m²), a low level of CD4⁺ T lymphocytes (≤400/μL), and a high level of IL-6 (≥20 pg/mL) are markers associated with the progression to severe and critical illness among COVID-19 patients.

In a large cohort study of COVID-19 in China, the median age of severe cases was 52 years while that of mild cases was 45 years.⁴ Consistent with our findings, an advanced age was associated with immunity decline characterized by decreases in both humoral and cellular responses according to previous research.¹³ Although underlying reasons remained unknown, we found a correlation between BMI and disease severity. This finding was in line with results showing BMI as an independent risk factor that could predict the incidence of pneumonia among patients with chronic obstructive pulmonary disease. Lymphocytopenia was present in 83.2% of the hospitalized patients. Severe patients showed more prominent laboratory abnormalities, including lymphocytopenia and leukopenia, than mild ones.⁶ Among the severe-critical cohort, we also observed lower levels of lymphocytes and CD4⁺ T lymphocytes. Those with a level of CD4⁺ T lymphocytes at or below 400/μL suffered a higher risk of progression to severe cases. The cytokine storm caused by a vigorous immune response against the virus was a significant predicting factor for the severity of viral pneumonia. Plasma levels of IL-2, IL-7, IL-10, granulocyte colony-stimulating factor, interferon-γ-inducible protein-10, monocyte chemotattractant protein-1, macrophage inflammatory protein-1α, and tumor necrosis factor α in COVID-19 were higher in intensive care unit (ICU) cases than non-ICU cases.⁴ Being one of the important cytokines, IL-6 could complement the function of CD8⁺ T lymphocytes and might aggravate the clinical course.¹⁴ Compared with other cytokines, IL-6 is more widely used and more meaningful clinically.

Data for the four clinical test indexes proposed by our model are easy to obtain. Therefore, we recommend that all related characteristics should be tested for patients immediately after admission. Our model reflected the real-world scenario in clinical practice. Receiver-operating characteristic (ROC) analysis suggested that this model had functional capacity in predicting severity among COVID-19 patients (AUC = 0.911). The score may be regarded as a predictor of prognosis to prompt appropriate early intervention. Hence, we present the probability graph of progressing to severe or critical illness, which could be of clinical reference for early diagnosis.

Despite remarkable findings, the study has a few limitations. First, we did not measure broad-spectrum cytokines and viral loads, which may be related to disease progression and severity. Second, the retrospective single-center design led to missing data and unavoidable bias when identifying participants. A larger sample size would be in favor the generalizability of a prediction model. This model would need to be further verified in other cohorts and in clinical practice. Moreover, we did not compare our model performance with alternative machine-learning models, such as the random forest model and the extreme gradient boosting model. It will add value to compare efficacies of different models in the future.

In summary, we proposed a model containing four routine baseline parameters. This allowed accurate prediction of severity among COVID-19 patients. The model accurately stratified these patients into relevant risk categories. It could further help to produce appropriate clinical decisions and optimized allocations of hospital resources when shifting attention to the server during the pandemic.

Material and Methods

Patients

All patients from Shenzhen were admitted to the Third People's Hospital of Shenzhen once their SARS-CoV-2 sample results, collected in upper respiratory tract and tested by real-time PCR, turned out to be positive. All 417 patients included in this retrospective study were confirmed cases who were admitted to this designated infectious disease hospital in Shenzhen from January 11 to February 18, 2020. Ethical approval had been obtained from the Ethics Committee of the Third People's Hospital of Shenzhen.

Procedures

Nasopharyngeal and oropharyngeal specimens for SARS-CoV-2 tests were collected with synthetic fiber swabs under the guidelines introduced by either the World Health Organization (WHO) or the Chinese Center for Disease Control and Prevention (Chinese CDC). Stored at 2°C–8°C, these specimens were packaged with ice and shipped to the Shenzhen Center for Disease Control and Prevention (Shenzhen CDC). Following standard protocols released by Chinese CDC, RNA was extracted and tested by real-time RT-PCR with SARS-CoV-2-specific primers and probes in level-2 biosafety facilities at Shenzhen CDC.¹

In addition to treatment outcomes, epidemiological, demographic, clinical, laboratory, and radiological profiles of all patients were followed up. Data were sorted into standardized data collection forms and then reviewed by experienced physicians. We defined the “onset date” as the day when symptoms appeared. For patients without initial symptoms, the onset date was defined as the day of admission. According to the WHO interim guidance,¹⁵ all COVID-19 patients were diagnosed and classified into mild to critical pneumonia cases. The National Health Commission of the People's Republic of China issued the seventh version of COVID-19 Treatment Guideline, in which patients were also categorized into mild, moderate (typical), severe, and critically ill cases.¹⁶

In the following analysis, we divided patients into two cohorts, namely a mild-moderate cohort and a severe-critical cohort according to the above classification criteria.

Discharge Criteria

The COVID-19 guidelines (seventh version) issued by the National Health Commission of the People’ Republic of China required that all patients must meet certain criteria before being discharged.¹⁶

Statistical Analysis

Continuous variables were described using median and interquartile range (IQR) values and were compared using the Mann-Whitney U test. Categorical variables were described as frequency rates, percentages, and proportions and were compared using Chi-square test. When a p value of Chi-square test was around α (0.05), attention was paid to ensure that the approximation of the Chi-square test met with requirements. Under this circumstance, Fisher's test was performed instead when necessary.¹⁷ Pairwise comparisons between groups were performed using Bonferroni’s correction. All p values were from two-sided tests, and results were deemed statistically significant at p < 0.05. All statistical analysis was done using R (version 3.6.2).

We selected statistically significant variables to develop a straightforward clinical predicting tool—a logistic regression model—used for early diagnosis. Variables with an insignificant p value were considered as irrelevant and were eliminated during the training process. The area under the ROC curves in the model's sensitivity and specificity results and repeated k-fold cross-validation results were used to assess how well the model performed. Cut-off values of each variable were obtained from statistical results and clinical experience. We then retrained the clinical predicting model to establish an auxiliary diagnosis table showing the probability of developing into a severe-critical case.

Declaration of Interests

None.

Acknowledgments

We thank the patients, nurses, and physicians who provided care for the patients, and the investigators at the Third People's Hospital of Shenzhen. This work is supported by grants from the Science and Technology Innovation Committee of Shenzhen Municipality (202002073000001), National Key Research and Development Program (2020YFC0841700), High-level University Fund (no. G02386301, G02386401), and Guangdong Natural Science Foundation Joint Fund (no. 2019A1515111038).

Published Online: May 21, 2020

Contributor Information

Quanying Liu, Email: liuqy@sustech.edu.cn.

Yingxia Liu, Email: yingxialiu@hotmail.com.

References

1.Li Q., Guan X., Wu P., Wang X., Zhou L., Tong Y., Ren R., Leung K.S.M., Lau E.H.Y., Wong J.Y., et al. Early transmission dynamics in Wuhan, China, of novel coronavirus-infected pneumonia. N. Engl. J. Med. 2020;382:1199–1207. doi: 10.1056/NEJMoa2001316. [DOI] [PMC free article] [PubMed] [Google Scholar]
2.Coronavirus disease 2019 (COVID-19) situation report-79. https://www.who.int/docs/default-source/coronaviruse/situation-reports/20200408-sitrep-79-covid-19.pdf?sfvrsn=4796b143_4
3.Chen N., Zhou M., Dong X., Qu J., Gong F., Han Y., Qiu Y., Wang J., Liu Y., Wei Y., et al. Epidemiological and clinical characteristics of 99 cases of 2019 novel coronavirus pneumonia in Wuhan, China: a descriptive study. Lancet. 2020;395:507–513. doi: 10.1016/S0140-6736(20)30211-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
4.Huang C., Wang Y., Li X., Ren L., Zhao J., Hu Y., Zhang L., Fan G., Xu J., Gu X., et al. Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. Lancet. 2020;395:497–506. doi: 10.1016/S0140-6736(20)30183-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
5.Wang D., Hu B., Hu C., Zhu F., Liu X., Zhang J., Wang B., Xiang H., Cheng Z., Xiong Y., et al. Clinical characteristics of 138 hospitalized patients with 2019 novel coronavirus–infected pneumonia in Wuhan, China. JAMA. 2020 doi: 10.1001/jama.2020.1585. [DOI] [PMC free article] [PubMed] [Google Scholar]
6.Guan W., Ni Z., Hu Y., Liang W.H., Ou C.Q., He J.X., Liu L., Shan H., Lei C.L., Hui D.S.C., et al. Clinical characteristics of coronavirus disease 2019 in China. N. Engl. J. Med. 2020 doi: 10.1056/NEJMoa2002032. [DOI] [PMC free article] [PubMed] [Google Scholar]
7.Wu J.T., Leung K., Leung G.M. Nowcasting and forecasting the potential domestic and international spread of the 2019-nCoV outbreak originating in Wuhan, China: a modelling study. Lancet. 2020;395:689–697. doi: 10.1016/S0140-6736(20)30260-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
8.Tang B., Bragazzi N.L., Li Q., Tang S., Xiao Y., Wu J. An updated estimation of the risk of transmission of the novel coronavirus (2019-nCov) Infect. Dis. Model. 2020;5:248–255. doi: 10.1016/j.idm.2020.02.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
9.Zhou F., Yu T., Du R., Fan G., Liu Y., Liu Z., Xiang J., Wang Y., Song B., Gu X., et al. Clinical course and risk factors for mortality of adult inpatients with COVID-19 in Wuhan, China: a retrospective cohort study. Lancet. 2020;395:1054–1062. doi: 10.1016/S0140-6736(20)30566-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
10.Shen C., Wang Z., Zhao F., Yang Y., Li J., Yuan J., Wang F., Li D., Yang M., Xing L., et al. Treatment of 5 critically ill patients with COVID-19 with convalescent plasma. JAMA. 2020 doi: 10.1001/jama.2020.4783. [DOI] [PMC free article] [PubMed] [Google Scholar]
11.Liu Y., Yang Y., Zhang C., Huang F., Wang F., Yuan J., Wang Z., Li J., Li J., Feng C., et al. Clinical and biochemical indexes from 2019-nCoV infected patients linked to viral loads and lung injury. Sci. China Life Sci. 2020;63(3):364–374. doi: 10.1007/s11427-020-1643-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
12.Ji Y., Ma Z., Peppelenbosch M.P., Pan Q. Potential association between COVID-19 mortality and health-care resource availability. Lancet Glob. Health. 2020;8:e480. doi: 10.1016/S2214-109X(20)30068-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
13.Pawelec G., Barnett Y., Forsey R., Frasca D., Globerson A., McLeod J., Caruso C., Franceschi C., Fülöp T., Gupta S., et al. T cells and aging, January 2002 update. Front. Biosci. 2002;7:d1056–d1183. doi: 10.2741/a831. [DOI] [PubMed] [Google Scholar]
14.Nussing S., Sant S., Koutsakos M., Subbarao K., Nguyen T., Kedzierska K. Innate and adaptive T cells in influenza disease. Front. Med. 2018;12:34–47. doi: 10.1007/s11684-017-0606-8. [DOI] [PubMed] [Google Scholar]
15.World Health Organization Clinical management of severe acute respiratory infection when Novel coronavirus (nCoV) infection is suspected: interim guidance. https://www.who.int/internal-publications-detail/clinical-management-of-severe-acute-respiratory-infection-when-novel-oronavirus-(ncov)-infection-is-suspected
16.The COVID-19 guidelines of the National Health and Health Commission of China (seventh version) https://www.nhc.gov.cn/yzygj/s7653p/202003/46c9294a7dfe4cef80dc7f5912eb1989/files/ce3e6945832a438eaae415350a8ce964.pdf
17.Fisher R.A. Oliver and Boyd; 1954. Statistical Methods for Research Workers. [Google Scholar]

[bib1] 1.Li Q., Guan X., Wu P., Wang X., Zhou L., Tong Y., Ren R., Leung K.S.M., Lau E.H.Y., Wong J.Y., et al. Early transmission dynamics in Wuhan, China, of novel coronavirus-infected pneumonia. N. Engl. J. Med. 2020;382:1199–1207. doi: 10.1056/NEJMoa2001316. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib2] 2.Coronavirus disease 2019 (COVID-19) situation report-79. https://www.who.int/docs/default-source/coronaviruse/situation-reports/20200408-sitrep-79-covid-19.pdf?sfvrsn=4796b143_4

[bib3] 3.Chen N., Zhou M., Dong X., Qu J., Gong F., Han Y., Qiu Y., Wang J., Liu Y., Wei Y., et al. Epidemiological and clinical characteristics of 99 cases of 2019 novel coronavirus pneumonia in Wuhan, China: a descriptive study. Lancet. 2020;395:507–513. doi: 10.1016/S0140-6736(20)30211-7. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib4] 4.Huang C., Wang Y., Li X., Ren L., Zhao J., Hu Y., Zhang L., Fan G., Xu J., Gu X., et al. Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. Lancet. 2020;395:497–506. doi: 10.1016/S0140-6736(20)30183-5. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib5] 5.Wang D., Hu B., Hu C., Zhu F., Liu X., Zhang J., Wang B., Xiang H., Cheng Z., Xiong Y., et al. Clinical characteristics of 138 hospitalized patients with 2019 novel coronavirus–infected pneumonia in Wuhan, China. JAMA. 2020 doi: 10.1001/jama.2020.1585. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib6] 6.Guan W., Ni Z., Hu Y., Liang W.H., Ou C.Q., He J.X., Liu L., Shan H., Lei C.L., Hui D.S.C., et al. Clinical characteristics of coronavirus disease 2019 in China. N. Engl. J. Med. 2020 doi: 10.1056/NEJMoa2002032. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib7] 7.Wu J.T., Leung K., Leung G.M. Nowcasting and forecasting the potential domestic and international spread of the 2019-nCoV outbreak originating in Wuhan, China: a modelling study. Lancet. 2020;395:689–697. doi: 10.1016/S0140-6736(20)30260-9. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib8] 8.Tang B., Bragazzi N.L., Li Q., Tang S., Xiao Y., Wu J. An updated estimation of the risk of transmission of the novel coronavirus (2019-nCov) Infect. Dis. Model. 2020;5:248–255. doi: 10.1016/j.idm.2020.02.001. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib9] 9.Zhou F., Yu T., Du R., Fan G., Liu Y., Liu Z., Xiang J., Wang Y., Song B., Gu X., et al. Clinical course and risk factors for mortality of adult inpatients with COVID-19 in Wuhan, China: a retrospective cohort study. Lancet. 2020;395:1054–1062. doi: 10.1016/S0140-6736(20)30566-3. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib10] 10.Shen C., Wang Z., Zhao F., Yang Y., Li J., Yuan J., Wang F., Li D., Yang M., Xing L., et al. Treatment of 5 critically ill patients with COVID-19 with convalescent plasma. JAMA. 2020 doi: 10.1001/jama.2020.4783. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib11] 11.Liu Y., Yang Y., Zhang C., Huang F., Wang F., Yuan J., Wang Z., Li J., Li J., Feng C., et al. Clinical and biochemical indexes from 2019-nCoV infected patients linked to viral loads and lung injury. Sci. China Life Sci. 2020;63(3):364–374. doi: 10.1007/s11427-020-1643-8. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib12] 12.Ji Y., Ma Z., Peppelenbosch M.P., Pan Q. Potential association between COVID-19 mortality and health-care resource availability. Lancet Glob. Health. 2020;8:e480. doi: 10.1016/S2214-109X(20)30068-1. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib13] 13.Pawelec G., Barnett Y., Forsey R., Frasca D., Globerson A., McLeod J., Caruso C., Franceschi C., Fülöp T., Gupta S., et al. T cells and aging, January 2002 update. Front. Biosci. 2002;7:d1056–d1183. doi: 10.2741/a831. [DOI] [PubMed] [Google Scholar]

[bib14] 14.Nussing S., Sant S., Koutsakos M., Subbarao K., Nguyen T., Kedzierska K. Innate and adaptive T cells in influenza disease. Front. Med. 2018;12:34–47. doi: 10.1007/s11684-017-0606-8. [DOI] [PubMed] [Google Scholar]

[bib15] 15.World Health Organization Clinical management of severe acute respiratory infection when Novel coronavirus (nCoV) infection is suspected: interim guidance. https://www.who.int/internal-publications-detail/clinical-management-of-severe-acute-respiratory-infection-when-novel-oronavirus-(ncov)-infection-is-suspected

[bib16] 16.The COVID-19 guidelines of the National Health and Health Commission of China (seventh version) https://www.nhc.gov.cn/yzygj/s7653p/202003/46c9294a7dfe4cef80dc7f5912eb1989/files/ce3e6945832a438eaae415350a8ce964.pdf

[bib17] 17.Fisher R.A. Oliver and Boyd; 1954. Statistical Methods for Research Workers. [Google Scholar]

PERMALINK

Predicting Illness Severity and Short-Term Outcomes of COVID-19: A Retrospective Cohort Study in China

Chuming Chen

Haihui Wang

Zhichao Liang

Ling Peng

Fang Zhao

Liuqing Yang

Mengli Cao

Weibo Wu

Xiao Jiang

Peiyan Zhang

Yinfeng Li

Li Chen

Shiyan Feng

Jianming Li

Lingxiang Meng

Huishan Wu

Fuxiang Wang

Quanying Liu

Yingxia Liu

Graphical Abstract

Introduction

Results

Demographics and Baseline Characteristics

Table 1.

Laboratory Findings in 417 Confirmed COVID-19 Patients

Table 2.

Demographics and Baseline Characteristics of 249 Discharged Patients

Table 3.

Laboratory Findings of 249 Discharged Patients

Table 4.

Performance of the Auxiliary Tool for Stratifying Risk of Patients

Figure 1.

Figure 2.

Discussion

Material and Methods

Patients

Procedures

Discharge Criteria

Statistical Analysis

Declaration of Interests

Acknowledgments

Contributor Information

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases