Skip to main content
Infectious Diseases of Poverty logoLink to Infectious Diseases of Poverty
. 2022 Feb 17;11:19. doi: 10.1186/s40249-022-00946-4

Comparing different machine learning techniques for predicting COVID-19 severity

Yibai Xiong 1,#, Yan Ma 1,#, Lianguo Ruan 2, Dan Li 3, Cheng Lu 1,, Luqi Huang 4,; the National Traditional Chinese Medicine Medical Team
PMCID: PMC8851750  PMID: 35177120

Abstract

Background

Coronavirus disease 2019 (COVID-19) is still ongoing spreading globally, machine learning techniques were used in disease diagnosis and to predict treatment outcomes, which showed favorable performance. The present study aims to predict COVID-19 severity at admission by different machine learning techniques including random forest (RF), support vector machine (SVM), and logistic regression (LR). Feature importance to COVID-19 severity were further identified.

Methods

A retrospective design was adopted in the JinYinTan Hospital from January 26 to March 28, 2020, eighty-six demographic, clinical, and laboratory features were selected with LassoCV method, Spearman’s rank correlation, experts’ opinions, and literature evaluation. RF, SVM, and LR were performed to predict severe COVID-19, the performance of the models was compared by the area under curve (AUC). Additionally, feature importance to COVID-19 severity were analyzed by the best performance model.

Results

A total of 287 patients were enrolled with 36.6% severe cases and 63.4% non-severe cases. The median age was 60.0 years (interquartile range: 49.0–68.0 years). Three models were established using 23 features including 1 clinical, 1 chest computed tomography (CT) and 21 laboratory features. Among three models, RF yielded better overall performance with the highest AUC of 0.970 than SVM of 0.948 and LR of 0.928, RF also achieved a favorable sensitivity of 96.7%, specificity of 69.5%, and accuracy of 84.5%. SVM had sensitivity of 93.9%, specificity of 79.0%, and accuracy of 88.5%. LR also achieved a favorable sensitivity of 92.3%, specificity of 72.3%, and accuracy of 85.2%. Additionally, chest-CT had highest importance to illness severity, and the following features were neutrophil to lymphocyte ratio, lactate dehydrogenase, and D-dimer, respectively.

Conclusions

Our results indicated that RF could be a useful predictive tool to identify patients with severe COVID-19, which may facilitate effective care and further optimize resources.

Graphical Abstract

graphic file with name 40249_2022_946_Figa_HTML.jpg

Supplementary Information

The online version contains supplementary material available at 10.1186/s40249-022-00946-4.

Keywords: COVID-19, Severity, Machine learning, Support vector machine, Random Forest, Logistic regression

Background

Coronavirus disease 2019 (COVID-19) has been a severe public health event and is still ongoing spreading globally [1, 2]. The main manifestations of COVID-19 include fever, fatigue, and dry cough, most COVID-19 cases are non-severe with favorable outcome [3, 4]. However, patients with severe COVID-19 tend to progress rapidly and experience respiratory failure, respiratory distress syndrome, and septic shock or even death within a short period of time [5], with a high mortality of 53% [6]. Previous studies observed that male, aged over 65, smoking, elevated lactate dehydrogenase, elevated D-dimer, and chest imaging findings associated with COVID-19 severity [710]. However, the studies didn’t demonstrate indicators’ contribution to COVID-19 severity on admission. Hence, it is essential to assess importance of features to disease severity on admission for earlier and more targeted care, further reduce severe disease and prioritize medical resource.

Machine learning as an effective and innovative tool has been reported applications in diabetes [11], cardiovascular diseases [12], cancer [13, 14], sepsis [15], and depression [16], etc., and showed favorable performance. Moreover, machine learning was used to predict COVID-19 diagnosis and treatment outcome, etc. [1721]. However, different machine learning techniques were used to predict COVID-19 severity have not well been reported. Our study aimed to compare different machine learning techniques including random forest (RF), support vector machine (SVM), and logistic regression (LR) to predict COVID-19 severity based on clinical, laboratory, and chest computed tomography (C-CT) features, further identify feature importance to COVID-19 severity. The results can provide reference for clinicians to diagnose and treat patients timely and effectively, and further optimize medical resources, especially in resource-limited areas.

Methods

Study design and participants

A retrospective study was conducted in the JinYinTan Hospital from January 26 to March 28, 2020. Data of demographic, clinical, laboratory, C-CT features, and treatment outcome were reviewed from 287 patients of hospital information system, all cases were laboratory-confirmed by real-time reverse transcriptase polymerase chain reaction assay.

Feature selection

In the present study, 30 features were remained after the initial 86 features were screened by LassoCV in the study. Subsequently, we carried out spearman’s rank correlation to further reduce confounding, features with correlation coefficients greater than 0.4 (r > 0.4)were excluded. For example, white blood cell (WBC) was correlated with neutrophil to lymphocyte ratio (NLR); red blood cell (RBC) and hematocrit (HCT) were both correlated with hemoglobin concentration (HB); albumin (ALB) is associated with cholinesterase (CHE), etc. Additionally, combined with expert opinions, and literature evaluation 7 features were excluded from 30 features, namely WBC, RBC, HCT, ALB, r-glutamyl transpeptidase(r-GT), CHE, creatine kinase (CK), then 23 features were remained (Fig. 1), correlation coefficient between features were less than 0.4 (Fig.2), which were entered into models for further analysis.

Fig. 1.

Fig. 1

Flow chart of feature selection. WBC White blood cell, RBC red blood cell, ALB albumin, r-GT r-glutamyl transpeptidase, CHE cholinesterase, HCT hematocrit, CK creatine kinase

Fig. 2.

Fig. 2

The heat map of correlation between features. Color indicates the value of the correlation coefficient (r). The color intensity is proportional to the correlation coefficient (r), with positive correlations (r > 0) shown and negative correlations (r < 0), lower intensive color indicates lower correlations, in the study, 23 features selected by machine learning techniques, each feature is weakly correlated with each other (r < 0.4). Twenty-three features including C-CT, fever, MT, HR, SBP, HB, NLR, RDW-CV, IMG, IL-6, ESR, IBIL, ALP CysC, LDH, Ca2+, AFU, AMS, RBP, Hs-cTn, MB, PT(s), D-dimer. C-CT chest computed tomography, MT malignant tumor, HR heart rate, SBP systolic blood pressure, NLR neutrophil-to-lymphocyte ratio, HB hemoglobin concentration, RDW-CV red cell volume distribution width, LDH lactate dehydrogenase, IBIL indirect bilirubin, PT prothrombin time, ESR erythrocyte sedimentation rate, AFU α-fucosidase, RBP retinol-Binding protein, IL-6 interleukin-6, Hs-cTn hypersensitive troponin, AMS amylase, CysC cystatin C, IMG immature granulocyte, ALP alkaline phosphatas, MB myoglobin

Twenty-three features included chest computed tomography (C-CT), fever, malignant tumor(MT), heart rate (HR), systolic blood pressure (SBP), hemoglobin concentration (HB), neutrophil-to-lymphocyte ratio (NLR), red cell volume distribution width (RDW-CV), immature granulocyte (IMG), interleukin-6 (IL-6), erythrocyte sedimentation rate (ESR), indirect bilirubin (IBIL), alkaline phosphatas (ALP), cystatin C (CysC), lactate dehydrogenase (LDH), Ca2+, α-fucosidase (AFU), amylase (AMS), retinol-Binding protein (RBP), hypersensitive troponin (Hs-cTn), myoglobin (MB), prothrombin time (PT), D-dimer.

Features and models

Both dichotomous features are containing precisely two distinct values in the form of 0 and 1, for the three classification features such as C-CT, 0, 1 and 2 as three categorical variables were used to represent no lesions, unilateral lesions, and bilateral lesions, respectively; continuous variables were expressed with unit standardization. The 23 features were processed by RF, SVM, and LR modeling to compare the three models in predicting COVID-19 severity on admission.

RF is a classifier that contains multiple decision trees, and the output is determined by the voting results of multiple trees. We used a grid search method to determine the number of classifiers, maximum depth, and minimum number of branch samples in random forest modeling. SVM based on statistical learning theory, is a class of generalized linear classification technique that classifies data binary in a supervised learning manner, and correctly classifies samples by looking for decision boundaries. LR is a generalized linear regression analysis model that uses different kernel functions to estimate data conditions. Similar to random forest, we use grid search method to determine the selection of key parameters of SVM and LR, thereby reducing the training error in modeling.

COVID-19 severity for prediction

COVID-19 severity was determined based on the Diagnosis and Treatment Protocol for COVID-19 (seventh edition) [22]. In our study, given the small sample of 287 patients analyzed from single center, therefore, COVID-19 cases were categorized as non-severe and severe cases. Mild and moderate cases were classified as non-severe cases, severe and critical were classified as severe cases. The outcome variable was coded as binary 0 or 1, and the predictive output variable was defined as severe type (coded as 1) and non-severe type (coded as 0).

Model training and generalization

Of three models, 10-fold cross-validation was used to determine best appropriate data set to further better accuracy on predictions. Firstly, we split the all data into 10 number of folds. Each model has been trained and tested 10 times. For every iteration we selected 1-fold as test data and rest all as training data and ensured that different sets of data were used for model training and testing each time. Finally, we summed up the score for every fold and took the average to get the overall score.

Evaluation of model performance

The model’s performance was evaluated respectively to compare the sensitivity, specificity, accuracy and area under curve (AUC) of receiver operating characteristic (ROC) of different models for severe COVID-19. The cut-off value was determined by the point on ROC curve where the Youden index was highest.

Importance of test types

In order to quantify the importance of clinical, laboratory, and C-CT features on the predicted outcomes, the best performance model was used to provide relative feature importance by computing the contribution of each input feature toward the predictive outcome.

Statistical analysis

Categorical and continuous variables were expressed as n (%) and median (interquartile range, IQR), respectively. The Mann Whitney U test, χ2 test, or Fisher’s exact test were used to compare differences between severe and non-severe cases, as appropriately. The LR, SVM, and RF models were all performed, and AUC were calculated by using python 3.8.3 (Python Software Foundation, Delaware, USA) and anaconda 4.10.1 (Red Hat, Raleigh, North Carolina, USA). Main python module was shown in Additional file 1.

Results

Cohort characteristics

A total of 287 patients enrolled in our study. The median age was 60.0 years (IQR: 49.0–68.0 years) with 63.0 years (IQR: 55.5–70.0 years) for severe cases and 57.0 years (IQR: 46.7–67.0 years) for non-severe cases. A higher proportion (56.4%) was male, and the proportion of severe and non-severe cases were 36.6% and 63.4%, respectively. The most common symptoms reported were cough (61.3%) and fever (30.7%), 23 features of COVID-19 patients in the cohort are demonstrated in Table 1.

Table 1.

Twenty-three features of COVID-19 patients in the cohort

Variables All cases Non-severe cases Severe cases P
(n = 287) (n = 182) (n = 105)
Fever 88 (30.7) 57 (31.3) 31 (29.5) 0.751
MT 16 (5.6) 11 (6.0) 5 (4.8) 0.648
HR, beats/min 88 (80–99) 87 (80–98) 88 (84–99) 0.122
SBP, mmHg 127 (117–138) 125 (115–136) 131 (121–138) 0.008
Laboratory findings
 NLR 3.28 (2.17–5.77) 2.71 (1.70–4.15) 5.73 (3.17–9.97)  < 0.001
 HB 127.0 (117.0–136.0) 128.0 (119.0–136.0) 123 (114.0–136.0) 0.109
 RDW-CV 12.4 (11.9–12.9) 12.3 (11.9–12.8) 12.5 (11.9–13.0) 0.032
 LDH 258.0 (200.0–346.5) 231.0 (184.7–276.5) 343.0 (261.0–452.0)  < 0.001
 IBIL 8.4 (6.1–11.6) 8.9 (6.6–11.8) 7.8 (5.6–10.80) 0.048
 D-Dimer 0.7 (0.4–1.5) 0.5 (0.3–0.9) 1.1 (0.6–1.5)  < 0.001
 PT 11.3 (10.7–12.0) 11.2 (10.6–11.7) 11.7 (10.8–12.6)  < 0.001
 Ca2+ 2.10 (1.99–2.17) 2.09 (22.0–52.25) 2.01 (1.93–2.10)  < 0.001
 ESR 43.0 ( (26.0–58.0) 40.5 (18.0–27.0) 50.3 (35.0–65.5)  < 0.001
 AFU 23.0 (19.0–27.0) 23.0 (18.0–27.0) 23.0 (20.0–27.0) 0.508
 RBP 26.4 (19.0–40.4) 26.4 (20.6–40.5) 26.4 (17.1–40.0) 0.433
 IL-6 8.3 (6.5–11.7) 8.2 (6.5–10.2) 9.2 (7.2–12.9) 0.001
 Hs-cTn 3.6 (1.5–8.3) 2.9 (1.1–5.8) 6.2 (2.8–12.4)  < 0.001
 AMS 62.0 (49.0–79.5) 62.0 (49.3–74.0) 68.0 (49.0–97.0) 0.078
 CysC 0.85 (0.75–1.03) 0.83 (0.73–0.94) 0.95 (0.78–1.16)  < 0.001
 IMG 0.01 (0.01–0.04) 0.01 (0.00–0.03) 0.03 (0.01–0.11)  < 0.001
 ALP 76.0 (59.5–94.0) 76.0 (59.3–95.8) 75.0 (60.0–93.0) 0.806
 MB 42.4 (30.7–64.5) 39.1 (28.0–53.0) 50.1 (38.8–87.3)  < 0.001
C-CTa  < 0.001
 0 63 (22.0%) 63 (34.6%) 0 (0%)
 1 29 (10.1%) 23 (12.6%) 6 (5.7%)
 2 195 (67.9%) 96 (52.7%) 99 (94.3%)

Data were presented median (IQR) or n (%). P values were estimated by comparing variables between severe and non-severe cases

a“0” denotes” no lung lesions”, “1” denotes “unilateral lung lesions”, and “2” denotes “bilateral lung lesions”

MT Malignant tumor, HR Heart rate, SBP Systolic blood pressure, NLR Neutrophil-to-lymphocyte ratio, HB Hemoglobin concentration, RDW-CV Red cell volume distribution width, LDH Lactate dehydrogenase, IBIL Indirect bilirubin, PT Prothrombin time, ESR Erythrocyte sedimentation rate, AFU α-fucosidase, RBP Retinol-Binding protein, IL-6 Interleukin-6, Hs-cTn Hypersensitive troponin, AMS Amylase, CysC cystatin C, IMG Immature granulocyte, ALP Alkaline phosphatas, MB Myoglobin, C-CT Chest computed tomography

Predictive performance

We further analyzed the predictive performance of RF, SVM and LR with AUC, sensitivity, specificity, and accuracy (Table 2, Fig. 3), in terms of predictive performance among the three models, we observed that the overall better performance by AUC of 0.970 were RF for predicting COVID-19 severity at admission compared to SVM with AUC of 0.948 and LR with AUC of 0.928. Moreover, RF also achieved a favorable sensitivity of 96.7%, specificity of 69.5%, and accuracy of 84.5%. SVM had sensitivity of 93.9%, specificity of 79.0%, and accuracy of 88.5%. LR model had sensitivity of 92.3%, specificity of 72.3%, and accuracy of 85.2%.

Table 2.

Predictive performance for COVID-19 severity

Model AUC Sensitivity (%) Specificity (%) Accuracy (%) Cut off YI
RF 0.970 96.7 69.5 84.5 0.62 0.662
SVM 0.948 93.9 79.0 88.5 0.795 0.729
LR 0.928 92.3 72.3 85.2 0.57 0.646

AUC area under curve, YI youden index, RF random Forest, SVM support vector machine, LR logistic regression

Fig. 3.

Fig. 3

ROC curves of RF, SVM, and LR. ROC receiver operating characteristic, AUC area under curve, RF random forest, SVM support vector machine, LR logistic regression

Feature importance

We further analyzed the top 23 features ranked by relative importance to COVID-19 severity by using RF with best performance. C-CT had highest importance to COVID-19 severity, and the following were NLR, LDH count, and D-Dimer, respectively (Fig. 4).

Fig. 4.

Fig. 4

Feature importance: The top 23 features ranked by relative importance to COVID-19 severity by RF. C-CT chest computed tomography, MT malignant tumor, HR heart rate, SBP systolic blood pressure, NLR neutrophil-to-lymphocyte ratio, HB hemoglobin concentration, RDW-CV red cell volume distribution width, LDH lactate dehydrogenase, IBIL indirect bilirubin, PT prothrombin time, ESR erythrocyte sedimentation rate, AFU α-fucosidase, RBP retinol-binding protein, IL-6 interleukin-6, Hs-cTn hypersensitive troponin, AMS amylase, CysC cystatin C, IMG immature granulocyte, ALP alkaline phosphatas, MB myoglobin

Discussion

Previous studies observed that RF, LR, SVM, gradient boosted decision tree, and neural network to predict COVID-19 [2325]. Schwab et al. demonstrated that RF had highest predictive performance for predicting hospital admissions for COVID-19 patients, and critical care admissions for COVID-19 cases in terms of AUC compared to LR, SVM, neural network, and gradient boosting [25]. In the present study, we found that RF could predict COVID-19 severity with best performance compared to SVM and LR. The results highlight the possibility that high-performance of RF are able to predict disease severity, the findings can provide reference to monitor parameters effectively and further optimize medical resource.

Chest CT is a widely used imaging test, which can be useful for initial triage of suspected COVID-19 patients. Previous studies demonstrated that chest CT or X-rays findings were associated with COVID-19 severity based on machine learning [2630], our data observed that chest CT findings with highest importance scores was for diagnosis of severe COVID-19 on admission by RF model, which can help clinicians a fast diagnosis and severity assessment of COVID-19 even before molecular test results are available or laboratory testing is limited. Wang et al. reported that COVID-19 clinical severity could be assessed based on the proposed chest lung-lesion automated deep-learning model [26], given deep learning was likely to be over-fitting for small sample, therefore, which was not used in the study. Following the C-CT ranked feature importance scores were NLR, LDH, D-Dimer, Ca2+, ESR, and AFU, respectively, were confirmed in previous studies [3032]. Ponti et al. reported that NLR was closely in relationship with coagulation cascades in disseminated intravascular coagulation and acute respiratory distress syndrome in severe COVID-19 patients [33]. LDH and D-dimer are consistently reported among hospitalized severe COVID-19 patients [7], while gradual development during the disease course is particularly related to disease worsening in adults and children [34, 35]. However, compared with previous researches, AFU and HB are rarely mentioned in COVID-19 severity. AFU is essential in viral and bacterial infections including the cell-cell adhesion processes that mediate inflammation [36, 37]. Studies illustrated that AFU may be potentially upregulated by chemokines in the later stages of inflammation, such as COVID-19 [37, 38]. We also found that AFU is a significant lab indicator to distinguish severe illness, which may be a possible therapeutic target in the future. In other studies, hemoglobin and red blood cell count were found lower in COVID-19 patients, however, with no significant difference between survivors and non-survivors [39]. COVID-19 patients might suffer from hyper-metabolic states, such as septic shock and multiple organ dysfunction syndromes, which needs an increasing capability of hemoglobin to support the peripheral tissue demands for oxygen [40]. The ability to meet the oxygen demands of the peripheral tissues caused by the COVID-19 infection becomes the leading role to decide the prognosis, according to recent evidence [41].

Our study has several limitations. Firstly, the small sample size from a single study site is a major limitation of the present study, which may undermine reliability of our analysis. Secondly, the data was retrospective collected, some data were incomplete or missing (such as smoking and drinking, and educational level), which were not further analyzed in the study. Further large sample, multi-center studies should be conducted to confirm our results.

Conclusions

Our study indicates that RF may be a prioritization tool in predicting COVID-19 severity with favorable overall performance at admission. The prediction model can potentially assist clinicians to facilitate severe COVID-19 patients’ identification promptly and apply treatment timely, further optimize potentially healthcare resources.

Supplementary Information

Acknowledgements

We thank Prof. Songwang Wang and Yewu Zhang, statistic experts from China CDC, for statistics guidance. We also acknowledge the work and contribution of all participants in the study.

Abbreviations

AFU

α-Fucosidase

ALB

Albumin

ALP

ALkaline phosphatas

AMS

Amylas

ARDS

Acute respiratory distress syndrome

AUC

Area under curve

CHE

Cholinesterase

CK

Creatine kinase

COVID-19

Coronavirus disease 2019

CT

Computed tomography

CysC

Cystatin C

DIC

Disseminated intravascular coagulation

ESR

Erythrocyte sedimentation rate

HB

Hemoglobin concentration

HCT

Hematocrit

HR

Heart rate

hs-cTn

Hypersensitive troponin

IBIL

Indirect bilirubin

IL-6

Interleukin-6

IMG

Immature granulocyte

LASSO

Least Absolute Shrinkage and Selectionator Operator

LDH

Lactate dehydrogenase

LR

Logistic regression

MB

Myoglobin

MT

Malignant tumor

NLR

Neutrophil-to-lymphocyte ratio

PT

Prothrombin time

RBP

Retinol-binding protein

RBC

Red blood cell

RDW-CV

Red cell volume distribution width

RF

Random forest

ROC

Receiver operating characteristic

γ-GT

γ-Glutamyl transpeptidase

SBP

Systolic blood pressure

SVM

Support vector machine

WBC

White blood cell

YI

Youden index

Authors' contributions

YB-X and YM drafted the manuscript. YM and DL participated in the design of the study and performed the statistical analysis. RL-G and the National Traditional Chinese Medicine Medical Team carried out the verification of data and the patients’ enrollment. LC and LQ-H conceived of the study, and revised the manuscript. All authors read and approved the final manuscript.

Funding

This work was supported by National Administration of Traditional Chinese Medicine: 2019 Project of building evidence-based practice capacity for TCM (Grant No. 2019XZZX-LG) and Innovation Team and Talents Cultivation Program of National Administration of Traditional Chinese Medicine (ZYYCXTD-D-202005).

Availability of data and materials

The datasets generated and/or analyzed during the present study are not publicly available due to sensitivity of the patients’ personal information but are available from the corresponding author on reasonable request.

Declarations

Ethics approval and consent to participate

Ethics approval was granted by China Academy of Chinese Medical Sciences (N. CACMS-IRB2020-001-1).

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Footnotes

Yibai Xiong and Yan Ma have contributed equally to this work

Contributor Information

Cheng Lu, Email: lv_cheng0816@163.com.

Luqi Huang, Email: Huangluqi01@126.com.

References

  • 1.Zhu J, Yan W, Zhu L, Liu J. COVID-19 pandemic in BRICS countries and its association with socio-economic and demographic characteristics, health vulnerability, resources, and policy response. Infect Dis Poverty. 2021;10:97. doi: 10.1186/s40249-021-00881-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Ma Y, Mishra SR, Han XK, Zhu DS. The relationship between time to a high COVID-19 response level and timing of peak daily incidence: an analysis of governments' Stringency Index from 148 countries. Infect Dis Poverty. 2021;10:96. doi: 10.1186/s40249-021-00880-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Geng MJ, Wang LP, Ren X, Yu JX, Chang ZR, Zheng CJ, et al. Risk factors for developing severe COVID-19 in China: an analysis of disease surveillance data. Infect Dis Poverty. 2021;10:48. doi: 10.1186/s40249-021-00820-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Zhou F, Yu T, Du R, Fan G, Liu Y, Liu Z, et al. Clinical course and risk factors for mortality of adult inpatients with COVID-19 in Wuhan, China: a retrospective cohort study. Lancet. 2020;395:1054–1062. doi: 10.1016/S0140-6736(20)30566-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Hu B, Guo H, Zhou P, Shi ZL. Characteristics of SARS-CoV-2 and COVID-19. Nat Rev Microbiol. 2021;19:141–154. doi: 10.1038/s41579-020-00459-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Li X, Xu S, Yu M, Wang K, Tao Y, Zhou Y, et al. Risk factors for severity and mortality in adult COVID-19 inpatients in Wuhan. J Allergy Clin Immunol. 2020;146:110–118. doi: 10.1016/j.jaci.2020.04.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Zheng Z, Peng F, Xu B, Zhao J, Liu H, Peng J, et al. Risk factors of critical & mortal COVID-19 cases: a systematic literature review and meta-analysis. J Infect. 2020;81:e16–e25. doi: 10.1016/j.jinf.2020.04.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Xiong Y, Ma Y, Tian Y, Zhang C, Yang W, Liu B, et al. A longitudinal cohort study using a modified child-pugh score to escalate respiratory support in COVID-19 patients—Hubei Province, China, 2020. China CDC Wkly. 2021;3:423–429. doi: 10.46234/ccdcw2021.113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Bai XY, Xin TY, Yan M, Wei Y, Bin L, Guo RL, et al. Factors defining the development of severe illness in patients with COVID-19: a retrospective study. Biomed Environ Sci. 2021;34:984. doi: 10.3967/bes2021.117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Xu PP, Tian RH, Luo S, Zu ZY, Fan B, Wang XM, et al. Risk factors for adverse clinical outcomes with COVID-19 in China: a multicenter, retrospective, observational study. Theranostics. 2020;10:6372–6383. doi: 10.7150/thno.46833. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Silva K, Lee WK, Forbes A, Demmer RT, Barton C, Enticott J. Use and performance of machine learning models for type 2 diabetes prediction in community settings: a systematic review and meta-analysis. Int J Med Inform. 2020;143:104268. doi: 10.1016/j.ijmedinf.2020.104268. [DOI] [PubMed] [Google Scholar]
  • 12.Krittanawong C, Virk HUH, Bangalore S, Wang Z, Johnson KW, Pinotti R, et al. Machine learning prediction in cardiovascular diseases: a meta-analysis. Sci Rep. 2020;10:16057. doi: 10.1038/s41598-020-72685-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Lu W, Fu D, Kong X, Huang Z, Hwang M, Zhu Y, et al. FOLFOX treatment response prediction in metastatic or recurrent colorectal cancer patients via machine learning algorithms. Cancer Med. 2020;9:1419–1429. doi: 10.1002/cam4.2786. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Castaldo R, Cavaliere C, Soricelli A, Salvatore M, Pecchia L, Franzese M. Radiomic and genomic machine learning method performance for prostate cancer diagnosis: systematic literature review. J Med Internet Res. 2021;23:e22394. doi: 10.2196/22394. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Fleuren LM, Klausch TLT, Zwager CL, Schoonmade LJ, Guo T, Roggeveen LF, et al. Machine learning for the prediction of sepsis: a systematic review and meta-analysis of diagnostic test accuracy. Intensive Care Med. 2020;46:383–400. doi: 10.1007/s00134-019-05872-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Lee Y, Ragguett RM, Mansur RB, Boutilier JJ, Rosenblat JD, Trevizol A, et al. Applications of machine learning algorithms to predict therapeutic outcomes in depression: A meta-analysis and systematic review. J Affect Disord. 2018;241:519–532. doi: 10.1016/j.jad.2018.08.073. [DOI] [PubMed] [Google Scholar]
  • 17.Kwon JM, Kim KH, Jeon KH, Lee SE, Lee HY, Cho HJ, et al. Artificial intelligence algorithm for predicting mortality of patients with acute heart failure. PLoS ONE. 2019;14:e0219302. doi: 10.1371/journal.pone.0219302. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Schalekamp S, Huisman M, van Dijk RA, Boomsma MF, Freire Jorge PJ, de Boer WS, et al. Model-based prediction of critical illness in hospitalized patients with COVID-19. Radiology. 2021;298:E46–e54. doi: 10.1148/radiol.2020202723. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Wynants L, Van Calster B, Collins GS, Riley RD, Heinze G, Schuit E, et al. Prediction models for diagnosis and prognosis of covid-19: systematic review and critical appraisal. BMJ. 2020;369:m1328. doi: 10.1136/bmj.m1328. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Wu G, Yang P, Xie Y, Woodruff HC, Rao X, Guiot J, et al. Development of a clinical decision support system for severity risk prediction and triage of COVID-19 patients at hospital admission: an international multicentre study. Eur Respir J. 2020;56:2001104. doi: 10.1183/13993003.01104-2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Li WT, Ma J, Shende N, Castaneda G, Chakladar J, Tsai JC, et al. Using machine learning of clinical data to diagnose COVID-19: a systematic review and meta-analysis. BMC Med Inform Decis Mak. 2020;20:247. doi: 10.1186/s12911-020-01266-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Diagnosis and Treatment Protocol for Novel Coronavirus Pneumonia (Trial Version 7). Chin Med J (Engl). 2020; 133:1087–1095. [DOI] [PMC free article] [PubMed]
  • 23.Yue H, Yu Q, Liu C, Huang Y, Jiang Z, Shao C, et al. Machine learning-based CT radiomics method for predicting hospital stay in patients with pneumonia associated with SARS-CoV-2 infection: a multicenter study. Ann Transl Med. 2020;8:859. doi: 10.21037/atm-20-3026. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Gao Y, Cai GY, Fang W, Li HY, Wang SY, Chen L, et al. Machine learning based early warning system enables accurate mortality risk prediction for COVID-19. Nat Commun. 2020;11:5033. doi: 10.1038/s41467-020-18684-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Schwab P, DuMont SA, Dietz B, Bauer S. Clinical predictive models for COVID-19: systematic study. J Med Internet Res. 2020;22:e21439. doi: 10.2196/21439. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Wang G, Liu X, Shen J, Wang C, Li Z, Ye L, et al. A deep-learning pipeline for the diagnosis and discrimination of viral, non-viral and COVID-19 pneumonia from chest X-ray images. Nat Biomed Eng. 2021;5:509–521. doi: 10.1038/s41551-021-00704-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Li L, Qin L, Xu Z, Yin Y, Wang X, Kong B, et al. Using artificial intelligence to detect COVID-19 and community-acquired pneumonia based on pulmonary CT: evaluation of the diagnostic accuracy. Radiology. 2020;296:e65–e71. doi: 10.1148/radiol.2020200905. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Bouchareb Y, Moradi Khaniabadi P, Al Kindi F, Al Dhuhli H, Shiri I, Zaidi H, et al. Artificial intelligence-driven assessment of radiological images for COVID-19. Comput Biol Med. 2021;136:104665. doi: 10.1016/j.compbiomed.2021.104665. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Reeves RA, Pomeranz C, Gomella AA, Gulati A, Metra B, Hage A, et al. Performance of a severity score on admission chest radiography in predicting clinical outcomes in hospitalized patients with coronavirus disease (COVID-19) AJR Am J Roentgenol. 2021;217:623–632. doi: 10.2214/AJR.20.24801. [DOI] [PubMed] [Google Scholar]
  • 30.Chen Y, Ouyang L, Bao FS, Li Q, Han L, Zhang H, et al. A multimodality machine learning approach to differentiate severe and nonsevere COVID-19: Model development and validation. J Med Internet Res. 2021;23:e23948. doi: 10.2196/23948. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Liu J, Liu Y, Xiang P, Pu L, Xiong H, Li C, et al. Neutrophil-to-lymphocyte ratio predicts critical illness patients with 2019 coronavirus disease in the early stage. J Transl Med. 2020;18:206. doi: 10.1186/s12967-020-02374-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Feng DY, Zhou YQ, Zhou M, Zou XL, Wang YH, Zhang TT. Risk factors for mortality due to ventilator-associated pneumonia in a Chinese hospital: a retrospective study. Med Sci Monit. 2019;25:7660–7665. doi: 10.12659/MSM.916356. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Ponti G, Maccaferri M, Ruini C, Tomasi A, Ozben T. Biomarkers associated with COVID-19 disease progression. Crit Rev Clin Lab Sci. 2020;57:389–399. doi: 10.1080/10408363.2020.1770685. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Terpos E, Ntanasis-Stathopoulos I, Elalamy I, Kastritis E, Sergentanis TN, Politou M, et al. Hematological findings and complications of COVID-19. Am J Hematol. 2020;95:834–847. doi: 10.1002/ajh.25829. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Zhou MY, Xie XL, Peng YG, Wu MJ, Deng XZ, Wu Y, et al. From SARS to COVID-19: what we have learned about children infected with COVID-19. Int J Infect Dis. 2020;96:710–714. doi: 10.1016/j.ijid.2020.04.090. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Tu Z, Lin YN, Lin CH. Development of fucosyltransferase and fucosidase inhibitors. Chem Soc Rev. 2013;42:4459–4475. doi: 10.1039/c3cs60056d. [DOI] [PubMed] [Google Scholar]
  • 37.Liu T, Liu R, Zhu L, Zou X, Guan H, Xu Z. Development of a UHPLC-MS method for inhibitor screening against α-l-1,3-fucosidase. Anal Bioanal Chem. 2019;411:1467–1477. doi: 10.1007/s00216-019-01575-x. [DOI] [PubMed] [Google Scholar]
  • 38.Ali S, Jenkins Y, Kirkley M, Dagkalis A, Manivannan A, Crane IJ, et al. Leukocyte extravasation: an immunoregulatory role for alpha-l-fucosidase? J Immunol. 2008;181:2407–2413. doi: 10.4049/jimmunol.181.4.2407. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Taneri PE, Gómez-Ochoa SA, Llanaj E, Raguindin PF, Rojas LZ, Roa-Díaz ZM, et al. Anemia and iron metabolism in COVID-19: a systematic review and meta-analysis. Eur J Epidemiol. 2020;35:763–773. doi: 10.1007/s10654-020-00678-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Lin GL, McGinley JP, Drysdale SB, Pollard AJ. Epidemiology and immune pathogenesis of viral sepsis. Front Immunol. 2018;9:2147. doi: 10.3389/fimmu.2018.02147. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Gattinoni L, Coppola S, Cressoni M, Busana M, Rossi S, Chiumello D. COVID-19 does not lead to a "typical" acute respiratory distress syndrome. Am J Respir Crit Care Med. 2020;201:1299–1300. doi: 10.1164/rccm.202003-0817LE. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Data Availability Statement

The datasets generated and/or analyzed during the present study are not publicly available due to sensitivity of the patients’ personal information but are available from the corresponding author on reasonable request.


Articles from Infectious Diseases of Poverty are provided here courtesy of BMC

RESOURCES