Abstract
The SARS-CoV2 virus, which causes COVID-19 (coronavirus disease) has become a pandemic and has expanded all over the world. Because of increasing number of cases day by day, it takes time to interpret the data thus the limitations in terms of both treatment and findings are emerged. Due to such limitations, the need for clinical decisions making system with predictive algorithms has arisen. Predictive algorithms could potentially ease the strain on healthcare systems by identifying the diseases. In this study, we design clinical predictive models that estimate, using artificial intelligence and data, which patients are susceptible to receive a COVID-19 disease. To evaluate the predictive performance of our models, accuracy, AUROC, and scores calculated. From 12,727 individuals, models were tested with basic information (sex, age) and the patient’s type of case, which is the combination of their symptoms, their travel during the last 14 days, their contact with an infected person or their participation in a festival requiring a gathering. We used 5 machine learning algorithms (LR, SVM, k-NN, RF, XGBoost) and 1 deep learning algorithm (ANN). Our models were validated with train-test split approach. The experimental results indicate that our predictive models identify patients that have COVID-19 disease at an accuracy of 73% and AUC of 69%. It is observed that predictive models trained on patients’ basic information and type of case could be used to predict COVID-19 infection in Senegal and can be helpful for medical experts to optimize the resources efficiently.
Keywords: SARS-CoV2, Coronavirus, Machine Learning, Deep Learning, Artificial Intelligence
References
- 1.World Health Organization, Report of the WHO-China joint mission on coronavirus disease (COVID-19). 2020 https://www.who.int/docs/default-source/coronaviruse/who-china-joint-mission-on-covid-19-final-report.pdf.
- 2.Zoabi, Y. Deri-Rovoz, S. & Shomron, N. Machine learning-based prediction of COVID-19 diagnosis based on symptoms. 2021 10.1038/s41746-020-00372-6. [DOI] [PMC free article] [PubMed]
- 3.Dong, E., Du, H. & Gardner, L. An interactive web-based dashboard to track COVID-19 in real time. Lancet Infect. Dis. 10.1016/S14733099(20)30120-1 (2020). [DOI] [PMC free article] [PubMed]
- 4.World Health Organization, Health topics, coronavirus. 2020 https://www.who.int/health-topics/coronavirus#tab=tab_3.
- 5.National Institute of Infection Diseases, Field briefing: diamond princess COVID-19 cases. 2020 https://www.niid.go.jp/niid/en/2019-ncov-e/9407-covid-dp-fe-01.html.
- 6.Del Rio C, Malani PN. Novel coronavirus – important information for clinicians. J Am Med Assoc 2019;323(11):2020. doi:10.1001/jama.2020.1490 [DOI] [PubMed]
- 7.Wang D, et al. Clinical characteristics of 138 hospitalized patients with 2019 novel coronavirus-infected pneumonia in Wuhan, China. J Am Med Assoc2020;323(11) DOI: 1061-1069. doi:10.1001/jama.2020.1585. [DOI] [PMC free article] [PubMed]
- 8.Alakus, B.T., Turkoglu, I. Comparison of deep learning approaches to predict COVID-19 infection. 2020 10.1016/j.chaos.2020.110120 [DOI] [PMC free article] [PubMed]
- 9.Gavrilova, Y. Artificial Intelligence vs Machine Learning vs Deep Learning: Essentials. 2020. https://serokell.io/blog/ai-ml-dl-difference
- 10.Piryonesi, S. Madeh; El-Diraby, Tamer E. (2020-03-01). “Data Analytics in Asset Management: Cost-Effective Prediction of the Pavement Condition Index". Journal of Infrastructure Systems. 26 (1): 04019036. doi:10.1061/(ASCE)IS.1943-555X.0000512. ISSN 1943-555X.
- 11.Hastie, T.; Tibshirani, R.; Friedman, J.H. (2009). “10. Boosting and Additive Trees". The Elements of Statistical Learning (2nd ed.). New York: Springer. pp. 337–384. ISBN 978-0-387-84857-0. Archived from the original on 2009-11-10
- 12.Piryonesi, S. Madeh; El-Diraby, Tamer E. (2021-02-01). “Using Machine Learning to Examine Impact of Type of Performance Indicator on Flexible Pavement Deterioration Modeling". Journal of Infrastructure Systems. 27 (2): 04021005. doi:10.1061/(ASCE)IS.1943-555X.0000602. ISSN 1076-0342.
- 13.Ho, Tin Kam (1995). Random Decision Forests (PDF). Proceedings of the 3rd International Conference on Document Analysis and Recognition, Montreal, QC, 14–16 August 1995. pp. 278–282. Archived from the original (PDF) on 17 April 2016. Retrieved 5 June 2016.
- 14.Ho TK (1998). “The Random Subspace Method for Constructing Decision Forests” (PDF). IEEE Transactions on Pattern Analysis and Machine Intelligence. 20 (8): 832–844. doi:10.1109/34.709601.
- 15.Hastie, Trevor; Tibshirani, Robert; Friedman, Jerome (2008). The Elements of Statistical Learning (2nd ed.). Springer. ISBN 0-387-95284-5.
- 16.Piryonesi S. Madeh; El-Diraby Tamer E. (2020-06-01). “Role of Data Analytics in Infrastructure Asset Management: Overcoming Data Size and Quality Problems". Journal of Transportation Engineering, Part B: Pavements. 146 (2): 04020022. doi:10.1061/JPEODX.0000175
- 17.Piryonesi, S. Madeh; El-Diraby, Tamer E. (2021-02-01). “Using Machine Learning to Examine Impact of Type of Performance Indicator on Flexible Pavement Deterioration Modeling". Journal of Infrastructure Systems. 27 (2): 04021005. doi:10.1061/(ASCE)IS.1943-555X.0000602. ISSN 1076-0342.
- 18.Fix, Evelyn; Hodges, Joseph L. (1951). Discriminatory Analysis. Nonparametric Discrimination: Consistency Properties (PDF)(Report). USAF School of Aviation Medicine, Randolph Field, Texas.
- 19.Altman Naomi S. “An introduction to kernel and nearest-neighbor nonparametric regression” (PDF) The American Statistician. 1992;46(3):175–185. [Google Scholar]
- 20.Piryonesi S. Madeh; El-Diraby Tamer E. (2020-06-01). “Role of Data Analytics in Infrastructure Asset Management: Overcoming Data Size and Quality Problems". Journal of Transportation Engineering, Part B: Pavements. 146 (2): 04020022. doi:10.1061/JPEODX.0000175
- 21.Hastie, Trevor. (2001). The elements of statistical learning: data mining, inference, and prediction: with 200 full-color illustrations. Tibshirani, Robert., Friedman, J.H. (Jerome H.). New York: Springer. ISBN 0-387-95284-5. OCLC 46809224
- 22.Tolles, Juliana; Meurer, William J (2016). “Logistic Regression Relating Patient Characteristics to Outcomes". JAMA. 316 (5):533–4. doi:10.1001/jama.2016.7653. ISSN 0098-7484. OCLC 6823603312. PMID 27483067 [DOI] [PubMed]
- 23.Batista, A.F., Miraglia, J.L., Donato, T.H.R., and Filho, A.D.P.C., COVID-19 diagnosis prediction in emergency care patients: a machine learning approach, medRxiv, 2020. doi: 10.1101/2020.04.04.20052092.
- 24.Mandrekar JN. Receiver operating characteristic curve in diagnostic test assessment. J Thoracic Oncol 2010;5(9):1315–16. doi:10.1097/JTO.0b013e3181ec173d. [DOI] [PubMed]
- 25.Hajian-Tilaki K. Receiver operating characteristic (ROC) curve analysis for medical diagnostic test evaluation. Caspian J Intern Med 2013;4(2):627–35. [PMC free article] [PubMed]
- 26.Kamarudin AN, Cox T, Kolamunnaage-Dona R. Time-dependent ROC curveanalysis in medical research: current methods and applications. BMC Med Res Methodol 2017;17(53). doi:10.1186/s12874-017-0332-6. [DOI] [PMC free article] [PubMed]
- 27.Wynants L, Calster BV, Collins GS, Riley RD, Heinze G, Schuit E, et al. Prediction model for diagnosis and prognosis of covid-19: systematic review and critical appraisal. BMJ 2020;369. doi:10.1136/bmj.m1328. [DOI] [PMC free article] [PubMed]
- 28.Pierce R. Evaluating information: validity, reliability, accuracy, triangulation. Res Methods Polit 2008:79–99. doi:10.4135/9780857024589.
- 29.Jiang X, Coffee M, Bari A, Wang J, Jiang X, et al. Towards an artificial intelligence framework for data-driven prediction of coronavirus clinical severity. Compu Mater Continua 2020;63(1):537–51. doi:10.32604/cmc.2020.010691
- 30.Schwab, P., Schütte, A.D., Dietz, B., and Bauer, S. “predCOVID-19: a systematic study of clinical predictive models for coronavirus disease 2019, arXiv:2005. 08302, 2020.
- 31.World Health Organization. Laboratory testing for coronavirus disease 2019 (COVID-19) in suspected human cases: interim guidance. https://www.who.int/publications-detail/laboratory-testing-for-2019-novel-coronavirus-in-suspected-human-cases-20200117. (Updated on March 19, 2020).
- 32.Wölfel R, Corman VM, Guggemos W, et al. Virological assessment of hospitalized patients with COVID-2019. Nature 2020;581:465–9. doi:10.1038/s41586-020-2196-x. [DOI] [PubMed]
- 33.Wang, W., Xu, Y., Gao, R., and et al. “Detection of SARS-CoV-2 in different types of clinical specimens,” JAMA, 323(18), 1843–4, 220. doi: 10.1001/jama.2020.3786. [DOI] [PMC free article] [PubMed]
- 34.Fang Y, Zhang H, Xie J, et al. Sensitivity of chest CT for COVID-19: comparison to RT-PCR. Radiology 2020. doi:10.1148/radiol.2020200432. [DOI] [PMC free article] [PubMed]
