Abstract
Graph analytics are now considered the state-of-the-art in many applications of communities detection. The combination between the graph’s definition in mathematics and the graphs in computer science as an abstract data structure is the key behind the success of graph-based approaches in machine learning. Based on graphs, several approaches have been developed such as shortest path first (SPF) algorithms, subgraphs extraction, social media analytics, transportation networks, bioinformatic algorithms, etc. While SPF algorithms are widely used in optimization problems, Spectral clustering (SC) algorithms have overcome the limits of the most state-of-art approaches in communities detection. The purpose of this paper is to introduce a graph-based approach of communities detection in the novel coronavirus Covid-19 countries’ datasets. The motivation behind this work is to overcome the limitations of multiclass classification, as SC is an unsupervised clustering algorithm, there is no need to predefine the output clusters as a preprocessing step. Our proposed approach is based on a previous contribution on an automatic estimation of the k number of the output clusters. Based on dynamic statistical data for more than 200 countries, each cluster is supposed to group countries having similar behaviors of Covid-19 propagation.
Keywords: Covid-19, Coronavirus, Machine learning, Graph analytics, Spectral clustering, Communities detection
References
- 1.Yanga J., Zhenga Y., Goua X. Prevalence of comorbidities and its effects in coronavirus disease 2019 patients: a systematic review and meta-analysis. Int. J. Infect. Dis. 2020;94:91–95. doi: 10.1016/j.ijid.2020.03.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Sohrabi C. World Health Organization declares global emergency: A review of the 2019 novel coronavirus (COVID-19) International Journal of Surgery. 2020;76:71–76. doi: 10.1016/j.ijsu.2020.02.034. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Liu P., Tan X.-z. 2019 novel coronavirus (2019-nCoV) pneumonia. Radiology. 2020;295:19. doi: 10.1148/radiol.2020200257. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Wang C., Horby P.W., Hayden F.G., Gao G.F. A novel coronavirus outbreak of global health concern. The Lancet. 2020;395:470–473. doi: 10.1016/S0140-6736(20)30185-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Pan L., Mu M., Yang P., Sun Y., Wang R., Yan J. Clinical characteristics of COVID-19 patients with digestive symptoms in Hubei, China: a descriptive, cross-sectional, multicenter study. The American journal of gastroenterology, vol. 2020;115(5):766–773. doi: 10.14309/ajg.0000000000000620. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Rothan H.A., Byrareddy S.N. The epidemiology and pathogenesis of coronavirus disease (COVID-19) outbreak. Journal of autoimmunity. 2020;109:102433. doi: 10.1016/j.jaut.2020.102433. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.S. Lai, I. I. Bogoch, A. Watts, K. Khan, Z. Li, and A. Tatem, "Preliminary risk analysis of 2019 novel coronavirus spread within and beyond China," ed, 2020.
- 8.Chinazzi M., Davis J.T., Ajelli M., Gioannini C., Litvinova M., Merler S. The effect of travel restrictions on the spread of the 2019 novel coronavirus (COVID-19) outbreak. Science. 2020;368:395–400. doi: 10.1126/science.aba9757. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.L. Wang and A. Wong, "COVID-Net: A tailored deep convolutional neural network design for detection of COVID-19 cases from chest radiography images," arXiv preprint arXiv:2003.09871, 2020. [DOI] [PMC free article] [PubMed]
- 10.Qin G., Gao L. Spectral clustering for detecting protein complexes in protein-protein interaction (PPI) networks. Mathematical and Computer Modelling. 2010;52:2066–2074. [Google Scholar]
- 11.Mahmoud H., Masulli F., Rovetta S., Russo G. Community detection in protein-protein interaction networks using spectral and graph approaches. in International Meeting on Computational Intelligence Methods for Bioinformatics and Biostatistics, LNCS. 2013;8452:62–75. [Google Scholar]
- 12.C.-T. Kuo, P. B. Walker, O. Carmichael, and I. Davidson, "Spectral clustering for medical imaging," in 2014 IEEE International Conference on Data Mining, 2014, pp. 887-892.
- 13.Xia K., Gu X., Zhang Y. Oriented grouping-constrained spectral clustering for medical imaging segmentation. Multimedia Systems. 2020;26:27–36. [Google Scholar]
- 14.Zhou C., Su F., Pei T., Zhang A., Du Y., Luo B. COVID-19: challenges to GIS with big data. Geography and Sustainability, vol. 2020;1(1):77–87. [Google Scholar]
- 15.Mollalo A., Vahedi B., Rivera K.M. GIS-based spatial modeling of COVID-19 incidence rate in the continental United States. Science of The Total Environment. 2020;728:138884. doi: 10.1016/j.scitotenv.2020.138884. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.K. Elasnaoui and Y. Chawki, "Using X-ray Images and Deep Learning for Automated Detection of Coronavirus Disease," Journal of Biomolecular Structure and Dynamics, pp. 1-22, 2020. [DOI] [PMC free article] [PubMed]
- 17.E. E.-D. Hemdan, M. A. Shouman, and M. E. Karar, "Covidx-net: A framework of deep learning classifiers to diagnose covid-19 in x-ray images," arXiv preprint arXiv:2003.11055, 2020.
- 18.R. Dandekar and G. Barbastathis, "Neural Network aided quarantine control model estimation of global Covid-19 spread," arXiv preprint arXiv:2004.02752, 2020.
- 19.Ait Z., Mouden El, Taj R.M., Jakimi A., Hajar M. Towards for Using Spectral Clustering in Graph Mining. Communications in Computer and Information Science. 2018;872:144–159. [Google Scholar]
- 20.Z. A. El Mouden, A. Jakimi, and M. Hajar, "An application of spectral clustering approach to detect communities in data modeled by graphs," in Proceedings of the 2nd International Conference on Networking, Information Systems & Security, ACM, 2019.
- 21.N. Afiqah-Aleng, M. Altaf-Ul-Amin, S. Kanaya, and Z.-A. Mohamed-Hussein, "Polycystic ovarian syndrome novel proteins and significant pathways identified using graph clustering approach," Reproductive BioMedicine Online, 2019. [DOI] [PubMed]
- 22.Tang M., Marin D., Ayed I.B., Boykov Y. Kernel cuts: Kernel and spectral clustering meet regularization. International Journal of Computer Vision. 2019;127:477–511. [Google Scholar]
- 23.W. Casaca, G. Taubin, and L. G. Nonato, "Graph laplacian for spectral clustering and seeded image segmentation," in Anais do XXVIII Concurso de Teses e Dissertacoes, 2020, pp. 31-36.
- 24.X. Yang, C. Deng, F. Zheng, J. Yan, and W Liu, "Deep spectral clustering using dual autoencoder network," in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019, pp. 4066-4075.
- 25.Z. Ait El Mouden, A. Jakimi, and M. Hajar, "An Algorithm of Conversion Between Relational Data and Graph Schema," in International Conference Europe Middle East & North Africa Information Systems and Technologies to Support Learning, 2019, Smart Innovation Systems and Technologies, vol. 1ll, pp. 594-602.
- 26.P. De Meo, E. Ferrara, G. Fiumara, and A. Provetti, "Generalized louvain method for community detection in large networks," in 2011 11th International Conference on Intelligent Systems Design and Applications, 2011, pp. 88-93.
- 27.A. Y. Ng, M. I. Jordan, and Y. Weiss, "On spectral clustering: Analysis and an algorithm," in Advances in neural information processing systems, 2002, pp. 849-856.
- 28.Afzalan M., Jazizadeh F. An automated spectral clustering for multi-scale data. Neurocomputing. 2019;347:94–108. [Google Scholar]
- 29.V. Batagelj and A. Mrvar, "Pajek: Program for analysis and visualization of large networks," Timeshift-The World in Twenty-Five Years: Ars Electronica, pp. 242-251, 2004.
- 30.M. Bastian, S. Heymann, and M. Jacomy, "Gephi: an open source software for exploring and manipulating networks," in Third international AAAI conference on weblogs and social media, 2009.
- 31.J. Webber, "A programmatic introduction to neo4j," in Proceedings of the 3rd annual conference on Systems, programming, and applications: software for humanity, 2012, pp. 217-218.
