2022 Jan 11;4(1):44–59. doi: 10.1109/TAI.2022.3142241

TABLE II. ML Articles Describing the Dataset, the Author's Identity, the Nation of Publishing, the Method Used in the Article, and the Findings for Analyzing the COVID-19 Disease.

S. No. | Author | Country | Datasets used | Methods applied | Task and algorithm used | Results
1 | Khanday et al. [38] | India | 212 reports by GitHub | ML algorithm | Logistic regression, Naïve Bayes and classification used | According to the study, logistic regression and multinomial Naïve Bayes are 96% more accurate than commonly used algorithms.
2 | Burdick et al. [39] | USA | 197 patients of United States health systems | Support vector Kuhntucker model | Logistic regression, classification | The algorithm had a higher diagnostic odds ratio (12.58) for anticipating ventilation and triaging patients than a comparator early warning system, the Modified Early Warning Score (MEWS). MEWS had a sensitivity of 0.78, whereas the algorithm had a sensitivity of 0.90, resulting in higher specificity (p < 0.05). The algorithm also accurately identified 16% more patients than a commonly used scoring method, resulting in fewer false positive findings.
3 | Varun et al. [40] | USA | Total reported cases are 184 319 | ML algorithm | Convolutional neural networks, classifications | In response to the crisis, New York City's medical and academic centres issued a call to action to AI researchers to leverage their electronic medical record data to better understand SARS-CoV-2 patients. Given a shortage of ventilators and a reported need for a quick and accurate method of triaging patients at risk of respiratory failure, the goal was to develop a machine-learning algorithm that helps frontline physicians in the emergency department and on inpatient floors risk-assess patients and predict who would require intubation and mechanical ventilation.
4 | Luca et al. [41] | Italy | 85 dataset of chest X-rays | ML algorithm | K-nearest neighbors' classifier | The authors present a method for automatically detecting COVID-19 disease by analyzing medical images. Supervised ML methods are used to develop a model from 85 chest X-rays that are freely available for research purposes. The experiment demonstrates that the proposed technique is efficient in distinguishing between COVID-19 disease and other lung diseases.
5 | Constantin et al. [42] | Germany | 500 chest CTs dataset and 152 datasets of COVID-19 patients | Support vector Kuhntucker model | Convolutional neural network, classifications | The researchers found that combining ML with a clinically embedded software platform allows for speedier development, deployment, and adoption in medical practice. In just ten days, they developed a fully automated lung segmentation and opacity measurement approach that was ready for medical usage and performed at human levels even in difficult situations.
6 | Lamiaa et al. [43] | Egypt | 5000 cases of COVID-19 | ML algorithm | Linear regression model | The results demonstrated that the specified models, such as the exponential and the fourth-, fifth-, and sixth-degree polynomial regression models, perform well, especially the fourth-degree model, which will aid the government in planning operations for one month. They also included a well-known log that will rise through the regression model, resulting in the epidemic peak and end in 2020. There is also a final report on the total number of COVID-19 patients.
7 | Dan et al. [44] | Israel | Total 6995 patients in Sheba Medical Centre | Support vector Kuhntucker model | Artificial neural network and classifications | The most relevant variables in the models were the APACHE II score, white blood cell count, time from onset of symptoms to admission, oxygen saturation, and blood lymphocyte count. Machine-learning algorithms exhibited excellent efficacy in predicting significant COVID-19 when compared to the most effective strategies available. As a result, artificial intelligence might be used to accurately predict COVID-19 patient risk, enhance patient triage and in-hospital allocation, better prioritise medical resources, and improve overall COVID-19 pandemic management.
8 | Joep et al. [45] | Netherlands | Total 319 patients | Gradient Boosting algorithm | Logistic regression and classification | The CO-RADS scoring system on chest CT provides a sensitive and specific approach for diagnosing COVID-19, especially if RT–PCR tests are scarce during an outbreak. Combining a predictive machine-learning model with diagnostic chest CT for COVID-19 could increase accuracy even more. To improve the model, the authors look into more possible predictors. However, because up to 9% of RT–PCR positive patients are not diagnosed by chest CT or the ML model, RT–PCR should remain the gold standard of testing.
9 | Christopher et al. [46] | Germany | Total 368 independent variables | ML algorithm | Naïve Bayes and classifications | They mainly focused on variables and factors that increase COVID-19 incidence in Germany, using the multimethod ESDA technique, which also provides an appropriate insight into spatial and spatial nonstationaries of COVID-19 occurrence. Variables like infrastructure, built environment densities, and socioeconomic factors all showed a link with COVID-19 after being examined on a county level in Germany. Their findings suggest that avoiding needless travel and social isolation can be effective approaches to limit contamination.
10 | Hoyt et al. [47] | U.S. | Total 290 patients | Support vector Kuhntucker model | Logistic regression and classification | In the entire population, the findings revealed no link between mortality and therapy, although hydroxychloroquine was connected to a statistically significant (p = 0.011) improvement in survival, with an adjusted hazard ratio of 0.29 and a CI of 0.11–0.75. The algorithm predicted an adjusted survival of 82.6% in the treated group and 51.2% in the untreated group, a 31% improvement in the COVID-19 population after ML applications, demonstrating the critical role of ML in medicine.
11 | María et al. [48] | International | Food for each of the 170 countries | ML algorithm | K-means clustering | According to the data, countries with the highest death rates consume more fats, whereas those with the lowest death rates consume more grains and have a lower overall average calorie intake.
12 | Shinwoo et al. [49] | USA | Total 790 Korean immigrants | ML algorithm | Artificial neural network, classifications | Artificial neural network (ANN) analysis, a statistical model capable of investigating complex nonlinear interactions of variables, was applied. The algorithm has properly predicted a person's flexibility, familiarity with everyday discernments, and racial actions toward Asians in the United States since the beginning of the COVID-19 epidemic, offering critical advice for public health practitioners.
13 | Yigrem et al. [50] | Southern Ethiopia | Total 244 samples | ML algorithm | Logistic regression, classification | More than half of the study participants reported coronavirus disease-related stress, showing that there is a strong association between COVID-19-related stress and health-care employees.
14 | Abolfazl et al. [51] | USA | Total database of 57 candidate from the US Centers for Disease Control and Prevention and Johns Hopkins University | ML algorithm | ANN, classification | According to Getis-Ord Gi, the results showed that the supplied model (logistic regression) demonstrated that these components and factors define the presence/absence of the COVID-19 hotspot in a geographic information system (p < 0.05). As a result, the findings were useful in identifying the impact of potential risk variables connected to COVID-19 for public health decision-makers.
15 | Rustam et al. [52] | Pakistan | Time series COVID-19 database | LR, LASSO, SVM, ES | Texture data are used as input and supervised learning such as linear regression, LASSO regression, support vector machine, and exponential smoothing is used | ES outperforms all other models, followed by LR and LASSO, which are also good at projecting new confirmed instances.
16 | Sharma [53] | India | CT image database | Residual neural network | Image data are used as input and the custom vision software of Microsoft Azure, based on ML techniques, is used | 91% accuracy achieved
 | Peng, Nagata [54] | Brazil | Various countries' COVID-19 data | Support Vector Regression (SVR) | Text data are used as input and support vector regression and kernel functions are used | It is clear that caution is required when using ML.
17 | Ardabili et al. [55] | Germany | 5 countries COVID data | MLP, ANFIS | Time-series data are used as input and a genetic algorithm, particle swarm optimization, and a supervised learning algorithm are used | High generalization
18 | Nemati et al. [56] | USA | 1182 hospitalized patients COVID-19 dataset | SVM | Text data are used as input | Significant results have been achieved in predicting recovery time
19 | Sun et al. [57] | USA | COVID-19 patients' data of Massachusetts, Georgia, and New Jersey | Gradient boosting algorithm | Texture data are used as input | Better prediction rate
20 | Burdick et al. [58] | USA | COVID-19 patient dataset | ML algorithm | Text data are used as input and ML and MEWS are used | Good prediction rate
21 | Kavadi et al. [59] | India | Indian COVID-19 dataset | Support vector Kuhntucker model | Text data are used as input and a partial derivative regression and nonlinear ML (PDR-NML) method is proposed | Better prediction rate
22 | Banerjee et al. [60] | U.K. | D-19 data from Midstream | ANN | Text data are used as input | A higher rate of infection detection prediction is attained.
23 | Wang et al. [61] | China | COVID-19 data | Logistic model + Prophet method | Time-series data are used as input and the Fb Prophet model is used | Good prediction rate
24 | Han et al. [62] | China | CT datasets | AD3D-MIL algorithm (A Deep 3D-Multiple Instance Learning) | Image data are used as input and attention-based deep 3-D multiple instance learning (AD3D-MIL) is used | An accuracy of 97.9% is obtained
25 | Vaid et al. [63] | Canada | JHU CSSE database | Developed an ML model to uncover hidden patterns based on reported cases and to predict potential infections | Text data are used as input | Good prediction rate
26 | Elaziz et al. [64] | Egypt | Two chest X-ray COVID-19 datasets | KNN + Manta-Ray Foraging Optimization | Image data are used as input and CNN is used | For the two datasets, accuracies of 96.09% and 98.09% were obtained.
27 | Ahamad et al. [65] | Bangladesh | Patient COVID-19 data | Extreme Gradient Boosting, Decision Tree, RF, SVM, Gradient Boosting Machine | Text data are used as input and Random Forest, XGBoost, Gradient Boosting Machine, and SVM are used | XGB outperformed other proposed methods
28 | Brinati et al. [66]; Hasan [67] | Italy; Wuhan | Time series COVID-19 dataset | Ensemble empirical mode decomposition (EEMD) + ANN | Text data are used as input and support vector machines and random forest algorithm are used | Better prediction rate
29 | Farid et al. [68] | Egypt | CT images COVID-19 dataset | SVM, NB, CNN, RF, as well as JRIP | Image data are used as input and composite hybrid feature extraction (CHFS) is used | The proposed CHFS has a higher prediction rate than CNN.
30 | Shaban et al. [69] | Egypt | CT images COVID-19 dataset | Enhanced KNN | Image data are used as input and a genetic algorithm (GA) and KNN classifier are used | Good detection rate
31 | Ou et al. [70] | China | Pandemic COVID-19 data | Neural network | Text data are used as input and support vector machines and RF algorithm are used | Good identification rate
32 | Samuel et al. [71] | USA | COVID-19 dataset | LR, Naïve Bayes (NB), linear regression (LiR), KNN | Text data are used as input and logistic regression (LR) and KNN are used | NB outperformed other techniques
33 | Pinter et al. [72] | Germany | COVID-19 dataset of Hungary data | Adaptive network-based fuzzy inference system and multilayered perceptron-imperialist competitive algorithm | Text data are used as input and adaptive network-based fuzzy inference system and multilayered perceptron-imperialist competitive algorithm are used | Good prediction rate
34 | Carrillo-Larco and Castillo-Cara [73] | U.K. | COVID-19 patients' data | K-means algorithm | Text data are used as input and unsupervised ML is used | Better classification rate
35 | Benítez-Peña et al. [74] | Spain | Patients' COVID-19 data | RF and Support Vector Regression (SVR) | Text data are used as input | High prediction rate
36 | Zhong et al. [75] | China | Patient COVID-19 blood sample data | SVM, KNN, RF, LR | Text data are used as input | Better severity detection
37 | Yadav et al. [76] | India | COVID-19 synthetic dataset | SVR | Text data are used as input and support vector regression (SVR) is used | Polynomial regression and SVR outperformed LiR
38 | Chang et al. [101] | Australia | COVID-19 dataset | ABM approach | Australian Census-based epidemic model | Agent-based modelling using a fine-grained computational simulation applied
39 | Zhang et al. [102] | Africa | Africa CDC dataset | PHSM data (Oxford COVID-19 Government Response Tracker dataset) | Text data are used as input | Descriptive analyses were done to establish the different cases
40 | Andrikopoulos and Greg [103] | Australia | COVID-19 dataset | Australian Centre for Behavioural Research in Diabetes, Diabetes Australia adapted a resource developed | Text data are used as input | It is clear that people with diabetes are at greater risk of serious health impacts in pandemics such as COVID-19 than people without diabetes
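
Several rows of Table II report sensitivity, specificity, or a diagnostic odds ratio (for example, row 2). As a reading aid, the following minimal sketch shows how these three quantities are conventionally derived from a binary confusion matrix. The function name `diagnostic_metrics` and the counts in the usage example are hypothetical and chosen only for illustration; they are not taken from any of the cited studies.

```python
def diagnostic_metrics(tp, fp, fn, tn):
    """Compute standard diagnostic metrics from confusion-matrix counts.

    tp, fp, fn, tn are the numbers of true positives, false positives,
    false negatives, and true negatives (hypothetical inputs).
    """
    sensitivity = tp / (tp + fn)   # share of actual positives that are detected
    specificity = tn / (tn + fp)   # share of actual negatives correctly ruled out
    # Diagnostic odds ratio: odds of a positive test among positives
    # divided by odds of a positive test among negatives.
    dor = (tp * tn) / (fp * fn)
    return {"sensitivity": sensitivity, "specificity": specificity, "dor": dor}


# Illustrative counts only (assumed, not from the reviewed articles).
metrics = diagnostic_metrics(tp=90, fp=20, fn=10, tn=80)
print(metrics)
```

With these illustrative counts, the sensitivity is 0.90, the specificity is 0.80, and the diagnostic odds ratio is 36. The ratio is undefined when either `fp` or `fn` is zero, so a real analysis would need to handle that case, for example with a continuity correction.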