Skip to main content
Springer Nature - PMC COVID-19 Collection logoLink to Springer Nature - PMC COVID-19 Collection
. 2022 Dec 24:1–39. Online ahead of print. doi: 10.1007/s11042-022-14305-w

Machine learning and deep learning approach for medical image analysis: diagnosis to detection

Meghavi Rana 1, Megha Bhushan 1,
PMCID: PMC9788870  PMID: 36588765

Abstract

Computer-aided detection using Deep Learning (DL) and Machine Learning (ML) shows tremendous growth in the medical field. Medical images are considered as the actual origin of appropriate information required for diagnosis of disease. Detection of disease at the initial stage, using various modalities, is one of the most important factors to decrease mortality rate occurring due to cancer and tumors. Modalities help radiologists and doctors to study the internal structure of the detected disease for retrieving the required features. ML has limitations with the present modalities due to large amounts of data, whereas DL works efficiently with any amount of data. Hence, DL is considered as the enhanced technique of ML where ML uses the learning techniques and DL acquires details on how machines should react around people. DL uses a multilayered neural network to get more information about the used datasets. This study aims to present a systematic literature review related to applications of ML and DL for the detection along with classification of multiple diseases. A detailed analysis of 40 primary studies acquired from the well-known journals and conferences between Jan 2014–2022 was done. It provides an overview of different approaches based on ML and DL for the detection along with the classification of multiple diseases, modalities for medical imaging, tools and techniques used for the evaluation, description of datasets. Further, experiments are performed using MRI dataset to provide a comparative analysis of ML classifiers and DL models. This study will assist the healthcare community by enabling medical practitioners and researchers to choose an appropriate diagnosis technique for a given disease with reduced time and high accuracy.

Keywords: Machine learning, Deep learning, Medical image processing, Convolutional neural network, Transfer learning, Healthcare, Tumor classification

Introduction

The significance of disease classification and prediction can be observed from the previous years. The important properties and features given in a dataset should be well-known to identify the exact cause along with the symptom of the disease. Artificial Intelligence (AI) has shown promising results by classifying and assisting in decision making. Machine Learning (ML), a subset of AI, has accelerated many research related to the medical field. Whereas, Deep Learning (DL) is a subset of ML that deals with neural network layers, analyzing the exact features required for disease detection [34, 71, 94]. The existing studies from 2014 to present, discusses many applications and algorithms developed for enhancing the medical field by providing accurate results for a patient. Using data, ML has driven advanced technologies in many areas including natural language processing, automatic speech recognition, and computer vision to deliver robust systems such as driverless cars, automated translation, etc. Despite all advances, the application of ML in medical care remained affected with hazards. Many of these issues were raised from medical care stating the goal of making accurate predictions using the collected data and managed by the medical system.

AI examines a given dataset using various techniques to get the required features or highlights from a huge amount of data resulting in difficulty for tracking down an ideal arrangement of significant features and excluding repetitive ones. Considering such features is inconvenient and accuracy metrics becomes erroneous. Hence, choosing a small subset from a wide scope of features will upgrade the efficiency of the model. Subsequently, the exclusion of inconvenient and repetitive features will decline the dimensionality of the information, speed up the learned model similar to boosting [37]. From the existing features, the significant features are extracted using practical approaches such as Principal Component Analysis (PCA) and Linear Discriminant Analysis (LDA). Particularly, choosing a feature has two essential clashing objectives, first, boosting the presentation of arrangement and second, limiting the count of features to conquer the issue of dimensionality. Hence, selection of features is considered as an essential task for aforementioned objectives. Later, research related to the features improvement was enhanced by using choice-based multi-target strategies. Thus, in this review, strategies to choose efficient features will be focused.

Cancer disease was identified using multiple techniques of image segmentation, feature selection, and regression using Root Mean Square Error (RMSE), with the parameters such as recognizing patterns, detecting objects, and classifying the image [7]. Brain tumor was detected using six classifiers and Transfer Learning (TL) techniques for image segmentation with Magnetic Resonance Imaging (MRI) of the brain [28]. Also, a TL approach was implemented to identify lung cancer and brain disease in [55]. It analyzed MRI and Computer-Tomography (CT) scan images by using supervised learning Support Vector Machine (SVM) classifiers. The image analysis process has been well understood in the existing studies. However, the techniques using ML and DL are continuously being updated. Therefore, it is a complex task for researchers to identify an accurate method for analyzing images and feature selection techniques varying with every method. The key contributions of this study include:

  • (i)

    Classification of diseases after reviewing primary studies,

  • (ii)

    Recognition of various image modalities provided by existing articles,

  • (iii)

    Description of tools along with reliable ML and DL techniques for disease prediction,

  • (iv)

    Dataset description to provide awareness of available sources,

  • (v)

    Experimental results using MRI dataset to compare different ML and DL methods,

  • (vi)

    Selection of suitable features and classifiers to get better accuracy, and.

  • (vii)

    Insights on classification as well as review of the techniques to infer future research.

The significance of this review is to enable physicians or clinicians to use ML or DL techniques for precise and reliable detection, classification and diagnosis of the disease. Also, it will assist clinicians and researchers to avoid misinterpretation of datasets and derive efficient algorithms for disease diagnosis along with information on the multiple modern medical imaging modalities of ML and DL.

The study presented consists of 11 sections. The organization of the section is described as follows: Section 2 discusses the background of study, Section 3 discusses the review techniques, search criteria, source material and the quality assessment. Section 4 summarizes the current techniques and important parameters to acquire good accuracy. Section 5 gives an insight of medical image modalities. Section 6 sums up the tools and techniques being used in ML and DL models. Section 7 discusses the datasets used by the authors previously and gives an insight of data. Section 8 represents the experimental section using ML classifiers and DL models over brain MRI dataset. Section 9 recaps the analytic discussion about the techniques, datasets being used, tools in ML and DL, journals studied for the given article. Discussion, conclusion and future scope is discussed in Sections 10 and 11, respectively.

Background

This section discusses the preliminary terms which are required to comprehend this review. Further, it also presents the statistical analysis of ML and DL techniques used for medical image diagnosis.

Machine learning

ML is a branch of AI where a machine learns from the data by identifying patterns and automates decision-making with minimum human intervention [96, 24, 12]. The most important characteristic of a ML model is to adapt independently, learn from previous calculations and produce reliable results when new datasets are exposed to models repeatedly. The two main aspects include (i) ML techniques help the physicians to interpret medical images using Computer Aided Design (CAD) in a small period of time, and (ii) algorithms used for challenging tasks like segmentation with CT scan [81], breast cancer and mammography, segmenting brain tumors with MRI. Traditional ML models worked on structured datasets where the techniques were predefined for every step, the applied technique fails if any of the steps were missed. The process of evaluating the data quality used by ML and DL algorithms is essential [1622, 61]. Whereas, new algorithms adapt the omission of data based on the requirement for robustness of the algorithm. Figure 1 illustrates the process used by ML algorithms for the prediction and diagnosis of disease.

Fig. 1.

Fig. 1

ML process [10]

Deep learning

DL models enable machines to achieve the accuracy by advancements in techniques to analyze medical images. In [58], the heart disease was diagnosed using the labelled chest X-Rays, cardiologist reviewed and relabelled all the data while discarding the data other than heart failure and normal images. To extract the exact features from the images, data augmentation and TL were used with 82% accuracy, 74% specificity and 95% sensitivity for heart failure. In [14], an automatic feature selection, using histopathology images with the labelling of positive and negative cancer images, was developed with minimum manual work. Two networks named Deep Neural Network (DNN) 2-F and DNN1-F were used with PCA to reduce features in DNN whereas for unsupervised feature learning a single-layer network of K-means centroids was used. Later, the results of unsupervised (93.56%) and supervised (94.52%) learning were compared. The DL model automates the feature extraction procedure to handle data efficiently [14, 74]. Figure 2 depicts the process used by DL algorithms for the prediction and diagnosis of various diseases.

Fig. 2.

Fig. 2

DL process

To process the medical images for better prediction and accuracy, ML and DL techniques were used as shown in Figs. 1 and 2, respectively. As input, medical images from various modalities are taken into consideration, and then algorithms are applied to these images. Further, the input image is segmented based on various factors, these segments were used to extract the essential and maximum features using feature extraction techniques. After the extraction of the required features, they are further refined to obtain actual features used for the identification of diseases [60]. Also, ML approaches were used to denoise the medical images for better prediction and accuracy in [46]. Once the feature selection and noise removal from the data are achieved, the classification of the images according to the disease using classifiers like SVM, Decision Tree (DT), etc. was attained.

ML is the process where computers learn from data and use algorithms to carry out a task without being explicitly programmed. It uses pattern recognition to make predictions with new dataset. Alternatively, DL is modeled according to the human brain including a complex structure of algorithms enabling machines to process images, text and documents. It uses layered-structure algorithms such as Convolutional Neural Network (CNN), Artificial Neural Network (ANN), etc., to analyze the data with logics. Comparatively, DL is more capable of processing huge amount of data than ML models.

Review technique

In this section, an overview of the technique used to conduct this review systematically is discussed. It provides the details of the electronic databases used to search, retrieve information, and discuss the research questions framed to execute the review successfully. The systematic review guidelines implemented by [49, 50] were followed for this literature review.

Research questions

In this review, following review questions will be discussed:

  1. What are the present techniques of ML and DL used for medical imaging?
    • 1.1
      What are the considered parameters while selecting the classifiers?
    • 1.2
      What are the evaluation metrics used to evaluate classification models?
  2. What are various medical image modalities for classifying the diseases?

  3. What are the tools and techniques used for medical imaging?

  4. What are various datasets used by several researchers in the domain of healthcare?

  5. What are the results of comparative analysis of ML classifiers and DL models based on experiments using MRI dataset?

Source material

The guidelines given in [49, 50] are followed for searching the existing literature related to the area of ML and DL in medical imaging. Following electronic database sources are used for searching:

Search criteria

This review consists of the articles written in English language between the years 2014–2022. The review process can be considered as the filtering process for attaining the quality research articles with the inclusion and exclusion criteria at various stages. The search was based on the keywords as shown in Table 1 to retrieve research articles from various journals, conferences, book chapters, and other sources.

Table 1.

Keywords used

S.No General Keyword Specific Keywords Duration Type of article
1 Learning ML, DL, Prediction, Classification, Neural networks, AI, Python 2014–2022 Journal, Conferences, Workshops, Book chapters, Society, Transcripts
2 ML Healthcare, TL, Feature selection, Disease diagnose, Radiology, COVID medical image analysis, BI-RADS, Iris images, Diabetes, Denoising 2014–2022 Journal, International and national conferences, Society, Book chapters, Archives, Articles
3 DL Noisy labels, CNN, Medical aid, Heart, Augmentation, ANFC Classifier, DNN, ANN
4 Medical Imaging Image segmentation, Imaging fusion, Automated breast scan, TL, Multiview CNN, Trends in imaging
5 Healthcare Medical industry, Health industry, Monitoring and recognition using ML and DL, Integrated healthcare system, Patients, Chronic heart failure, Heart disease prediction.

ML Machine Learning, DL Deep Learning, ANFC Adaptive Neuro-Fuzzy Classifier, BI-RADS Breast Imaging Reporting and Data System, CNN Convolutional Neural Network, AI Artificial Intelligence, ANN Artificial Neural Network, DNN Deep Neural Network, TL Transfer Learning

The journals and conferences included were taken from IEEE, Science Direct, Springer, Oxford Publication, etc. The article selection method is depicted in Fig. 3. As depicted in Fig. 3, the initial search consisted of 16,900 articles which were refined to 250 based on the specific keywords used as shown in Table 1. Then 100 articles were retrieved based on their titles and were reduced to 75 articles based on their abstract and introduction. Finally, 40 articles were selected as primary studies based on the criteria of exclusion and inclusion.

Fig. 3.

Fig. 3

Article selection method

Quality assessment

The quality of this review was assured after inclusion and exclusion criteria discussed in sub-section 3.3. These primary studies were from various journals, conferences, workshops, and others (manuscripts, online records, and society publications). To retrieve the quality articles, analysis of each article was done to maintain fairness and validation (external and internal) of the results based on the CRD guidelines [50].

Table 2 presents the top 20 highly influential and cited articles related to the classification of diseases, identification of tools and techniques, explanation for the cause of disease, and solutions to the diagnosed disease (source: https://scholar.google.co.in).

Table 2.

Top 20 cited articles

Title Year J/BC/O/C Name of the J/BC/O/C Number of citations
Deep learning in medical image analysis. 2017 J Annual Review of Biomedical Engineering 2963
An overview of deep learning in medical imaging focusing on MRI. 2019 J Zeitschrift für Medizinische Physik (Journal of Medical Physics) 1259
Deep learning in medical imaging: General overview. 2017 J Korean Journal of Radiology 918
Medical image fusion: A survey of the state of the art. 2014 J Information Fusion 879
Application of deep convolutional neural network for automated detection of myocardial infarction using ECG signals. 2017 J Information Sciences 672
Survey of machine learning algorithms for disease diagnostic. 2017 J Journal of Intelligent Learning Systems and Applications 491
Automated diagnosis of arrhythmia using combination of CNN and LSTM techniques with variable length heart beats. 2018 J Computers in Biology and Medicine 468
Deep learning of feature representation with multiple instance learning for medical image analysis. 2014 C IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 390
Machine learning in medical imaging. 2018 J Journal of American College of Radiology 385
Preparing medical imaging data for machine learning. 2020 O Radiological Society of North America (RSNA) 333
Machine learning approaches in medical image analysis: From detection to diagnosis. 2016 J Medical Image Analysis 280
Deep learning with noisy labels: Exploring techniques and remedies in medical image analysis. 2020 J Medical Image Analysis 259
Transfer learning improves supervised image segmentation across imaging protocols. 2014 J IEEE Transactions on Medical Imaging 232
Liver disease prediction using SVM and Naïve Bayes algorithms. 2015 J International Journal of Science, Engineering and Technology Research (IJSETR) 225
Coronavirus disease (COVID‑19) cases analysis using machine‑learning applications 2021 J Applied Nanoscience 206
Deep learning for cardiovascular medicine: a practical primer. 2019 J European Heart Journal 169
Deep learning in medical image analysis. 2020 BC Deep Learning in Medical Image Analysis 159
Machine learning and deep learning in medical imaging: Intelligent imaging. 2019 J Journal of Medical Imaging and Radiation Sciences 139
Detection technologies and recent developments in the diagnosis of COVID-19 infection. 2021 J Applied Microbiology and Biotechnology 139
A review of challenges and opportunities in machine learning for health. 2020 O AMIA Summits on Translational Science Proceedings 125

J Journal, BC Book Chapter, C Conference, O Other (Manuscripts, online records, Society publications, Proceedings)

Data extraction

Initially, many challenges were faced to extract the relevant data for this review, therefore, some researchers were approached to acquire the necessary information. The method for extracting the required data in this review is as follow:

  • One of the authors extracted the data after a thorough review of 40 articles.

  • The acquired results of the review were cross checked by another author to maintain consistency.

  • During the process of cross checking (in case of a conflict), issues were resolved by meetings between the authors.

ML and DL techniques for medical imaging

Research question 1 is answered in this section to provide an overview of the current techniques of ML and DL for medical imaging. Further, followed by various parameters considered for selecting the classifiers and the evaluation metrics used to evaluate classification models. The existing literature review is divided according to the diseases such as breast cancer, brain tumor, lung disease, diabetes, multiple disease detection, etc.

Breast disease

In this subsection, articles related to breast disease symptoms, detection, classification, prediction and diagnosis using ML and DL methods are discussed. In [33], significant features were identified using BI-RADS (Breast Imaging Reporting and Data System) to develop a CAD system for obtaining breast ultrasound. Also, 10-fold cross validation technique was used upon the benign and malignant lesions. As a result, 77% accuracy was achieved using the SVM classifier. However, some methods with a few algorithms handling the vast variety of data need to be understood and analyzed precisely [84]. CNN was used to train the system with the available clinical data and to comprehend the complex structure. Moreover, it was suggested to study radiomics and expansion of CADx to get the tumor signs using a CAD system. Breast cancer disease was classified using the parameters like Area Under Curve (AUC), sensitivity, and specificity [100]. A CAD system was developed using CNN where a large number of features were required, using multiview features. These features provide the maximum details of the image data to be extracted for the accuracy of detection and classification.

DL was used for analyzing medical images and also, the limitations along with success of DL techniques for medical imaging were discussed in [86]. Recent ML and DL technologies were reviewed for the classification and detection of medical imaging modalities [39]. It provided an insight on the progress of the technology used in the medical field. Various ML techniques used for image processing and DL techniques with the architecture of the algorithm were discussed. To study the technologies, the evaluation of various images such as histological images, thermography images, mammography, ultrasound and MRI using the CAD system was explored. Moreover, the system included ML techniques like SVM, ANN, DT, Naive bayes, K-Nearest Neighbor (KNN), etc.

Brain disease

The concept of TL was used for image segmentation where the MRI scan of the brain was segmented using voxel wise classification [7]. ML classifiers were applied for the classification of multiple diseases. Later, the results obtained were compared with the existing results to detect the disease.

A brief introduction of DNN in medical image analysis to diagnose the brain tumor using brain tissues is provided in [56]. It indicated the ways for applying DL to the entire process of MRI scanning, image retrieval, segmentation and disease prediction. It also focused on image acquisition to image retrieval, and from feature segmentation to prediction of disease. The entire process was divided into two parts: (i) the signal processing of MRI including the image restoration and image registration, and (ii) usage of DL for disease detection and prediction-based reports in the form of text and images. Also, the influence of DL in medical imaging was discussed in [82]. Image segmentation approaches using DL included tumor segmentation, brain and lung’s structure with bone tissues or cells. Patches were taken as input and 2-Dimensional Convolutional Neural Network (2D-CNN) was used to preprocess these at a later stage.

Lung disease

DL has the ability to automate the process of image interpretation which enhances the clinical decision making, identifying the disease and predicting the best treatment for the patient by reviewing the pros and cons of the DL techniques [51]. These techniques were used for the cardiovascular medication, following are the steps for implementing DL model: (i) problem identification, (ii) data selection, (iii) hardware and software selection, (iv) data preparation, (v) feature selection, and (vi) splitting of data for training as well as validation process. In [13], a disease was analyzed automatically using labeled data and achieved the accuracy by processing medical images using DL models. The automatic prediction of the disease using ML techniques and the concept of big data was summarized to detect the patterns [23]. The advantages and disadvantages for each algorithm were also discussed.

Diabetes

A comparative analysis of the classification algorithms based on iris images, using an iridology chart, was done for the diagnosis of diabetes [76]. Type-2 diabetes was detected by identifying the center of the pupil of an eye at the early stage using the I-Scan-2. Also, a filter-based feature selection method was used with the combination of five classifiers namely binary tree, SVM, neural network model, Random Forest (RF) and adaptive boosting model. Later, in [77] a study was compiled using the textural, statistical and various features (62 features of iris) to detect the same disease, however, an iridology chart was not used. ML and DL techniques were used to diagnose the errors in existing diagnostic systems [81]. These techniques were used to analyze the medical images and extract the features which are required for the diagnosis of errors in existing diagnostic systems. Both supervised and unsupervised algorithms were used for the prediction of the disease in specific datasets.

It was observed that DL technique is a way more powerful to investigate medical images [65]. Various techniques such as image classification, object detection, pattern recognition, etc. were used for the proper decision-making. It improved medical treatments by predicting the early symptoms of a disease. Moreover, an overview of ML and DL techniques used in the medical field was given for providing knowledge to the future researchers. In [78], techniques such as rubber sheet normalization, ML classifiers, PCA, etc. were used with self-created data and computed six parameters (i) accuracy, (ii) sensitivity, (iii) specificity, (iv)AUC, (v) precision, and (vi) F-score for accurate prediction of Type-2 diabetes.

Multiple disease detection

Multiple diseases were identified with different radiology techniques like MRI imaging for breast cancer along with brain tumor, CAD for breast cancer along with skin lesions, and X-Rays for chest analysis [46]. Also, ML techniques were used to attain better accuracy with denoising techniques including homographic wavelet, soft thresholding, non-homomorphic and wavelet thresholding. A CAD system using CNN was proposed to diagnose breast lesions as benign and malignant to assist the radiologists [100]. It was implemented using Inception-v3 architecture to extract the multiview features from Automated Breast Ultrasound (ABUS) images. For the implementation of the model, 316 breast lesions data were trained and evaluated. ML feature extraction scheme was compared with the given method, resulting in 10% increase in AUC value.

A review on image fusion was presented in [42], it reduced the randomness and improved the quality of available images. Various methods and challenges related to image fusion were also summarized. In [44], ML and DL techniques focusing on small labeled dataset were discussed as it was considered one of the important factors in decision making. Further, noisy data in medical images was analyzed with pros and cons of various ML algorithms.

In [4], data augmentation techniques were used to evaluate the dermatology diseases such as acne, atopic dermatitis, impetigo, psoriasis, and rosacea. To diagnose the mentioned diseases, the model was retrained in two phases: (i) with data augmentation, and (ii) without data augmentation using TensorFlow Inception V3. For statistical analysis, both the models were then compared and six parameters namely: (i) Positive Predictive Value (PPV), (ii) Negative Predictive Value (NPV), (iii) Matthew’s Correlation Coefficient (MCC), (iv) sensitivity, (v) specificity, and (vi) F1 score were calculated resulting in an increase of 7.7% average correlation coefficient.

Multiple diseases like diabetes, heart disease, liver disease, dengue and hepatitis were identified by recognizing the pattern in the available data and classifying them further using ML classifiers [29, 27, 47]. It used high-dimensional and multimodal dataset to predict the diseases accurately. The deteriorating condition of a patient was predicted using ML techniques like ML pipelines, classifiers (SVM and 5-fold cross-validation) with the baseline variables from MRI imaging [79]. AI applications in medical imaging, DL tools for the prediction and pattern recognition were described in [87]. In addition, apart from AI techniques, ANN and CNN were also useful for predicting the disease by analyzing the image pattern and classification of the disease can be carried out with the help of classifiers [62, 63].

Various algorithms were reviewed to detect the error in the diagnosis system implying the importance of ML and DL for early diagnosis of the disease [81]. Whereas, [104] discussed the three main challenges: (i) to cope up with image variations, (ii) learning from weak labels, and (iii) interpreting the results with accuracy for the diagnosis of cancer through given medical images. It concluded that TL was used to cope up with image variations. The concept of Multiple Instance Learning (MIL) and weighted TL were used to overcome the weakly labeled data and improve the accuracy of the disease classification for better medical results, respectively. It was suggested to comprehend the relation between image label and image collection instead of learning about the individual instance. The main advantage of the used technique is that it does not require the local manual annotations.

Table 3 represents the current ML and DL techniques for medical imaging, various parameters considered while selecting the classifiers, identified disease and evaluation metrics. Also, early tumor detection can assist clinicians to treat patients timely.

Table 3.

Summary of existing works related to ML and DL techniques for medical imaging

S.No. Article Year Technique(s) Parameter(s) Identified Disease(s) Performance Metric(s)
Cancer
 1 [104] 2016

● Weighting-based TL approach

● Supervised learning

● SVM with a gaussian kernel

● Maximum mean discrepancy

● Lung cancer

● Brain disease

-
 2 [23] 2018

● Radiomics

● Extension of CAD

● CNN

● DL

● TL

● Tumor signatures

● Features extracted from radiomics

● Breast cancer -
 3 [81] 2019

● CNN

● Image segmentation

● Feature selection using information retrieval techniques.

● Regression using RMSE

● Clustering

● Object detection

● Pattern recognition

● Image classification

● Cancer -
 4 [44] 2020

● DL

● Semi-supervised learning

● Labeled data

● Loss functions

● Data re-weighed

● Multiple disease (breast lesion detection, cancer detection) -
 5 [103] 2020 ● CAD based on CNN

● AUC

● Sensitivity

● Specificity

● Multiview features

● Five human diagnostics

● Breast cancer classification (benign and malignant)

● Sensitivity: 88.6%

● Specificity: 87.6%

● AUC: 0.9468

 6 [86] 2020

● Robust DL

● CAD tool

● Big data

● TL

● Interpretable AI

● Clinical data ● Lesion detection -
 7 [39] 2020

● DL

● ML

● Accuracy

● FMeasure

● AUC

● Precision

● Breast cancer

DDSM dataset:

● Accuracy: 97.4%

● AUC: 0.99

Inbreast dataset:

● Accuracy: 95.5%

● AUC: 0.97

BCDR dataset:

● Accuracy: 96.6%

● AUC: 0.96

 8 [38] 2021

● Imagescope (Aperio Imagescope)

● Normalized median intensity

● Color appearance matrix

● Annotated image

● Pathology

● Cancer analysis

Quality performance:

● QSSIM: 97.59%

● SSIM: 98.22%

● PCC: 98.43%

Tumor
 9 [84] 2016

● ANN

● RF

● SVM

● 10-fold cross-validation

● Image feature ● Breast tumor

Accuracy:

● SVM: 77.7%

● RF: 78.5%

 10 [56] 2017

● DL

● Feature selection algorithm

● Pooling

● 2D-CNN

● Big data

● Numerical or nominal values

● Lung tumor

● Brain disease

-
 11 [82] 2019

● ML

● DL

● MRI

● Brain tissues ● Brain tumor -
 12 [102] 2020

● Pixel intensity

● Filtering

● Side detection

● Segmentation

● FLASH (reduction of red eye from images)

● Human face ● Brain tumor -
 13 [97] 2020

● CAD

● TL

● Fuzzy feature selection

● Correlation feature selection

● Hand crafted features ● Breast tumors (benign and malignant)

Accuracy:

● Benign: 100%

● Malignant: 96%

 Multiple disease
 14 [42] 2014

● Fusion algorithms

● Morphological knowledge

● Neural network

● Fuzzy logic

● SVM

● Principal components feature

● Wavelets

● Brain

● Breast

● Prostate

● Lungs

-
 15 [29] 2017

● CAD

● Naïve bayes

● SVM

● Functional trees

● 13 features from 76 features

● Heart

● Diabetes

● Liver

● Dengue

● Hepatitis

Accuracy:

● Heart disease using SVM: 94.6%

● Diabetes using Naïve bayes: 95%

● Liver disease using functional tree: 97.1%

● Hepatitis disease using feed forward neural network: 98%

● Dengue disease using rough set theory: 100%

 16 [33] 2020

● Radiography

● MIL

● Imaging annotation

● Lung cancer

● Breast cancer

● Unsupervised feature learning: 93.56%

● Fully supervised feature learning: 94.52%

● MIL performance of coarse label: 96.30%

● Supervised performance of fine label: 95.40%

 17 [32] 2020 ● ML

● Trained models

● Human expert’s narrows

● Integrated disease -
 18 [23] 2020

● Supervised and unsupervised ML algorithms

● DT

● Bootstrap methods

● Clinical data ● Multiple diseases -
Skin disease
 19 [4] 2019 ● Tensorflow inception version-3

● Sensitivity

● Specificity

● PPV, NPN, MCC

● F1 score

● Acne

● Atopic dermatitis

● Impetigo

● Psoriasis

● Rosacea

Acne:

Sensitivity: 73.3%, Specificity: 95%, PPV: 78.6%, NPV: 93.4%, MCC: 70.1%, F1 score: 75.9%

Atopic dermatitis:

Sensitivity: 63.3%, Specificity: 87.5%, PPV: 55.9%, NPV: 90.5%, MCC: 48.6%, F1 score: 59.4%

Impetigo:

Sensitivity: 63.3%, Specificity: 93.3%, PPV: 70.4%, NPV: 91.1%, MCC: 59%, F1 score: 66.7%

Psoriasis:

Sensitivity: 66.7%, Specificity: 89.2%, PPV: 60.6%, NPV: 91.5%, MCC: 53.9%, F1 score: 63.5%

Rosacea:

Sensitivity: 60%, Specificity: 91.7%, PPV: 64.3%, NPV: 90.2%, MCC: 53%, F1 score: 62.1%

Diabetes
 20 [76] 2018

● I-Scan-2

● Integro differential operator

● CHT

● Rubber sheet normalization

● Iridology chart

● GLCM

● Filter based feature selection method (fisher-score discrimination, t-test, chi-square test)

● Classifiers (BT, SVM, AB, GL, NN, RF)

● Centre point and radius of pupil and iris

● Statistical, texture and discrete wavelength

● Type 2 - diabetes Accuracy: 89.66%
 21 [77] 2018

● I-Scan-2

● Integro differential operator

● Rubber sheet normalization

● 2D-DWT

● Five classifiers (BT, RF, AB, SVM, NN)

● Accuracy

● Specificity

● Sensitivity

● Diabetes

Accuracy: 59.63%

Specificity: 96.87%

Sensitivity: 98.8%

 22 [78] 2019

● ML based classification method (DT classifiers, SVM, ensemble classifiers)

● Iris segmentation

● Rubber sheet normalization

● Modified T-test

● PCA

● Accuracy

● Sensitivity

● Specificity

● Precision

● F-score

● AUC

● Type 2- diabetes Accuracy: More than 95%
Breast disease
 23 [98] 2020

● Segmentation methods

● Watershed method

● Clustering techniques

● Graph based techniques

● Classifier techniques

● Morphology techniques

● Hybrid techniques

● Evaluation metrics ● Breast disease -
 24 [60] 2021

● Genetic based artificial bee colony algorithm

● Ensemble classifiers (SVM, RF, DT, Naïve bayes, bagging, boosting)

● Optimization parameters

● Cost based functions

● Fitness value

● Modification rate

● Recursive feature elimination

● Chest pain Accuracy: More than 90%
Covid 19
 25 [53] 2021

● ML (supervised and unsupervised)

● Data fusion

- ● Covid-19

Accuracy with supervised ML: 92%

Accuracy with unsupervised ML: 7.1%

 26 [72] 2021

● RNN

● CNN

● Hybrid DL model

● Cough voice samples

● Blood samples

● Temperature

● Covid-19

Accuracy with CT scan images: Above 94%

Accuracy with x-ray images:

Between 90-98%

 27 [73] 2021

● Nucleic acid-based

● Serological techniques

- ● Covid-19 -
 28 [36] 2021 ● Gradient-boosting machine model with DT base-learners

● Cough

● Fever

● 60 + age

● Headache

● Sore throat

● Shortness of breath

● Covid-19 Accuracy: Above 80%
Heart disease
 29 [18] 2020 ● DL algorithms

● Sensitivity

● Specificity

● Heart disease Accuracy: 82%
 30 [90] 2022

● CNN

● Normalization

● Mean absolute deviation

● Sensitivity

● Specificity

● Cardiovascular disease Median quality score: 19.6%
 31 [48] 2022

Metaheuristics optimization-based features selection algorithms:

● SALP swarm optimization algorithm

● Emperor penguin optimization algorithm

● Tree growth optimization algorithm

● Aortic stenosis

● Mitral stenosis

● Mitral valve prolapses

● Mitral regurgitation

● Valvular heart diseases

Accuracy:

Five classes: 98.53%

Four classes: 98.84%

Three classes: 99.07%

Two classes: 99.70%

 32 [59] 2022

● Multifiltering

● REP tree

● M5P tree

● Random tree

● LR

● Naïve bayes

● J48

● Jrip

● Age

● Chest pain

● Blood pressure

● Cholesterol

● Fasting blood sugar

● Heart rate

● Slope

● ST depression

● Thalassemia

● Cardiovascular disease

Accuracy: 100%

Lowest MAE: 0.0011

Lowest RMSE: 0.0231

Prediction time: 0.01 s

Respiratory disease
 33 [6] 2020

● DL

● Hilbert-huang transform

● Multichannel lung sounds using statistical features of frequency modulations ● Chronic obstructive pulmonary disease

Accuracy: 93.67%

Sensitivity: 91%

Specificity: 96.33%

 34 [43] 2021

● AI

● ML

-

● Pulmonary function tests

● Diagnosis of a range of obstructive and restrictive lung diseases

-
 35 [69] 2020 ● CNN-MOE

Audio recordings:

● Crackle

● Wheeze

● Crackle and wheeze

● Normal

● Time labels (onset and offset)

Respiratory disease

Accuracy:

4-class: 80%

3-class: 91%

2-class: 86-90%

 36 [57] 2019 ● Improved bi-resnet DL architecture ● Annotated respiratory cycles Respiratory disease Accuracy: 50.16%
Other
 37 [7] 2015

● TL

● Segmentation through voxel wise classification

● MRI scanners

● MRI brain-segments

● White matter, gray matter, and cerebrospinal fluid segmentation

● Lesion segmentation

- Minimized classification error: 60%
 38 [79] 2018

● ML pipelining

● SVM classifier

● 5-fold cross-validation

● CMR imaging

● Baseline left ventricular

● Ejection fraction

● Left ventricular circumferential strain

● Pulmonary regurgitation

-

Minor deterioration: 82%

Major deterioration: 77%

 39 [27] 2019

● Image segmentation

● Feature selection

● Radiomic analysis

● Semantic analysis

● Lesion classification

● Pacs-side algorithm

● Weighted sum

● Feature map

- -
 40 [26] 2020

● Data mining

● Pattern classification

● Neural nets

● CNN

● Lenet5

● Max pooling

● Feature extraction

● IRIS manipulation using SVM techniques

-

Accuracy:

SVM: 82%

CNN: 93.57%

CNN Convolutional Neural Network, SVM Support Vector Machine, ML Machine Learning, DL Deep Learning, MRI Magnetic Resonance Imaging, PCA Principal Component Analysis, BT Binary Tree, RF Random Forest, NN Neural Network, AB Adaptive Boosting, CAD Computer-aided Diagnosis System, ANN Artificial Neural Network, AUC Area Under Curve, RMSE Root Mean Square Error, 2D-DWT Two-Dimensional Discrete Wavelet Transform, MAE Mean Absolute Error, QSSIM Quaternion Structure Similarity Index Metric, SSIM Structure Similarity Index Metric, PCC Pearson Correlation Coefficient, MoE Mixture of Experts, MIL Multiple Instance Learning, PPV Positive Predictive Value, NPV Negative Predictive Value, MCC Matthew’s Correlation Coefficient, TL Transfer Learning.

Modalities for medical image

Research question 2 (refer subsection 3.1) is addressed in this section, various medical image modalities (I-Scan-2, CT-Scan, MRI, X-Ray, Mammogram and Electrocardiogram (ECG)) used for classifying the diseases in the primary studies are shown in Table 4. As observed, following modalities were used for the evaluation of medical data using ML and DL techniques.

Table 4.

Modalities for medical imaging and digital signal

Article I-Scan- 2 CT- SCAN MRI/ X-Ray Mammogram ECG
[18] - - + - -
[60] - - + - -
[33] - + + + -
[49] - - + + -
[82] - - - - -
[76] - + - - +
[77] + - - - -
[78] + - - - -
[42] - + + - -
[103] - + + - -
[32] - - + - -
[98] - - + - -
[53] - + - - -
[83] - + - - -

CT-SCAN Computed Tomography Scan, MRI Magnetic Resonance Imaging, X-Ray X Radiation, ECG Electrocardiogram. “+” and “-” signify that the article does and does not support the corresponding parameter, respectively

  • MRI: It uses magnetic resonance for obtaining electromagnetic signals. These signals are generated from human organs, which further reconstructs information about human organ structure [91]. MRIs with high resolution have more structural details which are required to locate lesions and disease diagnosis.

  • CT-Scan: It is a technology which generates 3-D images from 2-D X-Ray images using digital geometry [88].

  • Mammogram: For the effective breast cancer screening and early detection of abnormalities in the body, mammograms are used. Calcifications and masses are considered as the most common abnormalities resulting in breast cancer [5].

  • ECG: It is used to measure the heart activity electrically and to detect the cardiac problems in humans [8, 9, 105].

Tools and techniques

This section addresses research question 3 (refer subsection 3.1). After a thorough analysis of primary studies, various techniques (refer Table 6) and tools (refer Fig. 4) related to ML and DL techniques for healthcare were identified [67, 89]. It was observed that techniques have used scanned images with the help of image modalities such as MRI, CT-Scan, X-Rays, and so on. Also, in order to automate the process of image segmentation and classification, programming languages like R, MATLAB and Python were used to obtain accurate results. The subsections 6.1 and 6.2 precisely explain the tools and techniques used in primary studies for medical images, respectively.

Table 6.

ML and DL techniques used for medical imaging

Article DT AB ANN/CNN SVM TL RF Bayes Net PCA/ICA Others
[55] - - + - - - - - +
[7] - - - + + - - - -
[28] - - - + + - - - -
[18] - - - - - - - - -
[60] - - - - - - - - +
[46] - - - - - - - - -
[84] - - + + - + - - -
[104] - - + - + - - - -
[49] - - - - - - - + +
[82] + - - + - - - + -
[56] - - + - - - - - -
[76] - + + + - + + - -
[77] - + - + - + + - -
[78] + - - + - - - + +
[42] - - + + + - - + -
[44] - - - - - - - - -
[29] - - - + - + + - -
[79] - - - + - - - - +
[27] - - - - - - - - -
[103] + - - + - + + - -
[38] - - + - + - - + +
[97] - - - - - - - + -
[32] - - - - - - - - +
[98] - - - - - - - - +
[53] - - - - - - - - -
[26] - - + + - - - - -

DT Decision Tree, AB Adaptive Boosting, ANN Artificial Neural Network, CNN Convolutional Neural Network, SVM Support Vector Machine, RF Random Forest, ML Machine Learning, TL Transfer Learning, PCA Principal Component Analysis, ICA Independent Component Analysis.

Fig. 4.

Fig. 4

Tools used for medical image analysis

Tools used for medical images

Figure 4 depicts the percentage of various tools (Table 5) used in the primary studies for the implementation of ML and DL models where MATLAB and NumPy have the percentage of 38 and 37, respectively, which signify the popularity of these tools among researchers. R and TensorFlow are the second most used tools with a percentage of 13 and 12, respectively.

Table 5.

Tool description

Tool Description
TensorFlow It is a platform independent tool which takes an input from a multidimensional array called tensor and displays the flow of instructions using a flowchart [37]. The Google brain team created TensorFlow to enhance ML and DNN research.
Numpy NumPy is the abbreviation for Numerical Python. It is a multidimensional array library of objects and routines for processing the given arrays [60].
MATLAB It is a programming platform to design and analyze a system [33, 83]. It uses a matrix-based language combining the variables for iterative analysis expressed in matrix [100].
R Studio It is an open-source language to implement a task for evaluating the use of data augmentation techniques in ML image recognition.

Techniques used for medical images

This subsection includes the description and identification of the most common ML and DL techniques (i) used for disease classification, detection and diagnosis, (ii) based on type of disease, and (iii) used for EEG and MEG data processing.

Description of techniques

  • CNN: It is a combination of DNNs which comprises three components, used to analyze the images (Refer Fig. 5). The components of CNN are as follow:
    1. Convolutional Layer: It is responsible to apply the filters systematically to create feature maps for summarizing features present in the input image.
    2. Pooling Layer: It is used for ordering the repeated layers in a model. It operates on each feature map, received from the convolutional layer, to produce a new set of feature maps pooled together. Pooling operation is used to reduce the feature map size with required pixels or values in each feature map, hence, reducing the overfitting problem. It consists of two main functions namely, average pooling and maximum pooling.
    3. Fully-Connected Layer: It is simply the feed-forward neural network where input is received from the final pooling layer. Based on the extracted features, a fully connected layer predicts the image class.
  • ANN: The flowchart shown in Fig. 6 depicts the working of ANN architecture. The model extracts the data required from the input image to further normalize it accordingly. While the images are normalized, random weights are assigned to all the connections present in the network. Furthermore, the dataset is divided in the ratio of 80:20. Then the training algorithm is selected for the error attainment, and if errors are identified then the weights should be recalibrated. At last, the model is tested and validated for further evaluation.

  • TL: It introduces a concept of transferring selected features from a predefined model to another model for solving the problem. It selects the feature or learns from the previous model and applies the features or learning to the new model to address different issues. ML models are capable of addressing only one specific task, however, TL can be applied to more than one problem making it more reliable and efficient (Refer Fig. 7).

  • RF: Fig. 8 illustrates the working of RF algorithm, where the algorithm randomly selects the data from a given dataset. Further, the GINI index given in Eq. 1 is applied to select the best possible split of the dataset. The splitting is applied to the dataset until the dataset becomes too small for splitting.

    graphic file with name M1.gif 1
    where,
    T
    dataset
    n
    number of classes
    pj
    relative frequency of class j in T
  • DT: It is a supervised ML algorithm which divides the problem into small sub problems. It consists of root node, internal node and leaf node. As shown in Fig. 9, root node, internal node, and leaf node depict the optimized version of the best selected feature, new subsets or features and outcome of each internal node, respectively.

  • SVM: It is a supervised ML algorithm used for the classification and regression problems. It is well known for predicting the class of unknown data. Also, it categorizes the unknown data into one of the two categories based on the labeled dataset (Refer Fig. 10).

Fig. 5.

Fig. 5

CNN architecture

Fig. 6.

Fig. 6

ANN architecture

Fig. 7.

Fig. 7

TL architecture

Fig. 8.

Fig. 8

RF architecture

Fig. 9.

Fig. 9

DT architecture

Fig. 10.

Fig. 10

SVM architecture

ML and DL techniques

Table 6 summarizes ML and DL techniques such as Naïve bayes [43, 69], KNN [6], DTs [36, 48], neural networks, and SVM [59, 73, 90] which are used for medical imaging in primary studies. Here, column 1 represents articles and row 1 represents various techniques. Further, “+” and “-” signify that the article does and does not support the corresponding technique, respectively. The most reliable ML and DL techniques based on the type of disease are shown in Table 7. The most significant ML and DL techniques for EEG and MEG data processing are shown in Table 8.

Table 7.

ML and DL techniques based on the type of disease

Disease Technique
Breast ML Ensemble learning (RF, Gradient boosting, AdaBoost classifiers)
DL CNN
Brain ML SVM, Naïve bayes
DL CNN, TL
Lung ML Supervised ML
DL DNN
Diabetes ML RF, LR
DL Ensemble model, CNN

ML Machine Learning, DL Deep Learning, RF Random Forest, CNN Convolutional Neural Network, SVM Support Vector Machine, DNN Deep Neural Network, LR Linear Regression, TL Transfer Learning

Table 8.

ML and DL techniques for EEG and MEG data processing

Article Year Classifier/Model Medical Test/Data
ML
 [41] 2017 SVM EEG
 [85] 2017 LS-SVM and FD EEG
 [68] 2017 LS-SVM EEG
 [15] 2017 SVM EEG
 [95] 2018 RF classifier EEG
 [52] 2019 Feature based techniques:LR, Linear SVM, FFNN, SCNN, Ra-SCNN MEG
 [54] 2019 KNN EEG
 [99] 2019 Gradient Boosting EEG
 [40] 2022 SVM EEG
 [1] 2022 KNN EEG
 [70] 2021 SVM with a radial basis function kernel MEG
DL
 [25] 2019 ANN EEG
 [35] 2020 ANN EEG
 [106] 2017 Softmax Classifier EEG
 [101] 2018 DNN EEG
 [31] 2021 Hybrid DNN (CNN and LSTM) EEG
 [92] 2021 CNN-RNN EEG
 [80] 2021 DeepMEG-MLP MEG
 [66] DeepMEG-CNN
2019 EEGNet-8, LF-CNN and VAR-CNN MEG
 [30] 2021 ANN MEG

SVM Support Vector Machine, LS-SVM Least Square-SVM, FD Fractal Dimensions, RF Random Forest, KNN K-Nearest Neighbor, SCNN Spatial Summary Convolutional Neural Network, Ra-SCNN SCNN model augmented with attention focused Recurrence, ANN Artificial Neural Network, DNN Deep Neural Network, LSTM Long Short-Term Memory Networks, CNN-RNN Convolutional Neural Network-Recurrent Neural Network, MLP Multi-Layer Perceptron, EEG Electroencephalogram, MEG Magnetoencephalography

Dataset description

Following section addresses the research question 4 (refer subsection 3.1) by providing the details of the datasets used in primary studies for implementing ML and DL algorithms. Table 9 summarizes the description of dataset(s) such as MRI, X-Rays, lesion data, infra-red images and CT-Scan. The accessibility to a dataset is divided as (i) public (available at online repositories), and (ii) own created (created by the authors).

Table 9.

Dataset description

Dataset Article Dataset Description Accessibility (public or own created)
MRI [7] Brain MRI tissues Public
[33] 4D DCE MRI images Own created
Combined [28] MRI and CT-Scan Public
[46] MRI and X-Rays Public
[42] MRI, CT, PET, Ultrasound, Mammography, Infrared, Microscopic, Molecular, Multi-modal medical image Own created
[27] CT, MRI, PET, Spec Public
[103] CT, MRI, PET, Mammography, Digital breast tomosynthesis, Radiography Public
Lesion [7]

White matter lesion

Multiple-sclerosis lesion

Public
[84] 283 pathology benign and malignant lesions Public
Infra-red Images [77] 338 Infrared images of both eyes Own created
[78] 200 Infrared images Own created
Others [4] Open-source dermatological images Public
[60]

Ten different datasets with number of features selected:

Dermatology dataset (32), Heart-C dataset (15), Lung cancer dataset (55), Pima Indian dataset (9), Hepatitis dataset (18), Iris dataset (5), Wisconsin cancer dataset (10), Lympho dataset (17), Diabetes disease dataset (8), Stalog disease dataset (12)

Own created
[29]

i. High-dimensional and multimodal bio-medical data.

ii. Cleveland heart dataset

iii. 303 cases and 76 attributes / features.

Public

MRI Magnetic Resonance Imaging, CT Computer Tomography, DCE Dynamic Contrast-Enhanced, PET Positron Emission Tomography. Dermatology dataset (32): 32 features from dermatology dataset, Heart-C dataset (15): 15 features from Heart-C dataset, Lung cancer dataset (55): 55 features from lung cancer dataset, Pima Indian dataset (9): 9 features from Pima Indian dataset, Hepatitis dataset (18): 18 features from hepatitis dataset, Iris dataset (5): 5 features from iris dataset, Wisconsin cancer dataset (10): 10 features from Wisconsin cancer dataset, Lympho dataset (17): 17 features from Lympho dataset, Diabetes dataset (8): 8 features from diabetes dataset, Stalog disease dataset (12): 12 features from stalog disease dataset

Experimental description

Research question 5 (refer subsection 3.1) is addressed in this section. MRI dataset is used for the experiments to show the comparative analysis of ML classifiers and DL models. Dataset¹ description and experimental setup are discussed in subsections 8.1 and 8.2, respectively. Similarly, the methodology and results are discussed in subsections 8.3 and 8.4, respectively.

Dataset description

The experiments to classify the brain tumor include the publicly available tumor dataset. (https://www.kaggle.com/datasets/masoudnickparvar/brain-tumor-mri-dataset). The MRI dataset contains the 711 images of meningioma tumor and no tumor. Dataset is divided into two parts: testing and training with different image resolutions.

Experimental setup

The whole series of experiments were performed on a 64-bit computer with an Intel(R) 221 Core(TM) i3-10110U CPU @ 2.10 GHz 2.59 GHz, 8GB RAM. To train and validate the model, code was implemented in python language in Google colab platform.

Methodology

Figure 11 depicts the methodology used in the experiments for disease classification. It is described as follows:

Fig. 11.

Fig. 11

Methodology used for disease prediction

  1. Import dataset: Dataset¹ is retrieved from the public website which is divided into two categories namely: no tumor and meningioma tumor. The dimensions of images given in the dataset were different from one another, which was further resized to 200 × 200.

  2. Label dataset: Dataset is labeled in the form of 0 and 1, where 0 and 1 indicate the data having no tumor and data having meningioma tumor, respectively.

  3. Split dataset: Further, the dataset is splitted in the ratio of 80:20 for training (80%) and testing (20%) dataset.

  4. Feature scaling and feature selection: ML algorithms work on numbers without knowing what the number represents. Feature scaling helps to resolve the given problem by scaling the features into a specific defined range, so that one feature does not dominate the other one. In this experiment, PCA technique is used to reduce the feature count and select the required features.

  5. Apply ML classifiers: For this experiment, ML classifiers (SVM, RF, DT, LR) and DL models (CNN, ResNet50V2) are used, which further classified the dataset into two categories i.e., 0 and 1.

  6. Prediction and testing the model: The model was tested with testing data (20% of the dataset) and predicted the disease accurately for the given dataset.

  7. Metrics calculation: The prediction for dataset using classifiers is illustrated with the help of a confusion matrix shown in Fig. 13. It calculates the four parameters, TP, TN, FP and FN, along with the accuracy metrics.

Fig. 13.

Fig. 13

Publisher by number of citations for a ML articles b DL articles

Results

This subsection discusses the results obtained by ML classifiers as shown in Fig. 12; Table 10. In Fig. 12a, b, c, and d illustrate the confusion matrix obtained from SVM, LR, RF, and DT, respectively. Table 10 shows the values of accuracy obtained after implementing the considered ML classifiers and DL models for the MRI dataset. The results show that CNN and RF have better accuracy with 97.6% and 96.93%, respectively.

Fig. 12.

Fig. 12

Confusion matrix for a SVM, b LR, c RF, and d DT

Table 10.

Accuracy results for MRI images using ML classifier/DL model

S.No. ML classifier/ DL model Accuracy (in %)
1. CNN 97.6
 2. RF 96.93
 3. SVM 95.05
 4. DT 93.35
 5. LR 93.01
 6. ResNet50V2 85.71

SVM Support Vector Machine, LR Logistic Regression, RF Random Forest, DT Decision Tree, CNN Convolutional Neural Network.

Analytical discussion

The primary studies were analyzed based on the publisher citation count, year wise publications, keywords, various diseases, techniques, imaging modalities and type of publication.

Publisher by citations

A schematic view of the influential publishers in the concerned domain is presented by the citations of the articles published in it. Figure 13 shows all the publishers considered for this review in between 2014 and 2022. Moreover, it depicts the number of citations of ML and DL articles with respect to the publishers in Fig. 14a and b, respectively. Due to many types of indexing procedures along with time, there is a variation in the count of citations in Google Scholar. It was observed that most of the articles for ML and DL were published in ScienceDirect and IEEE publishers with the maximum citation 2425 and 42,866, respectively.

Fig. 14.

Fig. 14

Year wise publication of ML and DL in healthcare

Scholarly articles published between 2014 and 2022

In this subsection, Fig. 14 depicts that out of 40 primary studies, the most published articles for ML were from the year 2020 with a count of 10, which is equivalent to 25% of the total. Followed by the year 2021 with 8 (20%), 2019 with 6 (15%), and 2022 with 4 (10%). Other years like 2017 and 2018 have the same count of 3 with 7.5%, 2014 and 2016 have the same count of 2 with 5%, and 2015 has the count of 1 with 2.5%. Thus, it can be observed that the maximum number of articles for primary study was considered from the year 2020 and minimum from 2014 to 2017.

Most commonly used keywords in the primary studies

Word cloud is the simple way to identify the relevant terms and themes utilized in the referenced research articles. Figure 15 depicts the word cloud which represents larger font for the most often used keywords and smaller font for less frequent keywords.

Fig. 15.

Fig. 15

Word cloud for frequently used keywords

Disease types

Figure 16 depicts the percentage of multiple diseases diagnosed in the primary studies. As observed, breast disease is the most common disease with the highest percentage (21%) among all. Brain tumor took the second place (18%) followed by diabetes (16%) and lung disease (16%). Also, other diseases such as eye, liver, skin, hepatitis and cancer were diagnosed using various techniques.

Fig. 16.

Fig. 16

Percentage of diseases

Publication by techniques and statistical analysis of techniques

It was observed that researchers have used multiple techniques to attain better results as shown in Table 5. For classification, ML classifiers like SVM, RF and Naïve bayes were combinedly used for the same. Detection was performed using neural networks such as ANN or CNN, and TL was performed frequently due to its capability of breaking down the large datasets. Figure 17 depicts the percentage of various techniques used in primary studies. It summarizes that SVM (20%) is the most widely used technique for medical image classification.

Fig. 17.

Fig. 17

Percentage of ML and DL techniques in healthcare

The statistical analysis of ML and DL techniques for medical diagnosis is represented in Fig. 18.

Fig. 18.

Fig. 18

Statistical analysis of ML and DL techniques for medical diagnosis

Imaging modalities

Figure 19 demonstrates the multiple image modalities used for the evaluation of medical images. However, MRI/X-Ray dominates the subject area with 45%. The second most used modality is CT-Scan (30%), followed by mammogram (10%) and I-Scan-2 (10%). Moreover, to automate the process of retrieving and analyzing the features, computer modalities such as CAD was included for the detection of hepatitis and cancer [55, 60].

Fig. 19.

Fig. 19

Percentage of modalities used in medical imaging

Type of publication

Figure 20 illustrates the distribution of articles according to the type of publications considered for this review. Majority of the articles were considered from journals with 70%, book Chaps. (8%), conference proceeding papers (7%), workshop articles (2%) and others (13%) including the society articles, online database articles, articles from publications like Bentham Science, springer archives and the transcript.

Fig. 20.

Fig. 20

Percentage of type of publication

Discussion

From this study, it was observed that the variability in the literature occurred due to uncertainty of the evaluated data and models (refer Fig. 21). Data uncertainty was caused due to the multiple sources such as transmission noise, missing values and measurement noise. Whereas, model uncertainty was observed due to the less understanding of architecture and prediction of future data with parameters. The observed uncertainty was helpful to attain different results with various methods. Recently, many advanced technologies were introduced to attain enormous amounts of raw data in different scenarios.

Fig. 21.

Fig. 21

Data and model uncertainties

Further, while reviewing the literature, it has been observed that focusing on every aspect of data (noisy or clear) is important as it impacts the results. The utilization of an appropriate algorithm to analyze images can be used for increasing the success ratio. Thus, variation in expected standard results is due to the use of raw data which may incorporate a certain amount of noise (refer Fig. 22). CNN is not much sensitive to the noise due to which it can extract information from noisy data [44]. Moreover, Hermitian basis functions were used for extracting the accumulated data from the ECG data which reduce the effects of Gaussian noise.

Fig. 22.

Fig. 22

Noisy data [11]

Therefore, dealing with the uncertainty of data and models with ML along with DL techniques is the most important issue to be handled by researchers. These techniques are useful for obtaining accurate and better results for decision making in every respective domain [2, 345, 64, 75, 93]. Therefore, there is a need to deal with the variance in ML and DL algorithms such as RF, Rubber Sheet Normalization, DT, bagging-boosting, ANN, CNN, SVM, TL, Bayes Net, and GLCM. Further, such strategies can be used to deal with ambiguity in medical data for achieving high performance. Based on this review, it has been observed that medical professionals may be able to treat tumors promptly if they are identified early.

Conclusions and future work

This study provides an overview of various ML and DL approaches for the disease diagnosis along with classification, imaging modalities, tools, techniques, datasets and challenges in the medical domain. MRI and X-Ray scans are the most commonly used modalities for the disease diagnosis. Further, among all the tools and techniques studied, MATLAB and SVM dominated, respectively. It was observed that MRI dataset is widely used by researchers. Also, a series of experiments using MRI dataset has provided a comparative analysis of ML classifiers and DL models where CNN (97.6%) and RF (96.93%) have outperformed other algorithms. This study indicates that there is a need to include denoising techniques with DL models in the healthcare domain. It also concludes that various classical ML and DL techniques are extensively applied to deal with data uncertainty. Due to the superior performance, DL approaches have recently become quite popular among researchers. This review will assist healthcare community, physicians, clinicians and medical practitioners to choose an appropriate ML and DL technique for the diagnosis of disease with reduced time and high accuracy.

Future work will incorporate DL approaches for the diagnosis of all diseases considering noise removal from any given dataset. The additional aspects and properties of DL models for medical images can be explored. To increase the accuracy, enormous amount of data is required, therefore, the potential of the model should be improved to deal with large datasets. Also, different data augmentation techniques along with required features of the dataset can be explored to attain better accuracy.

Data availability

Data sharing is applicable to this article as MRI datasets was analyzed during the current study.

Declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Footnotes

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Contributor Information

Meghavi Rana, Email: meghavi.rana16@outlook.com.

Megha Bhushan, Email: mb.meghabhushan@gmail.com.

References

  • 1.Abdulbaqi AS, Younis MT, Younus YT, Obaid AJ. A hybrid technique for EEG signals evaluation and classification as a step towards neurological and cerebral disorders diagnosis. Int J Nonlinear Anal Appl. 2022;13(1):773–781. [Google Scholar]
  • 2.Aceto G, Persico V, Pescapé A. The role of information and communication technologies in healthcare: taxonomies, perspectives, and challenges. J Netw Comput Appl. 2018;107:125–154. doi: 10.1016/j.jnca.2018.02.008. [DOI] [Google Scholar]
  • 3.Acharya UR, Fujita H, Oh SL, Hagiwara Y, Tan JH, Adam M. Application of deep convolutional neural network for automated detection of myocardial infarction using ECG signals. Inf Sci. 2017;415:190–198. doi: 10.1016/j.ins.2017.06.027. [DOI] [Google Scholar]
  • 4.Aggarwal 1LP. Data augmentation in dermatology image recognition using machine learning. Skin Res Technol. 2019;25(6):815–820. doi: 10.1111/srt.12726. [DOI] [PubMed] [Google Scholar]
  • 5.Al-Najdawi N, Biltawi M, Tedmori S. Mammogram image visual enhancement, mass segmentation and classification. Appl Soft Comput. 2015;35:175–185. doi: 10.1016/j.asoc.2015.06.029. [DOI] [Google Scholar]
  • 6.Altan G, Kutlu Y, Allahverdi N. Deep learning on computerized analysis of chronic obstructive pulmonary disease. IEEE J Biomed Health Inf. 2019;24(5):1344–1350. doi: 10.1109/JBHI.2019.2931395. [DOI] [PubMed] [Google Scholar]
  • 7.Anbeek P, Vincken KL, Van Bochove GS, Van Osch MJ, van der Grond J. Probabilistic segmentation of brain tissue in MR imaging. NeuroImage. 2005;27(4):795–804. doi: 10.1109/TMI.2014.2366792. [DOI] [PubMed] [Google Scholar]
  • 8.Arya R, Kumar A, Bhushan M (2021) Affect recognition using brain signals: a survey. In: Computational methods and data engineering. Springer, Singapore, pp 529–552. 10.1007/978-981-15-7907-3_40
  • 9.Arya R, Kumar A, Bhushan M, Samant P. Big five personality traits Prediction using brain signals. Int J Fuzzy Syst Appl (IJFSA) 2022;11(2):1–10. doi: 10.4018/IJFSA.296596. [DOI] [Google Scholar]
  • 10.Available at: https://techblog.cdiscount.com/a-brief-overview-of-automatic-machine-learning-solutions-automl/
  • 11.Available at: Noisy Data in Data Mining | Soft Computing and Intelligent Information Systems (ugr.es)
  • 12.Beam AL, Kohane IS. Big data and machine learning in health care. JAMA. 2018;319(13):1317–1318. doi: 10.1001/jama.2017.18391. [DOI] [PubMed] [Google Scholar]
  • 13.Bhatt C, Kumar I, Vijayakumar V, Singh KU, Kumar A. The state of the art of deep learning models in medical science and their challenges. Multimedia Syst. 2021;27(4):599–613. doi: 10.1007/s00530-020-00694-1. [DOI] [Google Scholar]
  • 14.Bhattacharya P, Tanwar S, Bodkhe U, Tyagi S, Kumar N. Bindaas: Blockchain-based deep-learning as-a-service in healthcare 4.0 applications. IEEE Trans Netw Sci Eng. 2019;8(2):1242–1255. doi: 10.1109/TNSE.2019.2961932. [DOI] [Google Scholar]
  • 15.Bhattacharyya A, Pachori RB, Upadhyay A, Acharya UR. Tunable-Q wavelet transform based multiscale entropy measure for automated classification of epileptic EEG signals. Appl Sci. 2017;7(4):385. doi: 10.3390/app7040385. [DOI] [Google Scholar]
  • 16.Bhushan M, Goel S. Improving software product line using an ontological approach. Sādhanā. 2016;41(12):1381–1391. doi: 10.1007/s12046-016-0571-y. [DOI] [Google Scholar]
  • 17.Bhushan M, Goel S, Kumar A, Negi A (2017) Managing software product line using an ontological rule-based framework. In: 2017 International Conference on Infocom Technologies and Unmanned Systems (Trends and Future Directions)(ICTUS). IEEE, pp 376–382
  • 18.Bhushan M, Goel S, Kaur K. Analyzing inconsistencies in software product lines using an ontological rule-based approach. J Syst Softw. 2018;137:605–617. doi: 10.1016/j.jss.2017.06.002. [DOI] [Google Scholar]
  • 19.Bhushan M, Goel S, Kumar A. Improving quality of software product line by analysing inconsistencies in feature models using an ontological rule-based approach. Expert Syst. 2018;35(3):e12256. doi: 10.1111/exsy.12256. [DOI] [Google Scholar]
  • 20.Bhushan M, Negi A, Samant P, Goel S, Kumar A. A classification and systematic review of product line feature model defects. Software Qual J. 2020;28(4):1507–1550. doi: 10.1007/s11219-020-09522-1. [DOI] [Google Scholar]
  • 21.Bhushan M, Kumar A, Samant P, Bansal S, Tiwari S, Negi A (2021) Identifying quality attributes of FODA and DSSA methods in domain analysis using a case study. In: 2021 10th international conference on System Modeling & Advancement in Research Trends (SMART). IEEE, pp 562–567
  • 22.Bhushan M, Duarte JÁG, Samant P, Kumar A, Negi A. Classifying and resolving software product line redundancies using an ontological first-order logic rule based method. Expert Syst Appl. 2021;168:114167. doi: 10.1016/j.eswa.2020.114167. [DOI] [Google Scholar]
  • 23.Caballé-Cervigón N, Castillo-Sequera JL, Gómez-Pulido JA, Gómez-Pulido JM, Polo-Luque ML. Machine learning applied to diagnosis of human diseases: a systematic review. Appl Sci. 2020;10(15):5135. doi: 10.3390/app10155135. [DOI] [Google Scholar]
  • 24.Cabitza F, Rasoini R, Gensini GF. Unintended consequences of machine learning in medicine. JAMA. 2017;318(6):517–518. doi: 10.1001/jama.2017.7797. [DOI] [PubMed] [Google Scholar]
  • 25.Caliskan A, Badem H, Basturk A, YUKSEL M. Diagnosis of the parkinson disease by using deep neural network classifiers. IU-J Electr Electron Eng. 2017;17(2):3311–3318. [Google Scholar]
  • 26.Chaganti SY, Nanda I, Pandi KR, Prudhvith TG, Kumar N (2020) Image classification using SVM and CNN. In: 2020 International Conference on Computer Science, Engineering and Applications (ICCSEA). IEEE, pp 1–5
  • 27.Currie G, Hawk KE, Rohren E, Vial A, Klein R. Machine learning and deep learning in medical imaging: intelligent imaging. J Med imaging radiation Sci. 2019;50(4):477–487. doi: 10.1016/j.jmir.2019.09.005. [DOI] [PubMed] [Google Scholar]
  • 28.De Bruijne M. Machine learning approaches in medical image analysis: from detection to diagnosis. Med Image Anal. 2016;33:94–97. doi: 10.1016/j.media.2016.06.032. [DOI] [PubMed] [Google Scholar]
  • 29.Fatima M, Pasha M. Survey of machine learning algorithms for disease diagnostic. J Intell Learn Syst Appl. 2017;9(01):1. doi: 10.4236/jilsa.2017.91001. [DOI] [Google Scholar]
  • 30.Feng Y, Xiao W, Wu T, Zhang J, Xiang J, Guo H (2021) An automatic identification method for the blink artifacts in the magnetoencephalography with machine learning. Appl Sci 11(5):2415
  • 31.George ST, Subathra MSP, Sairamya NJ, Susmitha L, Premkumar MJ. Classification of epileptic EEG signals using PSO based artificial neural network and tunable-Q wavelet transform. Biocybern Biomed Eng. 2020;40(2):709–728. doi: 10.1016/j.bbe.2020.02.001. [DOI] [Google Scholar]
  • 32.Ghassemi M, Naumann T, Schulam P, Beam AL, Chen IY, Ranganath R (2020) A review of challenges and opportunities in machine learning for health. AMIA Summits on Translational Science Proceedings, 2020, 191. Available at: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7233077/ [PMC free article] [PubMed]
  • 33.Giger ML. Machine learning in medical imaging. J Am Coll Radiol. 2018;15(3):512–520. doi: 10.1016/j.jacr.2017.12.028. [DOI] [PubMed] [Google Scholar]
  • 34.Greenspan H, Van Ginneken B, Summers RM. Guest editorial deep learning in medical imaging: overview and future promise of an exciting new technique. IEEE Trans Med Imaging. 2016;35(5):1153–1159. doi: 10.1109/TMI.2016.2553401. [DOI] [Google Scholar]
  • 35.Grover S, Bhartia S, Yadav A, Seeja KR. Predicting severity of Parkinson’s disease using deep learning. Procedia Comput Sci. 2018;132:1788–1794. doi: 10.1016/j.procs.2018.05.154. [DOI] [Google Scholar]
  • 36.Gupta R, Pandey G, Chaudhary P, Pal SK. Machine learning models for government to predict COVID-19 outbreak. Digit Government: Res Pract. 2020;1(4):1–6. doi: 10.1038/s41746-020-00372-6. [DOI] [Google Scholar]
  • 37.Guyon I, Elisseeff A. An introduction to variable and feature selection. J Mach Learn Res. 2003;3(Mar):1157–1182. [Google Scholar]
  • 38.Hoque MZ, Keskinarkaus A, Nyberg P, Seppänen T. Retinex model based stain normalization technique for whole slide image analysis. Comput Med Imaging Graph. 2021;90:101901. doi: 10.1016/j.compmedimag.2021.101901. [DOI] [PubMed] [Google Scholar]
  • 39.Houssein EH, Emam MM, Ali AA, Suganthan PN. Deep and machine learning techniques for medical imaging-based breast cancer: a comprehensive review. Expert Syst Appl. 2021;167:114161. doi: 10.1016/j.eswa.2020.114161. [DOI] [Google Scholar]
  • 40.Itälinna V (2021) Detecting mild traumatic brain injury from MEG data using normative modelling and machine learning [DOI] [PMC free article] [PubMed]
  • 41.Jaiswal AK, Banka H. Epileptic seizure detection in EEG signal using machine learning techniques. Australas Phys Eng Sci Med. 2018;41(1):81–94. doi: 10.1007/s13246-017-0610-y. [DOI] [PubMed] [Google Scholar]
  • 42.James AP, Thiruvenkadam S, Paul JS, Braun M. Special issue on medical image computing and systems. Inf Fusion. 2014 doi: 10.1016/j.inffus.2013.12.002. [DOI] [Google Scholar]
  • 43.Kaplan A, Cao H, FitzGerald JM, Iannotti N, Yang E, Kocks JW … Mastoridis P (2021) Artificial intelligence/machine learning in respiratory medicine and potential role in asthma and COPD diagnosis. J Allergy Clin Immunol Pract 9(6):2255–2261. 10.1016/j.jaip.2021.02.014 [DOI] [PubMed]
  • 44.Karimi D, Dou H, Warfield SK, Gholipour A. Deep learning with noisy labels: exploring techniques and remedies in medical image analysis. Med Image Anal. 2020;65:101759. doi: 10.1016/j.media.2020.101759. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Kashani MH, Madanipour M, Nikravan M, Asghari P, Mahdipour E. A systematic review of IoT in healthcare: applications, techniques, and trends. J Netw Comput Appl. 2021;192:103164. doi: 10.1016/j.jnca.2021.103164. [DOI] [Google Scholar]
  • 46.Kaur P, Singh G, Kaur P. A review of denoising medical images using machine learning approaches. Curr Med imaging. 2018;14(5):675–685. doi: 10.2174/1573405613666170428154156. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Kedia S, Bhushan M (2022) Prediction of mortality from heart failure using machine learning. In: Proceedings of the 2nd International Conference on Emerging Frontiers in Electrical and Electronic Technologies (ICEFEET), pp 1–6. 10.1109/ICEFEET51821.2022.9848348
  • 48.Khan SI, Qaisar SM, Pachori RB. Automated classification of valvular heart diseases using FBSE-EWT and PSR based geometrical features. Biomed Signal Process Control. 2022;73:103445. doi: 10.1016/j.bspc.2021.103445. [DOI] [Google Scholar]
  • 49.Kitchenham B, Charters S (2007) Guidelines for performing systematic literature reviews in software engineering
  • 50.Kitchenham B, Brereton OP, Budgen D, Turner M, Bailey J, Linkman S. Systematic literature reviews in software engineering–a systematic literature review. ‎Inf Softw Technol. 2009;51(1):7–15. doi: 10.1016/j.infsof.2008.09.009. [DOI] [Google Scholar]
  • 51.Krittanawong C, Johnson KW, Rosenson RS, Wang Z, Aydar M, Baber U … Narayan SM (2019) Deep learning for cardiovascular medicine: a practical primer. Eur Heart J 40(25):2058–2073. 10.1093/eurheartj/ehz056 [DOI] [PMC free article] [PubMed]
  • 52.Kostas D, Pang EW, Rudzicz F. Machine learning for MEG during speech tasks. Sci Rep. 2019;9(1):1–13. doi: 10.1038/s41598-019-38612-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Kwekha-Rashid AS, Abduljabbar HN, Alhayani B (2021) Coronavirus disease (COVID-19) cases analysis using machine-learning applications. Appl Nanosci:1–13. 10.1007/s13204-021-01868-7 [DOI] [PMC free article] [PubMed]
  • 54.Lahmiri S, Shmuel A. Accurate classification of seizure and seizure-free intervals of intracranial EEG signals from epileptic patients. IEEE Trans Instrum Meas. 2019;68(3):791–796. doi: 10.1109/TIM.2018.2855518. [DOI] [Google Scholar]
  • 55.Latif J, Xiao C, Imran A, Tu S (2019) Medical imaging using machine learning and deep learning algorithms: a review. In: 2019 2nd International conference on computing, mathematics and engineering technologies (iCoMET). IEEE, pp 1–5. 10.1109/ICOMET.2019.8673502
  • 56.Lee JG, Jun S, Cho YW, Lee H, Kim GB, Seo JB, Kim N. Deep learning in medical imaging: general overview. Korean J Radiol. 2017;18(4):570–584. doi: 10.3348/kjr.2017.18.4.570. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Ma Y, Xu X, Yu Q, Zhang Y, Li Y, Zhao J, Wang G (2019) LungBRN: a smart digital stethoscope for detecting respiratory disease using bi-resnet deep learning algorithm. In: 2019 IEEE Biomedical Circuits and Systems Conference (BioCAS). IEEE, pp 1–4. 10.1109/BIOCAS.2019.8919021
  • 58.Matsumoto T, Kodera S, Shinohara H, Ieki H, Yamaguchi T, Higashikuni Y … Komuro I (2020) Diagnosing heart failure from chest X-ray images using deep learning. Int Heart J 61(4):781–786. 10.1536/ihj.19-714 [DOI] [PubMed]
  • 59.Nadakinamani RG, Reyana A, Kautish S, Vibith AS, Gupta Y, Abdelwahab SF, Mohamed AW (2022) Clinical data analysis for prediction of cardiovascular disease using machine learning techniques. Comput Intell Neurosci 2022:1–13. 10.1155/2022/2973324 [DOI] [PMC free article] [PubMed] [Retracted]
  • 60.Nagarajan SM, Muthukumaran V, Murugesan R, Joseph RB, Munirathanam M (2021) Feature selection model for healthcare analysis and classification using classifier ensemble technique. Int J Syst Assur Eng Manag:1–12. 10.1007/s13198-021-01126-7
  • 61.Negi A, Kaur K (2017) Method to resolve software product line errors. In: International conference on information, communication and computing technology. Springer, Singapore, pp 258–268
  • 62.Oh SL, Ng EY, Tan S, Acharya UR. Automated diagnosis of arrhythmia using a combination of CNN and LSTM techniques with variable length heart beats. Comput Biol Med. 2018;102:278–287. doi: 10.1016/j.compbiomed.2018.06.002. [DOI] [PubMed] [Google Scholar]
  • 63.Oh SL, Ng EY, Tan S, Acharya UR. Automated beat-wise arrhythmia diagnosis using modified U-net on extended electrocardiographic recordings with heterogeneous arrhythmia types. Comput Biol Med. 2019;105:92–101. doi: 10.1016/j.compbiomed.2018.12.012. [DOI] [PubMed] [Google Scholar]
  • 64.Pal S, Hitchens M, Varadharajan V, Rabehaja T. Policy-based access control for constrained healthcare resources in the context of the internet of things. J Netw Comput Appl. 2019;139:57–74. doi: 10.1016/j.jnca.2019.04.013. [DOI] [Google Scholar]
  • 65.Pal S, Mishra N, Bhushan M, Kholiya PS, Rana M, Negi A (2022) Deep learning techniques for prediction and diagnosis of diabetes mellitus. In: 2022 International Mobile and Embedded Technology Conference (MECON). IEEE, pp 588–593. 10.1109/MECON53876.2022.9752176
  • 66.Pantazis D, Adler A (2021) MEG source localization via deep learning. Sensors 21(13):4278 [DOI] [PMC free article] [PubMed]
  • 67.Pathan S, Bhushan M, Bai A (2020) A study on health care using data mining techniques. J Crit Reviews 7(19):7877–7890. 10.31838/jcr.07.19.896
  • 68.Patidar S, Panigrahi T. Detection of epileptic seizure using Kraskov entropy applied on tunable-Q wavelet transform of EEG signals. Biomed Signal Process Control. 2017;34:74–80. doi: 10.1016/j.bspc.2017.01.001. [DOI] [Google Scholar]
  • 69.Pham L, McLoughlin I, Phan H, Tran M, Nguyen T, Palaniappan R (2020) Robust deep learning framework for predicting respiratory anomalies and diseases. In: 2020 42nd annual international conference of the IEEE Engineering in Medicine & Biology Society (EMBC). IEEE, pp 164–167. 10.1109/EMBC44109.2020.9175704 [DOI] [PubMed]
  • 70.Pirrone D, Weitschek E, Di Paolo P, De Salvo S, De Cola MC. EEG Signal Processing and supervised machine learning to early diagnose Alzheimer’s Disease. Appl Sci. 2022;12(11):5413. doi: 10.3390/app12115413. [DOI] [Google Scholar]
  • 71.Plis SM, Hjelm DR, Salakhutdinov R, Allen EA, Bockholt HJ, Long JD, Calhoun VD. Deep learning for neuroimaging: a validation study. Front NeuroSci. 2014;8:229. doi: 10.3389/fnins.2014.00229. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Poongodi M, Hamdi M, Malviya M, Sharma A, Dhiman G, Vimal S. Diagnosis and combating COVID-19 using wearable Oura smart ring with deep learning methods. Personal Uniquit Comput. 2022;26(1):25–35. doi: 10.1007/s00779-021-01541-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Rai P, Kumar BK, Deekshit VK, Karunasagar I, Karunasagar I. Detection technologies and recent developments in the diagnosis of COVID-19 infection. Appl Microbiol Biotechnol. 2021;105(2):441–455. doi: 10.1007/s00253-020-11061-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Rana M, Bhushan M (2022) Advancements in healthcare services using deep learning techniques. In: 2022 International Mobile and Embedded Technology Conference (MECON). IEEE, pp 157–161. 10.1109/MECON53876.2022.9752020
  • 75.Ray PP, Dash D, De D. Edge computing for internet of things: a survey, e-healthcare case study and future direction. J Netw Comput Appl. 2019;140:1–22. doi: 10.1016/j.jnca.2019.05.005. [DOI] [Google Scholar]
  • 76.Samant P, Agarwal R. Comparative analysis of classification based algorithms for diabetes diagnosis using iris images. J Med Eng Technol. 2018;42(1):35–42. doi: 10.1080/03091902.2017.1412521. [DOI] [PubMed] [Google Scholar]
  • 77.Samant P, Agarwal R. Machine learning techniques for medical diagnosis of diabetes using iris images. Comput Methods Programs Biomed. 2018;157:121–128. doi: 10.1016/j.cmpb.2018.01.004. [DOI] [PubMed] [Google Scholar]
  • 78.Samant P, Agarwal R. Analysis of computational techniques for diabetes diagnosis using the combination of iris-based features and physiological parameters. Neural Comput Appl. 2019;31(12):8441–8453. doi: 10.1007/s00521-019-04551-9. [DOI] [Google Scholar]
  • 79.Samad MD, Wehner GJ, Arbabshirani MR, Jing L, Powell AJ, Geva T … Fornwalt BK (2018) Predicting deterioration of ventricular function in patients with repaired tetralogy of Fallot using machine learning. Eur Heart J Cardiovasc Imaging 19(7):730–738. 10.1093/ehjci/jey003 [DOI] [PMC free article] [PubMed]
  • 80.Saminu S, Xu G, Shuai Z, Abd El Kader I, Jabire AH, Ahmed YK … Ahmad IS (2021) A recent investigation on detection and classification of epileptic seizure techniques using EEG signal. Brain Sci 11(5):668 [DOI] [PMC free article] [PubMed]
  • 81.Schoepf UJ, Zwerner PL, Savino G, Herzog C, Kerl JM, Costello P. Coronary CT angiography. Radiology. 2007;244(1):48–63. doi: 10.1148/radiol.2441052145. [DOI] [PubMed] [Google Scholar]
  • 82.Selvikvåg Lundervold A, Lundervold A (2018) An overview of deep learning in medical imaging focusing on MRI. arXiv e-prints, arXiv-1811. 10.1016/j.zemedi.2018.11.002
  • 83.Shakeel PM, Burhanuddin MA, Desa MI (2020) Automatic lung cancer detection from CT image using improved deep neural network and ensemble classifier. Neural Comput Appl:1–14
  • 84.Shan J, Alam SK, Garra B, Zhang Y, Ahmed T. Computer-aided diagnosis for breast ultrasound using computerized BI-RADS features and machine learning methods. Ultrasound Med Biol. 2016;42(4):980–988. doi: 10.1016/j.ultrasmedbio.2015.11.016. [DOI] [PubMed] [Google Scholar]
  • 85.Sharma M, Pachori RB. A novel approach to detect epileptic seizures using a combination of tunable-Q wavelet transform and fractal dimension. J Mech Med Biology. 2017;17(07):1740003. doi: 10.1142/S0219519417400036. [DOI] [Google Scholar]
  • 86.Shen D, Wu G, Suk HI. Deep learning in medical image analysis. Annu Rev Biomed Eng. 2017;19:221–248. doi: 10.1007/978-3-030-33128-3_1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.Singh SN, Bhushan M (2022) Smart ECG monitoring and analysis system using machine learning. In: Proceedings of the 2022 IEEE VLSI Device Circuit and System (VLSI DCS), pp 304–309. 10.1109/VLSIDCS53788.2022.9811433
  • 88.Singh VK, Kolekar MH. Deep learning empowered COVID-19 diagnosis using chest CT scan images for collaborative edge-cloud computing platform. Multimed Tools Appl. 2022;81:3–30. doi: 10.1007/s11042-021-11158-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89.Singh VJ, Bhushan M, Kumar V, Bansal KL (2015) Optimization of segment size assuring application perceived QoS in healthcare. In: Proceedings of the world congress on engineering, vol 1, pp 1–3
  • 90.Spadarella G, Perillo T, Ugga L, Cuocolo R (2022) Radiomics in cardiovascular disease imaging: from pixels to the heart of the problem. Curr Cardiovasc Imaging Rep:1–11. 10.1007/s12410-022-09563-z
  • 91.Strijkers GJ, Drost MR, Nicolay K (2011) Diffusion MRI: theory, methods, and applications. Diffusion imaging in muscle, pp 672–689
  • 92.Sun J, Cao R, Zhou M, Hussain W, Wang B, Xue J, Xiang J. A hybrid deep neural network for classification of schizophrenia using EEG Data. Sci Rep. 2021;11(1):1–16. [Google Scholar]
  • 93.Sworna NS, Islam AM, Shatabda S, Islam S. Towards development of IoT-ML driven healthcare systems: a survey. J Netw Comput Appl. 2021;196:103244. doi: 10.1016/j.jnca.2021.103244. [DOI] [Google Scholar]
  • 94.Tajbakhsh N, Shin JY, Gurudu SR, Hurst RT, Kendall CB, Gotway MB, Liang J. Convolutional neural networks for medical image analysis: full training or fine tuning? IEEE Trans Med Imaging. 2016;35(5):1299–1312. doi: 10.1109/TMI.2016.2535302. [DOI] [PubMed] [Google Scholar]
  • 95.Tzimourta KD, Tzallas AT, Giannakeas N, Astrakas LG, Tsalikakis DG, Angelidis P, Tsipouras MG. A robust methodology for classification of epileptic seizures in EEG signals. Health Technol. 2019;9(2):135–142. doi: 10.1007/s12553-018-0265-z. [DOI] [Google Scholar]
  • 96.Vamathevan J, Clark D, Czodrowski P, Dunham I, Ferran E, Lee G … Zhao S (2019) Applications of machine learning in drug discovery and development. Nat Rev Drug Discov 18(6):463–477. 10.1038/s41573-019-0024-5 [DOI] [PMC free article] [PubMed]
  • 97.Virmani J, Agarwal R. Deep feature extraction and classification of breast ultrasound images. Multimedia Tools Appl. 2020;79(37):27257–27292. doi: 10.1007/s11042-020-09337-z. [DOI] [Google Scholar]
  • 98.Virmani J, Agarwal R. A review of Segmentation Algorithms Applied to B-Mode breast ultrasound images: a characterization Approach. Arch Comput Methods Eng. 2021;28(4):2567–2606. doi: 10.1007/s11831-020-09469-3. [DOI] [Google Scholar]
  • 99.Wang X, Gong G, Li N. Automated recognition of epileptic EEG states using a combination of symlet wavelet processing, gradient boosting machine, and grid search optimizer. Sensors. 2019;19(2):219. doi: 10.3390/s19020219. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 100.Wang Y, Choi EJ, Choi Y, Zhang H, Jin GY, Ko SB. Breast cancer classification in automated breast ultrasound using multiview convolutional neural networks with transfer learning. Ultrasound Med Biol. 2020;46(5):1119–1132. doi: 10.1016/j.ultrasmedbio.2020.01.001. [DOI] [PubMed] [Google Scholar]
  • 101.Wani SM, Sabut S, Nalbalwar SL (2019) Detection of epileptic seizure using wavelet transform and neural network classifier. In: Computing, communication and signal processing. Springer, Singapore, pp 739–747
  • 102.Widodo CE, Adi K, Gernowo R (2020) Medical image processing using python and open cv. In: Journal of Physics: Conference Series, vol 1524, no 1. IOP Publishing, pp 012003. 10.1088/1742-6596/1524/1/012003
  • 103.Willemink MJ, Koszek WA, Hardell C, Wu J, Fleischmann D, Harvey H … Lungren MP (2020) Preparing medical imaging data for machine learning. Radiology 295(1):4–15. 10.1148/radiol.2020192224 [DOI] [PMC free article] [PubMed]
  • 104.Xu Y, Mo T, Feng Q, Zhong P, Lai M, Eric I, Chang C (2014) Deep learning of feature representation with multiple instance learning for medical image analysis. In: 2014 IEEE international conference on acoustics, speech and signal processing (ICASSP). IEEE, pp 1626–1630. 10.1109/ICASSP.2014.6853873
  • 105.Yanowitz FG (2012) Introduction to ECG interpretation. LDS Hospital and Intermountain Medical Center
  • 106.Zubarev I, Zetter R, Halme HL, Parkkonen L. Adaptive neural network classifier for decoding MEG signals. NeuroImage. 2019;197:425–434. doi: 10.1016/j.neuroimage.2019.04.068. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

Data sharing is applicable to this article as MRI datasets was analyzed during the current study.


Articles from Multimedia Tools and Applications are provided here courtesy of Nature Publishing Group

RESOURCES