Skip to main content
Computational and Mathematical Methods in Medicine logoLink to Computational and Mathematical Methods in Medicine
. 2022 Nov 16;2022:9410222. doi: 10.1155/2022/9410222

Hybrid Diagnosis Models for Autism Patients Based on Medical and Sociodemographic Features Using Machine Learning and Multicriteria Decision-Making (MCDM) Techniques: An Evaluation and Benchmarking Framework

M E Alqaysi 1,2, A S Albahri 1,, Rula A Hamid 3
PMCID: PMC9683965  PMID: 36439957

Abstract

Method

The three-phase framework integrated the MCDM and ML to develop the diagnosis models and evaluate and benchmark the best. Firstly, the new ASD-dataset-combined medical tests and sociodemographic characteristic features is identified and preprocessed. Secondly, developing the hybrid diagnosis models using the intersection process between three FS techniques and five ML algorithms introduces 15 models. The selected medical tests and sociodemographic features from each FS technique are weighted before feeding the five ML algorithms using the fuzzy-weighted zero-inconsistency (FWZIC) method based on four psychiatry experts. Thirdly, (i) formulate a dynamic decision matrix for all developed models based on seven evaluation metrics, including classification accuracy, precision, F1 score, recall, test time, train time, and AUC. (ii) The fuzzy decision by opinion score method (FDOSM) is used to evaluate and benchmark the 15 models concerning the seven evaluation metrics.

Results

Results reveal that (i) the three FS techniques have obtained a size different from the others in the number of the selected features; the sets were 39, 38, and 41 out of 48 features. Each set has its weights constructed by FWIZC. Considered sociodemographic features have been mostly selected more than medical tests within FS techniques. (ii) The first three best hybrid models were “ReF-decision tree,” “IG-decision tree,” and “Chi2-decision tree,” with score values 0.15714, 0.17539, and 0.29444. The best diagnosis model (ReF-decision tree) has obtained 0.4190, 0.0030, 0.9946, 0.9902, 0.9902, 0.9902, 0.9902, and 0.9951 for the C1=train time, C2=test time, C3=AUC, C4=CA, C5=F1 score, C6=precision, and C7=recall, respectively. The developed framework would be beneficial in advancing, accelerating, and selecting diagnosis tools in therapy with ASD. The selected model can identify severity as light, medium, or intense based on medical tests and sociodemographic weighted features.

1. Introduction

Autism spectrum disorder (ASD) is a neurodevelopmental disorder that impairs an individual's social, communication, and learning abilities [1], in addition to restriction, repetitive patterns of behavior, interests, or activities. This disease begins in childhood and may last for life. Many children suffer from this disease, which greatly affects their behavior [2]. ASD affects families in terms of the pressure parents are exposed to their son, whether psychological or material (money). In contrast, the cost of treatment for this disease is expensive. Leo Kanner described autism disease for the first time in 1943, attempting to comprehend the association between autism and sociodemographic characteristics, including socioeconomic class, sex, maternal education, age, and race [3]. Every year, the World Health Organization (WHO) diagnoses autism globally in one out of every 160 children [4, 5].

The introduction of the presented study has discussed five important questions and provided the appropriate answers.

The first question is, “What are ASD diagnoses and symptoms?”

Psychiatric diseases are considered one of the most difficult types in the diagnosis process due to the overlap in symptoms resulting from the lack of experience and experts in this field. Nonetheless, doctors and medical personnel regard the diagnosis of autism in children in their first two years to be a difficult undertaking. Although there are several clinical tests for the early detection of ASD, they are complex diagnostics that are rarely utilized unless there is a considerable risk of developing ASD [6]. In contrast, the diagnosis of autism can occur at any age of the patient. Early detection helps to recover faster and significantly reduces the consumption of resources such as time and money [7]. One of the challenges researchers face is consuming time and money in diagnosing autism. The symptoms of autism differ from one patient to another. In addition, the disease's severity is different [8]. Clinical symptoms are different, such as fluttering, isolation of the patient from others, and lack of speech and interaction.

On the other hand, these symptoms can be observed through the Autism Diagnostic Observation Schedule (ADOS) and Autism Diagnostic Interview-Revised (ADI-R) questionnaires related to the behavior and action of the child [9, 10]. As mentioned above, there is an essential need for a method or tool that contributes to the autistic diagnosis process. So, the second question should be discussed: “How can ML and AI techniques benefit the ASD diagnosis process?”

ML and AI techniques play an important role in the diagnosis process contributing to early ASD diagnosis and providing excellent support for controlling and treatment [1113]. In addition, after considerable advancements in computer science and information technology, ML is being used to detect and assess a variety of illnesses, such as lung cancer, hepatitis, heart disease, COVID-19, and diabetes [1419]. ML models classify and predict various medical fields effectively. Several methods of ML have also been used for the diagnosis of ASD, such as random forest (RF), naive Bayes, and K-nearest neighbor (KNN), and deep learning, such as methods of recurrent neural network (RNN) and convolutional neural network (CNN). However, researchers face a lack of accuracy in diagnosing autism and nonoptimal data selection in ASD diagnosis regarding the affected features. Therefore, ML and AI techniques should continue to make more contributions to diagnosing autism based on the new datasets adopted in this research path. The third question must be presented: “What are the research directions for ASD diagnosis in the literature review based on AI and ML?”

Various trends have arisen in recognizing, diagnosing, and evaluating autism using AI and ML. First, magnetic resonance imaging (MRI) is a cross-sectional scan of the brain and a medical imaging technique that reveals pathological alterations in live tissues [20]. MRI is involved in diagnosing many wide diseases. Despite its effectiveness in the diagnosis process, it needs financial resources because of the high price of devices and manufacturing. In addition, not all hospitals have MRI devices, especially in remote and rural areas. Second, the electroencephalogram (EEG) employs small metal discs (electrodes) implanted in the scalp; this test monitors the electrical activity in the brain. Poor spatial resolution is the primary drawback of EEG recordings [21, 22]. In addition, it does not provide the maximum diagnostic accuracy for ASD. Besides, this path requires a specialist doctor with a long experience to give a correct result in the diagnosis process. Third, sociodemographic diagnosis is based on sociodemographic features (i.e., sex, age, and race) and can depend on ADOS and ADI-R that notes the behavior of the patient [5, 9, 10].

In conclusion, each diagnostic approach has limits concerning the employed diagnostic characteristics. Each technique alone does not give a reliable diagnosis procedure. These instructions could not have been carried out if the right diagnostic procedure had focused on certain characteristics while ignoring others. Accordingly, medical tests have a role in the diagnosis of utmost diseases. The integration of medical tests and sociodemographic features should be considered in the diagnosis process. Despite the above, little attention has been given to medical tests among sociodemographic features for ASD diagnosis in any research direction. Here, the fourth important question must be discussed: “What is the current scenario of literature for the diagnosis of ASD using sociodemographic and medical test features?” It needs to be further answered.

In the study of [6], authors utilized early detection ASD datasets of different stages of life (toddler, child, adolescent, and adult) and had used different feature selection (FS) such as correlation feature selection, gain ratio, information gain (IG), and ReliefF (ReF). In addition, they utilized different ML like decision tree, support vector machine (SVM), and AdaBoost. The features of a dataset are based on sociodemographics and use of feature transformation (Logarithmic, ZScore, Sine) then evaluation by various metrics such as classification accuracy (CA), sensitivity, specificity, area under curve (AUC), Kappa statistics, and Logloss. In the study by [23], detection of ASD was attempted using ML and deep learning techniques such as logistic regression (LR), SVM, naive Bayes, KNN, ANN, and CNN. Also, the features of the dataset are based on sociodemographics only. The applied metrics are CA, specificity, and sensitivity to evaluate the developed model. In the study [24], the adultization of different ML included AdaBoost, KNN, and ID3 with FS techniques such as correlation feature selection, gain index, IG, fast correlated-based filter, and Chi-Squared (Chi2). They were then evaluated by metrics: CA, specificity, sensitivity, and AUC. The authors in [25] deal with the data imbalance technique applied to the demographic ASD dataset using naive Bayes, decision tree (c4.5), RIPPER, and RF. Also, the study used methods such as the synthetic minority oversampling technique (SMOTE), random oversampling (ROS), and random undersampling (RUS) to achieve data balance and were evaluated by metrics such as specificity, sensitivity, Matthews' correlation coefficient, F1 score, false positive rate, precision, and AUC. In [26], diagnosis of ASD based on resampling techniques methods of resampling techniques to a normal distribution of ASD data improved accuracy in the prediction of autism and avoids the problem of data heterogeneity. The authors used naive Bayes, and RF with SMOTE, ROS, and RUS to achieve data balance then evaluated by CA, specificity, sensitivity, and receiver operating characteristics (ROC). In the study of [27], children between the ages of 4 and 11 were diagnosed with ASD using the categorization approach with 19 sociodemographic features. For classification, the linear discriminant analysis (LDA) and KNN algorithms are employed then evaluated by metrics such as CA, F1 score, and precision. The authors of [28], dealt with the diagnosis and prediction of autism using decision tree algorithm based on medical and family characteristics, therefore facilitating access to ASD knowledge and supporting professionals and physicians in their clinical decisions by An Ontology-Driven Decision Support for Autism Diagnosis and Treatment, and were evaluated by various metrics such as CA, specificity, and sensitivity. The data attributes are categorized under 13 categories: (1) diagnostic history, (2) review of systems, (3) prenatal/early postnatal history, (4) pulmonary, (5) developmental history, (6) hematologic, (7) endocrine/metabolic, (8) cardiovascular, (9) gastrointestinal, (10) current medications, (11) mental health, (12) genetic, and (13) immunologic.

The above literature shows a variance in feature selection techniques, machine learning algorithms, and performance evaluation metrics. In addition, selecting the developed optimal model for accurate ASD diagnosis is challenging. However, no study has been presented for evaluating and benchmarking the developed hybrid diagnosis models for selecting the best one, which is the study's aim. There are three main issues facing this aim. The first issue concerns the importance of ASD features, especially since most literature studies have not elaborated on important features that affect model classification. In light of whether the features are highly relevant or less, the second issue, evaluation metrics, is faced. In other words, the studies demonstrate diversity in evaluating the model performance by using some metrics as criteria and ignoring others. Their assessment is varied for designing and implementing an accurate diagnosis models. Accordingly, the evaluation metrics of the classification models are still comparative. They overlap with other models for multievaluation criteria, trade-offs, and criteria importance categorized under complex multicriteria decision-making (MCDM) problems. For the third issue about dataset availability, perhaps the most important challenge faced by most researchers is the lack of special integrated sociodemographic and medical test features in providing an efficient model for diagnosing autism. Besides, the number of ASD features used for diagnosing autism in the literature varies, and there is no precise justification for using some features and neglecting others. Therefore, the presented study used an integrated ASD dataset with sociodemographic and medical test features.

The last question to be discussed is, “What is the useful solution to select the best diagnosis model of ASD by integrating medical tests and sociodemographic features?”

The FS process benefits appear in selecting the ML model, which gives high diagnostic accuracy. The development of the diagnosis ML model concerning the selection of ASD features plays an important role in choosing the optimal special diagnosis model based on the approach used or the techniques. In the process of identifying features' importance, each of the features has a different significance. Accordingly, the filter approach performs the FS step as preprocessing before the learning step without involving a learning algorithm. The filter is independent of the learning algorithm and relies on underlying attributes of data [29]. In addition, popular ML algorithms can enhance the diagnosis of ASD and can match the new hybrid diagnosis model using exhaustive and best-researched algorithms. These algorithms include decision tree [30], naive Bayes [31], KNN [32], SVM [33], and AdaBoost [34]. The algorithms used are very realizable due to their great precision and adaptability for obtaining superior outcomes.

On the other hand, MCDM is defined as “an extension of decision theory that encompasses all decisions with numerous objectives. A technique for evaluating options based on distinct, sometimes contradictory criteria and merging them into a single overall evaluation” [35, 36]. MCDM is an umbrella term for a collection of formal techniques that strive to explicitly account for many factors when assisting individuals or groups in evaluating important decisions [3739]. Numerous subjective weighting methods have been proposed; however, when it comes to weighting criteria, the analytic hierarchy process (AHP) [4044] and best-worst method (BWM) [45, 46] methods have a high success rate. Nonetheless, the inconsistency issue of their weighing techniques has been addressed [17, 37, 4751]. Therefore, the fuzzy-weighted with zero-inconsistency (FWZIC) method has been introduced [52]. FWZIC can assign weights for each set of medical tests and sociodemographic features resulting in each FS with zero inconsistencies regardless of the number of features. FWZIC computes and calculates the weight coefficient values of each feature separately and accurately to attain zero consistency. Compared to zero pairwise comparisons, FWZIC eliminates the potential for mistakes. Recently, the FWZIC method acquired attention and has been used in several studies [13, 5356]. FWZIC method can process zero inconsistency. In addition, other MCDM methods can process the ranking issues using the fuzzy decision by opinion score method (FDOSM). This method is utilized for selecting the best rank (best solution). FDOSM utilized an ideal/optimal solution concept, eliminated inconsistency and two preferences, decreased the number of comparisons, provided fair and implicit comparisons, and needed fewer mathematical operations. In addition, it addressed the normalization and weight concerns that plagued MCDM techniques. FDOSM attempts to deal with ambiguous and fuzzy data by employing triangular fuzzy numbers (TFNs). The FDOSM technique offered a mathematical model to address MCDM issues involving a single context of decision-making followed by a group context of decision-making and has been used in [49, 54, 57].

This research paper presents a clear conception of the diagnosis of autism. This study led to a solution to the research gap for ASD diagnosis to present a dataset of medical tests integrated with sociodemographic features. In this study, the combination of medical tests and the sociodemographic behavior of the patient give a strong solution to increase the diagnosis procedure. The main objective is to develop a new framework for selecting the optimal diagnostic model capable of identifying autism severity levels such as light, medium, or intense. In this regard, the study contributions can be summarized in the following points:

  • (1)
    Develop hybrid diagnosis models for ASD patients based on medical tests and sociodemographic characteristic features by
    1. Intersection process between three FS techniques and five ML algorithms
    2. Construct weights for each set of FS techniques based on specialized psychiatry experts using the FWIZC method
    3. Develop 15 hybrid diagnosis models based on the weighted dataset
  • (2)

    Develop an MCDM framework to evaluate and benchmark the 15 hybrid diagnosis models using the FDOSM method

2. Research Methodology

The research methodology can discuss the direction of the study in three phases. Firstly, the data identification and preprocessing, after that, the second phase is the development of hybrid diagnosis Models. Finally, the third phase is the evaluation and benchmarking framework. Figure 1 illustrates the methodology of the study.

Figure 1.

Figure 1

Research methodology of evaluation and benchmarking ASD diagnosis models.

2.1. Phase 1: Data Identification and Preprocessing

The data obtained is real data from a diploma study at the Informatics Institute for Postgraduate Studies (IIPS). These data consist of 49 sociodemographic and medical test features and 538 patients. Besides, the “severity” feature is considered the class that includes three categories of labels: light, medium, and intense. The features are described in Table 1.

Table 1.

Description of the real ASD dataset.

No. Feature name Medical tests Sociodemographic Range Data type
1 Sex Male, female Categorical data
2 The blood type of the patient A-, A+, B-, B+, O-, O+, AB-, AB+ Categorical data
3 The blood type of the mother A-, A+, B-, B+, O-, O+, AB-, AB+ Categorical data
4 The blood type of the father A-, A+, B-, B+, O-, O+, AB-, AB+ Categorical data
5 Relative relation No, yes Categorical data
6 Toxoplasmosis No, yes Categorical data
7 Unnatural medicines for mother No, yes Categorical data
8 Folic acid for mother No, yes Categorical data
9 Complications of childbirth for the mother No, yes Categorical data
10 Premature baby No, yes Categorical data
11 Jaundice No, yes Categorical data
12 Smell the food No, yes Categorical data
13 Taste the food No, yes Categorical data
14 He is afraid of loud sounds No, yes Categorical data
15 Degree 609-85 Numerical data
16 Crying for no reason No, yes Categorical data
17 Kisses with a sound No, yes Categorical data
18 Escaping home when doors are open No, yes Categorical data
19 Notice the sound of the bell No, yes Categorical data
20 Diapers No, yes Categorical data
21 Bathroom skills No, yes Categorical data
22 Responds when parents call by name No, yes Categorical data
23 Mind wandering No, yes Categorical data
24 Vitamin D3 2.90-102.1 Numerical data
25 Vitamin B12 0.01-2050 Numerical data
26 Vitamin zinc 0.9-292 Numerical data
27 Marital relationship for parents Not good, yes, separate, dead Categorical data
28 Blood match No, yes Categorical data
29 Maternal diseases during pregnancy No, yes Categorical data
30 Complications of childbirth for the child No, yes Categorical data
31 Chewing food No, yes Categorical data
32 Annoying from clothing tag No, yes Categorical data
33 Waves No, yes Categorical data
34 Patient moving at home No, yes Categorical data
35 Patient moves around itself No, yes Categorical data
36 Carry out orders No, yes Categorical data
37 Laughing for no reason No, yes Categorical data
38 Play with children No, yes Categorical data
39 Is there a language now? No, yes Categorical data
40 Pointing with the index finger No, yes Categorical data
41 Notice his name No, yes Categorical data
42 Arrange things in one row No, yes Categorical data
43 Nodded Previously, no, yes Categorical data
44 The age difference between the parents 1-28 Numerical data
45 Duration of premature baby 0-39 Numerical data
46 He plays with circle things Previously, no, yes Categorical data
47 Father age 22-83 Numerical data
48 Mother age 16-79 Numerical data
49 Severity (class) Light, medium, intense Categorical data

2.1.1. Data Coding and Cleaning

For any data, removing any unknown symbols or outliers should be addressed. Therefore, converting text or string data to numeric data must be achieved due to the ML method dealing with numeric data. In the ASD dataset, data cleaning eliminates all unnecessary symbols such as “?”, “/”, and “-”.

2.1.2. Imputing Missing Values

The used ASD dataset contains some missing values. Figure 2 shows the percentage of missing values. Several methods can be used for filling in missing values and manipulating them, such as model-based imputer (simple tree), distinct value, a random value, or mean, which is the most frequently used for handling this type of data using Equation (1). Then, the dataset should be normalized because the data have different scales, as presented in the next section.

Mean=sumzicountzi. (1)
Figure 2.

Figure 2

Percentage of missing values in the ASD dataset.

2.1.3. Dataset Normalization

Normalization is an operation that either modifies or rescales raw data such that each characteristic contributes uniformly. It addresses two primary data concerns that impede the learning process of ML algorithms: the existence of dominating features and outliers since the dataset has a different scale that can affect the model's process. This study used the min–max normalization approach to the ASD dataset, as seen in Equation (2).

x=xminxmaxxminx. (2)

2.1.4. Data Imbalance

One important thing that some researchers overlook is data asymmetry. Consequently, models are biased, and accuracy can no longer be used to measure integrity. There are three classes of health conditions in the used dataset, as shown in Figure 3. Class (1) has 259 instances as “medium,” class (0) has 241 instances as “light,” and class (2) has 38 instances as “intense.”

Figure 3.

Figure 3

Data imbalance of the ASD dataset.

An imbalance can be noticed in the used dataset, which can minimize the diagnosis process's accuracy. The SMOTE method commands resampling techniques utilized in ML to balance data based on the target class. In this context, the developed ML models can achieve high accuracy in classification and give a perception closer to reality [58]. SMOTE, a frequent oversampling technique, produces “synthetic” observations in the sample rather than duplicating data. This technique leverages the K-nearest neighbors of an observation to generate random synthetic observations [25]. At this step, the preprocessing stages have been stated and prepared for the ASD dataset to develop the hybrid models as presented in the next phase.

2.2. Phase 2: Development of Hybrid Diagnosis Models

This section addresses the stages of developing the hybrid diagnostic models for ASD.

2.2.1. FS Approaches

FS approaches ease significant concerns in classification procedures as they enhance classification accuracy, reduce data dimensionality, and remove unnecessary data. Figure 4 shows three filter approach methods: Chi2, IG, and ReF.

Figure 4.

Figure 4

The filter approach.

Each method of feature selection obtains a size different from the others. Furthermore, FS is considered essential in ML but does not always produce precision results due to not depending on expert judgment opinion. This stage chooses pertinent sociodemographic and medical tests, considers the class-labeled dataset, and scores these features based on their association with the class. Expert opinion plays an important role in the process of determining the importance of each feature. So that the importance of the influencer gives the subject a link from the feature that is irrelevant or has little influence on it to the feature that has very important; therefore, a modern MCDM method should be used for weights based on experts to overcome the above purpose. FWIZC method can handle this purpose, as presented in the next section.

2.2.2. FWZIC

FWZIC is one MCDM method that needs to be used for weighting the features resulting from FS techniques (Chi2, IG, and ReF). Figure 5 illustrates the steps of FWZIC through five essential processes that need to be applied for each set of medical tests and sociodemographic features resulting from three FS techniques [59]. The five steps are illustrated below.

Figure 5.

Figure 5

FWZIC methodology of ASD dataset.

Step 1 . —

Establish the set of evaluation features: the predetermined set of assessment features of ASD is examined and presented in the first step.

Step 2 . —

Structured expert judgment (SEJ): the identification and selection of expert team members from relevant fields of medicine (psychiatrists) are performed. Then, selection and nomination will commence. The SEJ panel has been formed. Table 2 depicts the conversion of the linguistic scale to the corresponding numerical scale, which followed the development of an evaluation form to capture the consensus of all SEJ team members for each medical test and sociodemographic feature. A panel of four experts assesses the features subjectively, as illustrated in the following step.

Table 2.

Five-point Likert scale and equivalent numerical scale.

Linguistic terms Numerical scoring scale
Not important 1
Slight important 2
Moderately important 3
Important 4
Very important 5
Step 3 . —

Building the expert decision matrix (EDM): the preceding stage defines the list of selected experts and each expert's choice within a specific feature. This stage builds the EDM. As stated in Table 3, the primary components of the EDM are the alternatives and decision criteria. Each criterion (Cj) in the attribute (represents the patient's features) crossovers with each selective expert (Ei) (represents the psychiatrist (who has evaluated the appropriate degree of relevance for each feature.

Table 3.

EDM.

Criteria/experts C1 C2 Cn
E1 Imp E1C1 Imp E1C2 Imp E1Cn
E2 Imp E2C1 Imp E2C2 Imp E2Cn
E3 Imp E3C1 Imp E3C2 Imp E3Cn
...
Em Imp EnC1 Imp EnC2 Imp EmCn

∗∗Imp represents the importance level.

Step 4 . —

Application of a fuzzy membership function: the fuzzy membership function and accompanying defuzzification procedure are used for the EDM's data to improve the data's accuracy and usability for future analysis. However, in MCDM, the problem is ambiguous and imprecise since giving a specific preference rate to each criterion is impossible. “The benefit of employing the fuzzy technique is the use of fuzzy numbers rather than exact numbers to calculate the relative value of the feature (criteria) to handle situations that are imprecise and ambiguous” [6062]. In fuzzy MCDM, triangular fuzzy numbers (TFNs) are the most prevalent sort of fuzzy numbers. A = (a.b.c) is used to signify TFNs. Due to their conceptual and computational simplicity, they are often utilized in real applications [63], as seen by the triangle membership in Figure 6.

Figure 6.

Figure 6

Membership of TFNs.

The membership function (x) of TFN A is given by

μAx=0,if x<a,xaba,if axb,cxcb,if bxc,0,if x>c,where abc. (3)
Remark 1 . —

Let x~=a1,b1,c1 andy~=a2,b2,c2 be two nonnegative TFNs and ∈ℝ+. Following the extension principle, the arithmetic operations are defined as follows:

x~+y~=a1+a2,b1+b2,c1+c2,x~y~=a1c2,b1b2,c1a2,αx~=αa1,αb1,αc1,x~11c1,1b1,1a1,x~×y~a1a2,b1b2,c1c2,x~y~a1c2,b1b2,c1a2. (4)

The value of each Numerical term with TFN is shown in Table 4.

Table 4.

Numerical terms and their equivalent TFNs.

Numerical scoring scale TFNs
1 (0.00, 0.10, 0.30)
2 (0.10, 0.30, 0.50)
3 (0.30, 0.50, 0.75)
4 (0.50, 0.75, 0.90)
5 (0.75, 0.90, 1.00)

Table 4 indicates that all linguistic variables be transformed to TFNs, supposing that the fuzzy number is the variable for each expert N feature (criteria). In other words, in psychiatry, expert N was tasked with identifying the critical degree of the assessment features (medical tests and sociodemographic) inside variables assessed using language variables.

  1. By using Equation (5), the ratio of fuzzification data is determined. As demonstrated in Table 5, the preceding equations are employed with TFNs [63].

ImpE1/C1~j=1nImpE1/C1j, (5)

where ImpE1/C1~ represent the fuzzy number of Imp (E1/C1).

  • (2) To determine the final fuzzy values of the weight coefficients of the evaluation feature (criterion) w1~,w2~,,wn~T, the mean values are determined. The fuzzy EDM EDM~ is utilized to calculate the final weight value of each feature (criterion) using Equation (6).

w~j=i=1mImpElj~/Cljj=1nImpElj~/Clj/m,for i=1,2,3,..m and j=1,2,3,..n. (6)
  • (3) Defuzzification to find the final weight: the centroid approach is the most prevalent defuzzification technique. Using TFNs, the mathematical expression for this procedure is (a + b + c)/3. Before computing the final values of the weight coefficients, the weight of importance should be allocated to each feature (criterion) based on the total weights of all features (criteria) for the rescaling purpose used in this step

Table 5.

Fuzzy EDM (EDM~) [63].

Experts Criteria
C1~ C2~ Cn~
E1 ImpE1/C1~j=1nImpE1/C1j~ ImpE1/C2~j=1nImpE1/C1j~ ImpE1/Cn~j=1nImpE1/C1j~
E2 ImpE2/C1~j=1nImpE2/C2j~ ImpE2/C2~j=1nImpE2/C2j ImpE2/Cn~j=1nImpE2/C2j
E3 ImpE3/C1~j=1nImpE3/C3j~ ImpE3/C2~j=1nImpE3/C3j~ ImpE3/Cn~j=1nImpE3/C3j~
E4 ImpE4/C1~j=1nImpE4/C4j~ ImpE4/C2~j=1nImpE4/C4j~ ImpE4/Cn~j=1nImpE4/Cnj~
Step 5 . —

Computation of the final values of the weight coefficients of the evaluation criteria: in this stage, the final values of the weight coefficients for the evaluation feature (criteria) (w1, w2, ⋯,w48)T that represented (C1=sex, C2=the blood type of the patient…. C48=mother age) are determined using the fuzzy data for the criterion from the previous step.

All five steps must be applied for each result of the FS technique. Besides, the sum of the weight must be equal to one. At this point and after calculating the weights for selected features (criteria), the constructed weights are distributed among the balanced ASD dataset for each FS technique value. Therefore, each weight generated must be multiplied by its fit data by using the following:

ASD=i=1mBiWiXi+ε, (7)

where Bi: estimation parameter for feature i, Wi: weight of feature i, and ε: error of estimation. After completing the process of Equation (7), the result is to produce a new weighted dataset for each FS technique that needs to be applied to the ML model in the next section.

2.2.3. Construction of Hybrid ML Models

This section builds hybrid diagnosis models based on the intersection of five supervised ML algorithms and three FS techniques, as shown in Figure 7. The hybrid diagnostic models must be used for training and testing by combining ML algorithms with the FS techniques established in the previous section (weighted datasets). The five ML algorithms in our trials as possibly viable methods to enhance the diagnosis of ASD and to match the new hybrid diagnosis model using exhaustive and best-researched algorithms. The utilized ML algorithms are as follows: (1) decision tree, (2) naive Bayes, (3) KNN, (4) SVM, and (5) AdaBoost. The results of the intersection process introduced 15 hybrid diagnosis models. All the hybrid models need to be evaluated for their performance metrics, as explained in the next stage.

Figure 7.

Figure 7

The intersection process for the developed hybrid diagnosis models.

2.2.4. Evaluation Criteria for the Hybrid Models

Measuring performance is essential for determining how effectively hybrid diagnosis models fulfill the objective. The performance of the 15 hybrid diagnosis models must be examined using five performance-evaluation metrics on the tested ASD datasets. Including CA, precision, F1 score, recall, and AUC. The metric criteria are defined and presented as follows:

  1. CA: this is the commonly used metric for evaluating classification models; it quantifies the degree of closeness to the real value. Accuracy is computed by

CA=TP+TNTP+FP+FN+TN. (8)
  • (2) Sensitivity (TPR/recall): the number of successfully identified labels from all the positive representations. It might be viewed as the capacity of a test to distinguish people with a condition properly. Sensitivity is computed using this method:

Recall=TPTP+FN. (9)
  • (3) Precision: it is the proportion of properly identified samples among all detected samples. It evaluates the classifier's capacity to exclude irrelevant topics. Precision is computed by

Precision=TPTP+FP. (10)
  • (4) F1 score: it is the weighted average of recall and precision. The best F1 score value is 1, while the poorest one is 0. The contribution of precision and recall to the F1 score is equivalent. The F1 score is computed with the following:

F_score=2TP2TP+FP+FN. (11)
  • (5) AUC: the associated ROC curve is used to evaluate the classification model's performance at different threshold settings. The AUC displays the model's performance by differentiating between classes (i.e., a degree of separability). A greater AUC is preferable. With a higher AUC, the model can identify ASD samples with light, moderate, and intense severity

  • (6) Training time: it means the time the model takes to train the detection of ASD. The lower, the better, and vice versa

  • (7) Time testing: it means the time of model takes to test the process: the lower, the better, and vice versa

The developed models must be benchmarked to select the best one based on the five performance-evaluation metrics (criteria). Therefore, a new decision matrix needs to be developed for this purpose. In addition, another MCDM method (FDSOM) needs to be used to evaluate and benchmark all developed diagnosis models using the developed decision matrix, as explained in the next section.

2.3. Phase 3: Evaluation and Benchmarking Framework

This stage covers the development framework for evaluating and benchmarking the 15 hybrid ASD diagnostic models based on MCDM approaches. The first part covers the developed decision matrix (DM), while the second part explains the FDOSM method steps.

2.3.1. DM

This section explains the developed dynamic DM used to evaluate and benchmark hybrid diagnosis models. DM is the most important aspect of the assessment and benchmarking technique [48, 54, 6467]. The primary components of decision-making are choice criteria and alternatives. The evaluation criteria represent the metrics used to benchmark the 15 hybrid diagnostic models (representing the alternatives). The processes taken to construct the DM are detailed in Table 6.

Table 6.

DM.

Alternatives/criteria Performance evaluation metric criteria
Hybrid diagnosis models C1 C2 C3 C4 C5 C6 C7
A1 ReF-decision tree C1-A1 C2-A1 C3-A1 C4-A1 C5-A1 C6-A1 C7-A1
A2 ReF-SVM C1-A2 C2-A2 C3-A2 C4-A2 C5-A2 C6-A2 C7-A2
A3 ReF-naive Bayes C1-A3 C2-A3 C3-A3 C4-A3 C5-A3 C6-A3 C7-A3
A4 ReF-KNN C1-A4 C2-A4 C3-A4 C4-A4 C5-A4 C6-A4 C7-A4
A5 ReF-AdaBoost C1-A5 C2-A5 C3-A5 C4-A5 C5-A5 C6-A5 C7-A5
A6 IG-decision tree C1-A6 C2-A6 C3-A6 C4-A6 C5-A6 C6-A6 C7-A6
A7 IG-SVM C1-A7 C2-A7 C3-A7 C4-A7 C5-A7 C6-A7 C7-A7
A8 IG-naive Bayes C1-A8 C2-A8 C3-A8 C4-A8 C5-A8 C6-A8 C7-A8
A9 IG-KNN C1-A9 C2-A9 C3-A9 C4-A9 C5-A9 C6-A9 C7-A9
A10 IG-AdaBoost C1-A10 C2-A10 C3-A10 C4-A10 C5-A10 C6-A10 C7-A10
A11 Chi2-decision tree C1-A11 C2-A11 C3-A11 C4-A11 C5-A11 C6-A11 C7-A11
A12 Chi2-SVM C1-A12 C2-A12 C3-A12 C4-A12 C5-A12 C6-A12 C7-A12
A13 Chi2-naive Bayes C1-A13 C2-A13 C3-A13 C4-A13 C5-A13 C6-A13 C7-A13
A14 Chi2-KNN C1-A14 C2-A14 C3-A14 C4-A14 C5-A14 C6-A14 C7-A14
A15 Chi2-AdaBoost C1-A15 C2-A15 C3-A15 C4-A15 C5-A15 C6-A15 C7-A15

C: criteria; A: alternative; C1: train time; C2: test time; C3: AUC; C4: classification accuracy; C5: F1 score; C6: precision; C7: recall.

2.3.2. FDOSM Method for Ranking Hybrid ML Models

FDOSM is considered an MCDM method for ranking and evaluation benchmarking. In decision-making, FDOSM comprised three block units: the data input unit, the data transformation unit, and the data processing unit [68]. The framework for group decision-making consists of two phases: external and internal aggregations. Figure 8 depicts the FDOSM methodology. The FDOSM steps can be expressed as follows:

  1. Data input unit: like existing MCDM approaches, the proposed MCDM method assigns m choices to each MCDM issue. A1, ⋯, Am that presented hybrid models and n set of decision criteria C1, ⋯, Cn that represented evaluation criteria. The DM represents this block's output. Next step, this choice matrix is converted into an opinion matrix [68].

  2. Data transformation unit: upon constructing the DM (the output of the first block), FDOSM adopts the transformation unit by selecting a three-parameter optimal solution (minimum, maximum, and critical values). The cost criterion combines the minimum value criterion, wherein the lowest value indicates the best option. The maximum value is used with the benefit criteria, whereby the highest value means the best solution. Critical value philosophy is the value employed in many situations, especially when the optimal answer is neither minimum nor maximum, as in the case of blood pressure. The following steps are outlined and detailed for this stage:

Figure 8.

Figure 8

FDOSM Methodology for ASD.

Stage 1 . —

Choose the optimal solution. Consequently, the optimal solution is described as follows:

A=maxivijjJ,minivijjJ,OpijI.Ji=1.2.3..m. (12)
Stage 2 . —

Compare the optimum solution to alternative values based on the criterion. This method of allocating weights to assessment criteria is implicitly supplied. Subjectively, the significance of the differences between the ideal solution and the alternatives is evaluated as shown in

OpLang=v~ijvijjJi=1.2.3..m. (13)

A panel of three experts specialized in data mining with bioinformatics have been asked in this stage, with more than five years of experience in this field.

Data-processing unit: the opinion matrix refers to the transformation unit's output. The last block begins by using TFNs to turn the opinion matrix into a fuzzy opinion decision matrix. A direct aggregation operator is then applied (i.e., arithmetic mean). Table 7 illustrates the transform linguistic terms into TFNs after comparing an ideal solution with other values of DM.

Table 7.

Linguistic terms and their equivalent TFNs.

Linguistic terms TFNs
No difference (0.00, 0.10, 0.30)
Slight difference (0.10, 0.30, 0.50)
Difference (0.30, 0.50, 0.75)
Big difference (0.50, 0.75, 0.90)
Huge difference (0.75, 0.90, 1.00)

This step used the same Step 4 in the FWZIC methodology.

The best-ranking order correlates to the lowest mean score value.

External aggregation: in external aggregation, fuzzy opinion matrices from various DMs are individually processed based on the processes outlined in the processing unit. The outcomes of the decision matrices are then aggregated into the final group decision using the arithmetic mean. In this instance, the expert opinions will be jointed after the final ranking has been determined.

3. Result and Discussion

The sequence results for each phase can be presented in this section.

3.1. Preprocessing Results

The results of the dataset after imputing the missing value are visualized in Figure 9. The result of SMOTE method is presented in Figure 10. In this context, SMOTE method aid ML models without bias for the diagnosis of ASD.

Figure 9.

Figure 9

Dataset after missing value.

Figure 10.

Figure 10

Data balance result.

As shown in Figure 10, the dataset has three balance labels of class “severity”: light, medium, and intense, and each class included 259 instances.

3.2. Feature Selection and FWIZC Results

As mentioned in Phase 2, three FS techniques have been applied to the balanced ASD dataset. In addition, the FWIZC method constructed the weights for the medical tests and sociodemographic features within each FS technique. Table 8 illustrates the results of each technique with its corresponding FWIZC weights.

Table 8.

Feature selection results and relevant FWIZC weights.

No. Chi2-FWIZC weights IG-FWIZC weights ReF-FWIZC weights
1 The blood type of the father 0.015933 The blood type of the patient 0.018751 The patient's blood type 0.016909
2 Vitamin D3 0.015897 The blood type of the mother 0.016847 mother's blood type 0.015103
3 Vitamin B12 0.01408 The blood type of the father 0.016847 Father's blood type 0.015103
4 Jaundice 0.026866 Relative relation 0.015771 m-age 0.028678
5 Marital relationship for parents 0.017665 Vitamin B12 0.014603 f-age 0.016089
6 Patient moving at home 0.033561 Vitamin zinc 0.014603 The age difference between the parents 0.013883
7 Relative relation 0.015516 The age difference between the parents 0.015771 Jaundice 0.025819
8 The blood type of the mother 0.015933 Premature baby 0.026891 Relative relation 0.013883
9 The blood type of the patient 0.017694 Marital relationship for parents 0.022249 Maternal diseases during pregnancy 0.026403
10 Premature baby 0.02585 Blood match 0.021993 Complications of childbirth for the child 0.03063
11 Duration of premature baby 0.024033 Sex 0.027507 Duration 0.022304
12 Sex 0.026708 Unnatural medicines for mother 0.029702 Sex 0.024509
13 Smell the food 0.019389 Taste the food 0.017561 Unnatural medicines for mother 0.026673
14 Taste the food 0.017166 Chewing food 0.014437 Smells food 0.017956
15 Chewing food 0.013635 Annoying from clothing tag 0.019961 Taste the food 0.015681
16 Annoying from clothing tag 0.019389 Waves 0.030499 Chews food before eating it 0.013272
17 Waves 0.028859 Patient moving at home 0.035524 Annoyance with clothing tag 0.017956
18 Patient moves around itself 0.039434 Patient moves around itself 0.041172 Moves around itself 0.037286
19 He is afraid of loud sounds 0.030589 He is afraid of loud sounds 0.031994 He is afraid of loud sounds 0.029177
20 Carry out orders 0.03512 Carry out orders 0.03685 Carry out orders 0.033518
21 Laughing for no reason 0.036231 Laughing for no reason 0.037755 Laughing for no reason 0.034157
22 Crying for no reason 0.027171 Crying for no reason 0.028323 Crying for no reason 0.025705
23 Mind wandering 0.033023 Mind wandering 0.034608 Mind wandering 0.031512
24 Play with children 0.032062 Play with children 0.033915 Mingles with children 0.030955
25 Kisses with a sound 0.015932 Kisses with a sound 0.016912 He kisses the mother by the sound 0.015596
26 Escaping home when doors are open 0.020294 Escaping home when doors are open 0.023215 If he opens the door of the house, he escapes 0.019783
27 Pointing with the index finger 0.033556 Pointing with the index finger 0.039766 Pointing with the index finger 0.031951
28 Notice the sound of the bell 0.025596 Notice the sound of the bell 0.029171 Notice the sound of the bell 0.02424
29 Notice his name 0.032634 Notice his name 0.033905 Notice his name 0.030632
30 Responds when parents call by name 0.023678 Responds when parents call by name 0.024576 Responds when parents call by name 0.022287
31 Diapers 0.020406 Diapers 0.021429 Bathroom skills 0.029392
32 Bathroom skills 0.030423 Bathroom skills 0.032171 Is there a language now? 0.037463
33 Is there a language now? 0.039972 Is there a language now? 0.041533 Arrange things in one row 0.029863
34 Arrange things in one row 0.031764 He plays with circle things 0.032964 Nodded 0.033562
35 He plays with circle things 0.031703 Nodded 0.036986 Marital relationship 0.019868
36 Nodded 0.035259 Duration of premature baby 0.024929 Blood match 0.019786
37 The age difference between the parents 0.015516 Smell the food 0.019961 Toxoplasmosis 0.020299
38 Unnatural medicines for mother 0.028723 Father age 0.018348 Complications of childbirth for the mother 0.02848
39 Complications of childbirth for the child 0.032738 Premature baby 0.024148
40 Diapers 0.019626
41 He plays with circle things 0.029863

Table 8 shows that the weights for each set of FS techniques have been obtained based on four physicians' subjective judgments. In this context, the role of the physicians' experience towards feature contributions has been addressed. For the ASD dataset, the relevancies of medical tests and sociodemographic characteristic features have been considered for the severity classes: light, medium, and intense. The benefit of the weighing process is assigning weight to each feature according to its importance. Thus, the constructed hybrid diagnosis ML models in the next section will be designed based on the weighted dataset resulting in a more accurate sense to be closer to reality.

3.3. Evaluation Criteria and DM Results

The performance metric results of the 15 hybrid diagnosis models using the developed DM can be shown in Table 9. The 15 models are evaluated using 66% of the dataset for training and 34% for testing.

Table 9.

Result of decision matrix.

Alternatives/criteria Performance evaluation metric criteria
Hybrid diagnosis models C1 C2 C3 C4 C5 C6 C7
A1 ReF-decision tree 0.37700 0.00001 0.99312 0.98943 0.98943 0.98944 0.98943
A2 ReF-SVM 2.22900 0.91700 0.95052 0.83245 0.83251 0.83266 0.83245
A3 ReF-naive Bayes 0.13400 0.05000 0.96142 0.84415 0.84367 0.84344 0.84415
A4 ReF-KNN 0.68700 0.51000 0.90306 0.76038 0.74901 0.75549 0.76038
A5 ReF-AdaBoost 0.57700 0.28700 0.98897 0.98528 0.98528 0.98530 0.98528
A6 IG-decision tree 0.47200 0.00200 0.99312 0.98943 0.98943 0.98944 0.98943
A7 IG-SVM 2.70600 0.91400 0.94778 0.83358 0.83364 0.83401 0.83358
A8 IG-naive Bayes 0.21900 0.04000 0.96313 0.84113 0.84029 0.83988 0.84113
A9 IG-KNN 0.60900 0.60600 0.89653 0.75585 0.74503 0.74750 0.75585
A10 IG-AdaBoost 0.69700 0.37600 0.98642 0.98189 0.98189 0.98193 0.98189
A11 Chi2-decision tree 0.46600 0.00200 0.99271 0.98830 0.98830 0.98830 0.98830
A12 Chi2-SVM 3.19100 1.06000 0.94979 0.83094 0.83080 0.83066 0.83094
A13 Chi2-naive Bayes 0.22900 0.04300 0.96235 0.84075 0.83966 0.83926 0.84075
A14 Chi2-KNN 0.90700 0.62900 0.89462 0.74302 0.73201 0.73414 0.74302
A15 Chi2-AdaBoost 0.75400 0.59500 0.98670 0.98226 0.98226 0.98230 0.98226

C: criteria; A: alternative; C1: train time; C2: test time; C3: AUC; C4: classification accuracy; C5: F1 score; C6: precision; C7: recall.

Table 9 (DM) shows that the 15 models (alternatives) have been evaluated using three performance evaluation metrics (criteria). The hybrid model A1=ReF-decision tree and A6=IG-decision tree have the highest accuracy of 98.94%, while the hybrid model A14=Chi2-KNN has the lowest accuracy of 74.30%. Furthermore, many models have similar accuracy results, such as A2=ReF-SVM, A7=IG-SVM, and A12=Chi2-SVM. On the contrary, some models have produced the shortest testing time, zero seconds, such as A1=ReF-decision tree. While A12=Chi2-SVM obtained the highest testing time, 1.06 seconds. In addition, A3=ReF-naive Bayes obtained 0.134 seconds for the shortest training time, and A12=Chi2-SVM obtained 3.191 seconds for the highest training time. Most ML models have obtained good results concerning the seven performance metrics. All the hybrid models have been measured using weighted datasets resulting from the FWZIC method. Thus, the FWZIC method has provided a suitable guideline for applying the weights to the ASD dataset, increasing the performance metric values. On the other hand, the evaluation results have conflict and trade-off issues among the criteria, making determining the best hybrid model a hard task. Therefore, the evaluation and benchmarking using FDOSM will solve these issues in the next section.

3.4. FDOSM Results

As shown in Table 10, there is an overlap of the obtained results for the 15 hybrid models, which cannot provide the precise decision of the best one. So, utilizing the FDOSM to benchmark 15 hybrid models must be achieved to select the best models based on seven evaluation metric criteria. The ranking results for the 15 hybrid models are shown in Table 10 with the score values and orders. The 15 hybrid diagnosis models are ranked according to the score values in ascending order. As the alternative score is lower, the model obtained a better rank and vice versa.

Table 10.

FDOSM results of the benchmarking of the 15 hybrid diagnosis models.

Hybrid diagnosis models Score value Ranking order
A1 ReF-decision tree 0.157142829 1
A2 ReF-SVM 0.761111002 10
A3 ReF-naive Bayes 0.576984072 7
A4 ReF-KNN 0.781745832 13
A5 ReF-AdaBoost 0.350793653 4
A6 IG-decision tree 0.175396799 2
A7 IG-SVM 0.761111002 10
A8 IG-naive Bayes 0.6007936 8
A9 IG-KNN 0.797618813 14
A10 IG-AdaBoost 0.371428574 5
A11 Chi2-decision tree 0.294444441 3
A12 Chi2-SVM 0.776983983 12
A13 Chi2-naive Bayes 0.6007936 8
A14 Chi2-KNN 0.816666435 15
A15 Chi2-AdaBoost 0.400000007 6

As shown in Table 10, the ranking results of the hybrid models according to the FDOSM reveal that the order of the best/first three hybrid models was A1=ReF-decision tree, A6=IG-decision tree, and A11=Chi2-decision tree. In addition, the last/worst three models were A14=Chi2-KNN, A9=IG-KNN, and A4=ReF-KNN. A1 is the first-best hybrid diagnosis model for detecting the severity of ASD and obtained a 0.15714 score value, while A6 is the second-best hybrid diagnosis model, obtaining a 0.17539 score value. The third-best rank is the A11 which has obtained a 0.29444 score value. In these contexts, the decision tree classifier has contributed to A29, A47, and A65 to obtain the best diagnosis model and optimal solution for ASD classification. KNN integrates with FS, which performs the lowest diagnosis model for ASD classification.

4. Proposal for Future Work

Increasing the opportunities to evidence the risk of medical and behavioral factors in ASD is a valid scientific complex problem where genetic and environmental factors contribute to the emergence of ASD by affecting early brain development. In contrast, apply the FWZIC method to weigh each feature's results. Besides, the intersection of more FS approaches is based on more techniques and more ML. In addition, the use of the new DM consists of eight criteria for performance evaluation metrics, as to utilize the FDOSM method for evaluating and benchmarking large-scale hybrid models to select the optimal model for the diagnosis of ASD.

5. Conclusion

This research direction aims at developing a hybrid model through the intersection between nine ML methods and eight FS techniques based on three approaches of FS for predicting and diagnosing autism based on effective sociodemographic and medical by proposing the highest methodological standards applied with high accuracy. Many ML models have been developed to deal with the diagnosis problem from ASD datasets with only sociodemographic features. However, the academic literature does not consider combining medical tests with sociodemographic features to diagnose ASD based on the severity levels. In addition, developing an effective and appropriate ML model for diagnosing autism is important and more reliable, considering the physicians' experience. Therefore, this study considers the literature challenges and overcomes the available issues by combining the ML algorithms, FS techniques, and MCDM methods. Firstly, the methodology developed 15 hybrid diagnosis models using the intersection of three FS techniques and five ML algorithms based on medical tests and sociodemographic features. The FS techniques are Chi2, IG, and ReF and are used with the popular supervised ML algorithms decision tree, naive Bayes, KNN, SVM, and AdaBoost. The 15 ML models have been constructed based on a balanced and weighted dataset with the principle of weighing features considering the physicians' experience through the FWZIC method. Since so many developed hybrid models acquired varied metric results, it is difficult to select the optimal model due to conflict and trade-offs between criteria. Therefore, the methodology developed a new DM to evaluate and benchmark all hybrid models using FDOSM based on seven performance metrics: CA, precision, F1 score, recall, test time, train time, and AUC. DM led to the FDOSM method for ranking to select the best optimal model. ReF-decision tree obtained the best rank among all models. The performance metrics for the ReF-decision tree were 0.4190, 0.0030, 0.9946, 0.9902, 0.9902, 0.9902, 0.9902, and 0.9951 for the C1=train time, C2=test time, C3=AUC, C4=CA, C5=F1 score, C6=precision, and C7=recall, respectively. The results demonstrate that the developed methodology reaches flavour performance and surpasses many existing hybrid diagnosis models for autism. The summarized points for this study are as follows:

  1. Developing these models with the obtained results provides a clear guideline to other researchers on choosing the best ML model supported by scientific justification. Accordingly, the selection process for the best models cannot be achieved based on a specific metric. The performance evaluation metrics should be considered simultaneously for choosing the optimal model within other bioinformatics fields

  2. To our knowledge, the best hybrid model depends on the expert physicians based on the included and excluded features. More investigation is needed to address this fact through a discussion study with a panel of experts in future work. In conclusion, for the overall results of the three FS techniques, medical test features were less necessary and less beneficial in diagnosing ASD. Most medical features have been excluded, while sociodemographic features have acquired the most important benefits. So, the performance of medical test features affects the diagnosis process less than sociodemographic features. In these contexts, the proposed hybrid model using an MCDM-based ML approach brings up a new concept of applying features' importance as weights when developing the detection model of autism

  3. An optimal hybrid model resulting from this study increases confidence and encourages global medical users to meet the performance goals of AI applications. These goals can be achieved through the presented evaluation and benchmarking MCDM methodology

  4. There is one limitation that has been faced in the study. In the FS approach, the process of the selected features is still unclear about how to define a threshold value that represents the stop point for selecting relevant features and excluding irrelevant/few features. Therefore, more experimental research needs to be investigated using more FS approaches to investigate more threshold values

Data Availability

Data is available on request (ahmed.bahri1978@iips.icci.edu.iq).

Conflicts of Interest

The authors declare that they have no conflicts of interest.

References

  • 1.Alqaysi M. E., Albahri A. S., Hamid R. A. Diagnosis-based hybridization of multimedical tests and sociodemographic characteristics of autism spectrum disorder using artificial intelligence and machine learning techniques: a systematic review. International Journal of Telemedicine and Applications . 2022;2022:26. doi: 10.1155/2022/3551528.3551528 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Albahri A. S., Alwan J. K., Taha Z. K., et al. IoT-based telemedicine for disease prevention and health promotion: state-of- the-art. Journal of Network and Computer Applications . 2021;173, article 102873 doi: 10.1016/j.jnca.2020.102873. [DOI] [Google Scholar]
  • 3.Satu M. S., Farida Sathi F., Arifen M. S., Hanif Ali M., Moni M. A. Early detection of autism by extracting features: a case study in Bangladesh. 2019 International Conference on Robotics,Electrical and Signal Processing Techniques (ICREST); 2019; Dhaka, Bangladesh. pp. 400–405. [DOI] [Google Scholar]
  • 4.Islam S., Akter T., Zakir S., Sabreen S., Hossain M. I. Autism spectrum disorder detection in toddlers for early diagnosis using machine learning. 2020 IEEE Asia-Pacific Conference on Computer Science and Data Engineering (CSDE); 2020; Gold Coast, Australia. pp. 1–6. [DOI] [Google Scholar]
  • 5.Albahri A. S., Hamid R. A., Zaidan A. A., Albahri O. S. Early automated prediction model for the diagnosis and detection of children with autism spectrum disorders based on effective sociodemographic and family characteristic features. Neural Computing and Applications . 2022:1–27. doi: 10.1007/s00521-022-07822-0. [DOI] [Google Scholar]
  • 6.Akter T., Shahriare Satu M., Khan M. I., et al. Machine learning-based models for early stage detection of autism spectrum disorders. IEEE Access . 2019;7:166509–166527. doi: 10.1109/ACCESS.2019.2952609. [DOI] [Google Scholar]
  • 7.Akter T., Khan M. I., Ali M. H., Satu M. S., Uddin M. J., Moni M. A. Improved machine learning based classification model for early autism detection. ICREST 2021-2nd International Conference on Robotics, Electrical and Signal Processing Techniques; 2021; DHAKA, Bangladesh. pp. 742–747. [DOI] [Google Scholar]
  • 8.Dutta S. R., Datta S., Roy M. Using cogency and machine learning for autism detection from a preliminary symptom. Proceedings of the 9th International Conference On Cloud Computing, Data Science and Engineering, Confluence; 2019; Noida, India. pp. 331–336. [DOI] [Google Scholar]
  • 9.Küpper C., Stroth S., Wolff N., et al. Identifying predictive features of autism spectrum disorders in a clinical sample of adolescents and adults using machine learning. Scientific Reports . 2020;10(1):p. 4805. doi: 10.1038/s41598-020-61607-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Puerto E., Aguilar J., López C., Chávez D. Using multilayer fuzzy cognitive maps to diagnose autism spectrum disorder. Applied Soft Computing . 2019;75:58–71. doi: 10.1016/j.asoc.2018.10.034. [DOI] [Google Scholar]
  • 11.Joudar S. S., Albahri A. S., Hamid R. A. Triage and priority-based healthcare diagnosis using artificial intelligence for autism spectrum disorder and gene contribution: a systematic review. Computers in Biology and Medicine . 2022;146, article 105553 doi: 10.1016/j.compbiomed.2022.105553. [DOI] [PubMed] [Google Scholar]
  • 12.AlSereidi A., Salih S. Q. M., Mohammed R. T., et al. Novel federated decision making for distribution of anti-SARS-CoV-2 monoclonal antibody to eligible high-risk patients. International Journal of Information Technology and Decision Making . 2022:1–72. doi: 10.1142/S021962202250050X. [DOI] [Google Scholar]
  • 13.Alsattar H. A., Qahtan S., Mohammed R. T., et al. Integration of FDOSM and FWZIC under homogeneous Fermatean fuzzy environment: a prioritization of COVID-19 patients for mesenchymal stem cell transfusion. International Journal of Information Technology and Decision Making . 2022:1–41. doi: 10.1142/S0219622022500511. [DOI] [Google Scholar]
  • 14.Jassim M. M., Jaber M. M. Hybrid selection framework for class balancing approaches based on integrated Cnn and decision making techniques for lung cancer diagnosis. Eastern-European Journal of Enterprise Technologies . 2022;4(9(118)):69–76. doi: 10.15587/1729-4061.2022.263644. [DOI] [Google Scholar]
  • 15.Alamoodi A. H., Baker M. R., Albahri O. S., et al. Public sentiment analysis and topic modeling regarding COVID-19’s three waves of total lockdown: a case study on movement control order in Malaysia. KSII Transactions on Internet and Information Systems . 2022;16(7):2169–2190. doi: 10.3837/tiis.2022.07.003. [DOI] [Google Scholar]
  • 16.Alsalem M. A., Albahri O. S., Zaidan A. A., et al. Rescuing emergency cases of COVID-19 patients: an intelligent real-time MSC transfusion framework based on multicriteria decision-making methods. Applied Intelligence . 2022;52(9):9676–9700. doi: 10.1007/s10489-021-02813-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Albahri A. S., Zaidan A. A., Albahri O. S., et al. Development of IoT-based mhealth framework for various cases of heart disease patients. Health and Technology . 2021;11(5):1013–1033. doi: 10.1007/s12553-021-00579-x. [DOI] [Google Scholar]
  • 18.Mohammed T. J., Hadi S. M., Albahri A. S. Improving the prioritization procedure of patients with COVID-19 in hospitals based on decision-making techniques: a systematic review. Iraqi Journal for Computers and Informatics . 2022;48(1):6–12. doi: 10.25195/ijci.v48i1.347. [DOI] [Google Scholar]
  • 19.Albahri A. S., Hamid R. A., Alwan J. ., et al. Role of biological data mining and machine learning techniques in detecting and diagnosing the novel coronavirus (COVID-19): a systematic review. Journal of Medical Systems . 2020;44(7):p. 122. doi: 10.1007/s10916-020-01582-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Yin W., Mostafa S., Wu F. X. Diagnosis of autism spectrum disorder based on functional brain networks with deep learning. Journal of Computational Biology . 2021;28(2):146–165. doi: 10.1089/cmb.2020.0252. [DOI] [PubMed] [Google Scholar]
  • 21.Asadzadeh S., Yousefi Rezaii T., Beheshti S., Delpak A., Meshgini S. A systematic review of EEG source localization techniques and their applications on diagnosis of brain abnormalities. Journal of Neuroscience Methods . 2020;339:p. 108740. doi: 10.1016/j.jneumeth.2020.108740. [DOI] [PubMed] [Google Scholar]
  • 22.Ataei A., Amini A., Ghazizadeh A. Brain activity estimation using EEG-only recordings calibrated with joint EEG-fMRI recordings using compressive sensing. 13th International conference on Sampling Theory and Applications (SampTA); 2019; Bordeaux, France. pp. 5–7. [DOI] [Google Scholar]
  • 23.Raj S., Masood S. Analysis and detection of autism spectrum disorder using machine learning techniques. Procedia Computer Science . 2020;167:994–1004. doi: 10.1016/j.procs.2020.03.399. [DOI] [Google Scholar]
  • 24.Rajab K. D., Padmavathy A., Thabtah F. Machine learning application for predicting autistic traits in toddlers. Arabian Journal for Science and Engineering . 2021;46(4):3793–3805. doi: 10.1007/s13369-020-05165-3. [DOI] [Google Scholar]
  • 25.Abdelhamid N., Padmavathy A., Peebles D., Thabtah F., Goulder-Horobin D. Data imbalance in autism pre-diagnosis classification systems: an experimental study. Journal of Information and Knowledge Management . 2020;19(2040014):1–16. doi: 10.1142/S0219649220400146. [DOI] [Google Scholar]
  • 26.Alahmari F. A comparison of resampling techniques for medical data using machine learning. Journal of Information & Knowledge Management . 2020;19(1) doi: 10.1142/S021964922040016X. [DOI] [Google Scholar]
  • 27.Altay O., Ulas M. Prediction of the autism spectrum disorder diagnosis with linear discriminant analysis classifier and K-nearest neighbor in children. 6th International Symposium on Digital Forensic and Security, ISDFS 2018- Proceeding; 2018; Antalya, Turkey. pp. 1–4. [DOI] [Google Scholar]
  • 28.Hassan M. M., Mokhtar H. M. O. AutismOnt: an ontology-driven decision support for autism diagnosis and treatment. Egyptian Informatics Journal . 2022;23(1):95–103. doi: 10.1016/j.eij.2021.07.002. [DOI] [Google Scholar]
  • 29.Bolón-Canedo V., Sánchez-Maroño N., Alonso-Betanzos A. Feature selection for high-dimensional data. Progress in Artificial Intelligence . 2016;5(2):65–75. doi: 10.1007/s13748-015-0080-y. [DOI] [Google Scholar]
  • 30.Rana A., Pandey R. A review of popular decision tree algorithms in data mining. Asian Journal of Multidimensional Research . 2021;10(10):230–237. doi: 10.5958/2278-4853.2021.00837.5. [DOI] [Google Scholar]
  • 31.Ayon D. Machine learning algorithms: a review. International Journal Of Computer Science & Information Technology . 2016;7(3):1174–1179. doi: 10.21275/ART20203995. [DOI] [Google Scholar]
  • 32.Gou J., Ma H., Ou W., Zeng S., Rao Y., Yang H. A generalized mean distance-based k -nearest neighbor classifier. Expert Systems with Applications . 2019;115:356–372. doi: 10.1016/j.eswa.2018.08.021. [DOI] [Google Scholar]
  • 33.Zendehboudi A., Baseer M. A., Saidur R. Application of support vector machine models for forecasting solar and wind energy resources: a review. Journal of Cleaner Production . 2018;199:272–285. doi: 10.1016/j.jclepro.2018.07.164. [DOI] [Google Scholar]
  • 34.Tyralis H., Papacharalampous G. Boosting algorithms in energy research: a systematic review. Neural Computing and Applications . 2021;33(21):14101–14117. doi: 10.1007/s00521-021-05995-8. [DOI] [Google Scholar]
  • 35.Albahri O. S., Albahri A. S., Zaidan A. A., et al. Fault-tolerant mHealth framework in the context of IoT based real-time wearable health data sensor. IEEE Access . 2019;7:50052–50080. doi: 10.1109/ACCESS.2019.2910411. [DOI] [Google Scholar]
  • 36.Mohammed K. I., Zaidan A. A., Zaidan B. B., et al. Novel technique for reorganisation of opinion order to interval levels for solving several instances representing prioritisation in patients with multiple chronic diseases. Computer Methods and Programs in Biomedicine . 2020;185, article 105151 doi: 10.1016/j.cmpb.2019.105151. [DOI] [PubMed] [Google Scholar]
  • 37.Alsalem M. A., Alamoodi A. H., Albahri O. S., et al. Multi-criteria decision-making for coronavirus disease 2019 applications: a theoretical analysis review. Artificial Intelligence Review . 2022;55(6):4979–5062. doi: 10.1007/s10462-021-10124-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Alsalem M. A., Mohammed R., Albahri O. S., et al. Rise of multiattribute decision-making in combating COVID-19: a systematic review of the state-of-the-art literature. International Journal of Intelligence Systems . 2022;37(6):3514–3624. doi: 10.1002/int.22699. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Mohammed K. I., Jaafar J., Zaidan A. A., et al. A uniform intelligent prioritisation for solving diverse and big data generated from multiple chronic diseases patients based on hybrid decision-making and voting method. IEEE Access . 2020;8:91521–91530. doi: 10.1109/ACCESS.2020.2994746. [DOI] [Google Scholar]
  • 40.Saaty T. L. Mathematical Models for Decision Support . Berlin, Heidelberg: Springer; 1988. What is the Analytic Hierarchy Process? pp. 109–121. [DOI] [Google Scholar]
  • 41.Khatari M., Zaidan A. A., Zaidan B. B., Albahri O. S., Alsalem M. A., Albahri A. S. Multidimensional benchmarking framework for AQMs of network congestion control based on AHP and group-TOPSIS. International Journal of Information Technology and Decision Making . 2021;20(5):1409–1446. doi: 10.1142/S0219622021500127. [DOI] [Google Scholar]
  • 42.Zughoul O., Zaidan A. A., Zaidan B. B., et al. Novel triplex procedure for ranking the ability of software engineering students based on two levels of AHP and group TOPSIS techniques. International Journal of Information Technology and Decision Making . 2021;20(1):67–135. doi: 10.1142/S021962202050042X. [DOI] [Google Scholar]
  • 43.Albahri O. S., Zaidan A. A., Zaidan B. B., et al. New mHealth hospital selection framework supporting decentralised telemedicine architecture for outpatient cardiovascular disease-based integrated techniques: haversine-GPS and AHP-VIKOR. Journal of Ambient Intelligence and Humanized Computing . 2022;13(1):219–239. doi: 10.1007/s12652-021-02897-4. [DOI] [Google Scholar]
  • 44.Mohammed T. J., Albahri A. S., Zaidan A. A., et al. Convalescent-plasma-transfusion intelligent framework for rescuing COVID-19 patients across centralised/decentralised telemedicine hospitals based on AHP-group TOPSIS and matching component. Applied Intelligence . 2021;51(5):2956–2987. doi: 10.1007/s10489-020-02169-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Rezaei J. Best-worst multi-criteria decision-making method. Omega . 2015;53:49–57. doi: 10.1016/j.omega.2014.11.009. [DOI] [Google Scholar]
  • 46.Alsalem M. A., Zaidan A. A., Zaidan B. B., et al. Multiclass benchmarking framework for automated acute leukaemia detection and classification based on BWM and group-VIKOR. Journal of Medical Systems . 2019;43(7):p. 212. doi: 10.1007/s10916-019-1338-x. [DOI] [PubMed] [Google Scholar]
  • 47.Pamučar D., Stević Ž., Sremac S. A new model for determining weight coefficients of criteria in MCDM models: full consistency method (FUCOM) Symmetry . 2018;10(9):p. 393. doi: 10.3390/sym10090393. [DOI] [Google Scholar]
  • 48.Salih M. M., Albahri O. S., Zaidan A. A., Zaidan B. B., Jumaah F. M., Albahri A. S. Benchmarking of AQM methods of network congestion control based on extension of interval type-2 trapezoidal fuzzy decision by opinion score method. Telecommunication Systems . 2021;77(3):493–522. doi: 10.1007/s11235-021-00773-2. [DOI] [Google Scholar]
  • 49.Al-Samarraay M. S., Salih M. M., Ahmed M. A., et al. A new extension of FDOSM based on pythagorean fuzzy environment for evaluating and benchmarking sign language recognition systems. Neural Computing and Applications . 2022;34(6):4937–4955. doi: 10.1007/s00521-021-06683-3. [DOI] [Google Scholar]
  • 50.Al-Humairi S., Hizami A., Zaidan A. A., et al. Towards sustainable transportation: a pavement strategy selection based on the extension of dual-hesitant fuzzy multi-criteria decision-making methods. IEEE Transactions on Fuzzy Systems . 2022 doi: 10.1109/TFUZZ.2022.3168050. [DOI] [Google Scholar]
  • 51.Alamoodi A. H., Albahri O. S., Zaidan A. A., et al. New extension of fuzzy-weighted zero-inconsistency and fuzzy decision by opinion score method based on cubic pythagorean fuzzy environment: a benchmarking case study of sign language recognition systems. International Journal of Fuzzy Systems . 2022;24(4):1909–1926. doi: 10.1007/s40815-021-01246-z. [DOI] [Google Scholar]
  • 52.Krishnan E., Mohammed R., Alnoor A., et al. Interval type 2 trapezoidal-fuzzy weighted with zero inconsistency combined with VIKOR for evaluating smart e-tourism applications. International Journal of Intelligence Systems . 2021;36(9):4723–4774. doi: 10.1002/int.22489. [DOI] [Google Scholar]
  • 53.Mahmoud U. S., Albahri A. S., AlSattar H. A., et al. DAS benchmarking methodology based on FWZIC II and FDOSM II to support industrial community characteristics in the design and implementation of advanced driver assistance systems in vehicles. Journal of Ambient Intelligence and Humanized Computing . 2022:1–28. doi: 10.1007/s12652-022-04201-4. [DOI] [Google Scholar]
  • 54.Alsalem M. A., Alsattar H. A., Albahri A. S., et al. Based on T-spherical fuzzy environment: a combination of FWZIC and FDOSM for prioritising COVID-19 vaccine dose recipients. Journal of Infection and Public Health . 2021;14(10):1513–1559. doi: 10.1016/j.jiph.2021.08.026. [DOI] [PMC free article] [PubMed] [Google Scholar] [Retracted]
  • 55.Albahri O. S., Zaidan A. A., Albahri A. S., et al. Novel dynamic fuzzy decision-making framework for COVID-19 vaccine dose recipients. Journal of Advanced Research . 2022;37:147–168. doi: 10.1016/j.jare.2021.08.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Albahri A. S., Albahri O. S., Zaidan A. A., et al. Integration of fuzzy-weighted zero-inconsistency and fuzzy decision by opinion score methods under a q-rung orthopair environment: a distribution case study of COVID-19 vaccine doses. Computer Standards & Interfaces . 2022;80, article 103572 doi: 10.1016/j.csi.2021.103572. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Al-Samarraay M. S., Zaidan A. A., Albahri O. S., et al. Extension of interval-valued pythagorean FDOSM for evaluating and benchmarking real-time SLRSs based on multidimensional criteria of hand gesture recognition and sensor glove perspectives. Applied Soft Computing . 2022;116, article 108284 doi: 10.1016/j.asoc.2021.108284. [DOI] [Google Scholar]
  • 58.Halibas A. S., Reazol L. B., Delvo E. G. T., Tibudan J. C. Performance analysis of machine learning classifiers for ASD screening. International conference on innovation and intelligence for informatics, computing, and technologies (3ICT); 2018; Sakhier, Bahrain. pp. 1–6. [DOI] [Google Scholar]
  • 59.Mohammed R. T., Zaidan A. A., Yaakob R., et al. Determining importance of many-objective optimisation competitive algorithms evaluation criteria based on a novel fuzzy-weighted zero-inconsistency method. International Journal of Information Technology and Decision Making . 2022;21(1):195–241. doi: 10.1142/S0219622021500140. [DOI] [Google Scholar]
  • 60.Arqub O. A., Al-Smadi M., Momani S., Hayat T. Application of reproducing kernel algorithm for solving second-order, two-point fuzzy boundary value problems. Soft Computing . 2017;21(23):7191–7206. doi: 10.1007/s00500-016-2262-3. [DOI] [Google Scholar]
  • 61.Abu Arqub O., AL-Smadi M., Momani S., Hayat T. Numerical solutions of fuzzy differential equations using reproducing kernel Hilbert space method. Soft Computing . 2016;20(8):3283–3302. doi: 10.1007/s00500-015-1707-4. [DOI] [Google Scholar]
  • 62.Abu Arqub O. Adaptation of reproducing kernel algorithm for solving fuzzy Fredholm–Volterra integrodifferential equations. Neural Computing and Applications . 2017;28(7):1591–1610. doi: 10.1007/s00521-015-2110-x. [DOI] [Google Scholar]
  • 63.NǍdǍban S., Dzitac S., Dzitac I. Fuzzy TOPSIS: a general view. Procedia Computer Science . 2016;91:823–831. doi: 10.1016/j.procs.2016.07.088. [DOI] [Google Scholar]
  • 64.Zaidan A. A., Zaidan B. B., Alsalem M. A., Albahri O. S., Albahri A. S., Qahtan M. Y. Multi-agent learning neural network and Bayesian model for real-time IoT skin detectors: a new evaluation and benchmarking methodology. Neural Computing and Applications . 2020;32(12):8315–8366. doi: 10.1007/s00521-019-04325-3. [DOI] [Google Scholar]
  • 65.Albahri A. S., Hamid R. A., Albahri O. S., Zaidan A. A. Detection-based prioritisation: framework of multi-laboratory characteristics for asymptomatic COVID-19 carriers based on integrated entropy-TOPSIS methods. Artificial Intelligence in Medicine . 2021;111, article 101983 doi: 10.1016/j.artmed.2020.101983. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Albahri O. S., Zaidan A. A., Salih M. M., et al. Multidimensional benchmarking of the active queue management methods of network congestion control based on extension of fuzzy decision by opinion score method. International Journal of Intelligence Systems . 2021;36(2):796–831. doi: 10.1002/int.22322. [DOI] [Google Scholar]
  • 67.Hamid R. A., Albahri A. S., Albahri O. S., Zaidan A. A. Dempster–Shafer theory for classification and hybridised models of multi-criteria decision analysis for prioritisation: a telemedicine framework for patients with heart diseases. Journal of Ambient Intelligence and Humanized Computing . 2022;13(9):4333–4367. doi: 10.1007/s12652-021-03325-3. [DOI] [Google Scholar]
  • 68.Salih M. M., Zaidan B. B., Zaidan A. A. Fuzzy decision by opinion score method. Applied Soft Computing . 2020;96, article 106595 doi: 10.1016/j.asoc.2020.106595. [DOI] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

Data is available on request (ahmed.bahri1978@iips.icci.edu.iq).


Articles from Computational and Mathematical Methods in Medicine are provided here courtesy of Wiley

RESOURCES