The Use of Ensemble Models for Multiple Class and Binary Class Classification for Improving Intrusion Detection Systems

Celestine Iwendi; Suleman Khan; Joseph Henry Anajemba; Mohit Mittal; Mamdouh Alenezi; Mamoun Alazab

doi:10.3390/s20092559

. 2020 Apr 30;20(9):2559. doi: 10.3390/s20092559

The Use of Ensemble Models for Multiple Class and Binary Class Classification for Improving Intrusion Detection Systems

Celestine Iwendi ¹, Suleman Khan ², Joseph Henry Anajemba ^3,^*, Mohit Mittal ^4,^*, Mamdouh Alenezi ⁵, Mamoun Alazab ⁶

PMCID: PMC7249012 PMID: 32365937

Abstract

The pursuit to spot abnormal behaviors in and out of a network system is what led to a system known as intrusion detection systems for soft computing besides many researchers have applied machine learning around this area. Obviously, a single classifier alone in the classifications seems impossible to control network intruders. This limitation is what led us to perform dimensionality reduction by means of correlation-based feature selection approach (CFS approach) in addition to a refined ensemble model. The paper aims to improve the Intrusion Detection System (IDS) by proposing a CFS + Ensemble Classifiers (Bagging and Adaboost) which has high accuracy, high packet detection rate, and low false alarm rate. Machine Learning Ensemble Models with base classifiers (J48, Random Forest, and Reptree) were built. Binary classification, as well as Multiclass classification for KDD99 and NSLKDD datasets, was done while all the attacks were named as an anomaly and normal traffic. Class labels consisted of five major attacks, namely Denial of Service (DoS), Probe, User-to-Root (U2R), Root to Local attacks (R2L), and Normal class attacks. Results from the experiment showed that our proposed model produces 0 false alarm rate (FAR) and 99.90% detection rate (DR) for the KDD99 dataset, and 0.5% FAR and 98.60% DR for NSLKDD dataset when working with 6 and 13 selected features.

Keywords: intrusion detection system, ensemble methods, feature selection, machine learning, false positive rate, artificial intelligence

1. Introduction

The increase in how people view and utilize the Internet has become a blessing and also a liability to our everyday online activities. The quest for urgent data transmission on the internet and the need for commensurable security, authentication, confidentiality of web applications, and cloud interface computing have given rise to all kinds of advanced security attacks. The day to day internet usage is becoming complicated due to the threat of the internet in data security, industrial attack, and sponsored attacks to social and engineering facilities [1,2]. The complex natures of the attacks demand response with security systems that are efficient, automated, having faster responses, accuracy, and efficient security preventing systems in place.

Network intrusion detection systems (NIDS) have been developed by researchers over time that serve the purpose of detecting any suspicious action and intention that will lead to data theft or identity cloning. The fact that there has been a rapid response to security attacks on many web-based applications has not deterred the intruders from discovering loopholes to the networks and sending more sophisticated attacks.

An ExtraTrees classifier that is used in selecting applicable features for different types of Intruders with extreme learning machines (ELMs) was proposed by [1]. During attacks classification, multi-class issues were divided into multiple binary classifications and the authors used subjective extreme learning machines to solve the issue of imbalance. Lastly, they implemented in parallel the ELMs ensemble by using GPUs in order to perform in real-time intrusion detection. Their results did better than all the other methods earlier in use, achieving 98.24% and 99.76% precision on their datasets for multi-class classification. Their proposer incurred a small overhead and lacks training on how to distinguish between normal traffic and potential attacks. Meanwhile, a multi-model biomatrix recognition system that is based on pattern recognition methods was used to make a personal identification by [2]. A modification of the fingerprint was done by applying the Delaney triangulation network. Although their system achieved a high precision with low error rate equals 0.9%, it is limited and cannot function as IDS because it is based on eyelash detection and not on the internet or online system.

Another multiclass classification that uses a heterogeneous ensemble model and outlier detection in a combination of numerous approaches and ensemble methods was developed by [3]. Their study was based on Pre-processing involving a way to filter global outliers, using a synthetic minority oversampling technique (SMOTE) algorithm to repeat the sampling process. They performed a binarization on the dataset by using the OnevsOne decomposing technique. In addition, Adaboost, random subspace algorithms, and random forest were used to design their model as the base classifier. Their proposed model performed better in terms of outlier detection and classification prediction for the multiclass problem, and also did better than other classical algorithms commonly in use. The study failed to combine filtration and wrapper selection methods in order to investigate the effect of partial removal of point-outliers from datasets prior to building up of classifiers. DOS, probe, U2R, and R2L were the four types of attacks used by [4] to deal with the random forest model. They implemented ten cross-validations that were functional for classification usage and a Feature selection that was implemented on the dataset in order to reduce dimensionality, remover of redundancy and unrelated features. On comparing their random forest modeling with a j48 classier, their experimentation proves that accuracy and DR for four types of attacks are better, but they failed to use evolutionary calculation as a feature selection measure that could further improve the accuracy of the classifier. The fact is that denial of service (DoS) attacks have created massive disruptions to private and public sectors web-based applications of which many are not in the news due to management fears of customers’ panic and loss of shares. It becomes a challenge to create a multiple class-based IDS that has the capacity to withstand multiple attacks provide higher accuracy, higher detection rate (DR), and lower false detection rate (FAR).

This paper’s intention is to develop an intelligent intrusion detection system that has high accuracy, high packet detection rate, and low false alarm rate. The Objectives include 1. Developed machine learning models for the intrusion detection system; 2. Implement and evaluate the proposed solution on network security datasets; 3. Proposed a data-independent Model; 4. Achieved high accuracy; 5. Achieved high detection system; and 6. Achieved a low false alarm rate.

Our motivation is to reduce False Positive Rate (FPR) by applying dimensional reduction method on the Correlation Feature Selection (CFS) algorithm.

Our contribution includes:

The research performs dimensionality reduction using the Correlation-based feature selection (CFS) approach.
Machine Learning Ensemble Models with base classifiers (j48, Random forest and reptree) were used to perform simulations.
Automatically proposed optimal subset features for the new dataset.
FAR and Detection rate has a great impact on the IDS system, so we propose a novel solution based on machine learning ensemble models with the effect of the CFS algorithm.
Our Proposed CFS + Ensemble Classifiers has 0 false alarm rate and 99.90% detection rate for kdd99 dataset and for nslkdd dataset 0.5% FAR and 98.60% detection rate.
Our proposed model was evaluated and compared with two different datasets and also these research experimental results are also likened with other recent and important papers in this area.

The remainder of the paper is structured as stated: In Section 2, we describe the Literature review. Section 3 presented the proposed methodology. Section 4 describes the experiments and results. Section 5 concludes the research and the mindset of future work.

2. Literature Review

A hybrid smart system with an enhancement of the decision tree was used by the authors in [5] to design a multiple classifier system. This was done by applying Adaboost and naïve Bayes with decision trees (NBDT), non-nested generalized exemplar (NNge), and incremental pruning (JRip) rule-based classifiers (NNJR). The system was able to detect network intrusions efficiently. The only limitation to this research is that other data mining approaches were not explored in full. Hybrid IDS based on integrating the predictions of a tree by probability in a diverse kind of classifier was proposed by [6]. Their result illustrates a model that gives a much lower false alarm rate and a peak detection rate. Moreover, their proposed model shows better precision than the recent IDS models with a precision equivalent to 96.27% for KDD’99 and 89.75% for NSL-KDD—unlike authors in [7] that use spectral clustering (SC) and deep neural network (DNN) in their proposer for intrusion detection. Their results indicate that their classifier delivers a real tool of study and analysis of intrusion detection in a large network and does better than back propagation neural network (BPNN), support vector machine (SVM), random forest (RF), and Bayes tree models in spotting precision and the types of irregular attacks in the network.

The hybrid model of [8] is a proposed system designed on the network transaction that estimates the intrusion scope threshold degree at data’s peak features which are readily accessible for the physical activities. Their results show that the hybrid approach is necessary in order to achieve accuracy of 99.81% and 98.56% for the binary class and multiclass NSL-KDD datasets, respectively. Nevertheless, it was suggested for further studies to apply optimizing techniques with the intrusion detection model because it is likely to have a better accuracy rate.

A Gini index based feature selection can give the ensemble technique a higher increase accuracy of detection by 10% according to [9]. Other benefits include reduction of a false positive rate to 0.05 and improving the system performance in terms of the time it takes for executing a truer positive rate. Nevertheless, reduced features that will require less processing time in a distributed situation need to be applied to improve the detection rate.

An improved conditional variational Auto Encoder (ICVAE) with a deep neural network (DNN) was combined to design an intrusion detection model known as ICVAE-DNN by [10]. They learn and explore potential sparse representations between network data features and classes that show better overall accuracy, detection rate, and false positive rate than the nine state-of-the-art intrusion detection methods. Nonetheless, there is a need to improve the detection performance of minority attacks and unknown attacks. The adversarial learning method can be used to explore the spatial distribution of ICVAE latent variables to better reconstruct input samples. The machine learning-based IDS developed by the authors in [11] are based on deep learning. According to the authors, in large network datasets and unbalanced network traffic, the performance of the IDS may be affected, this can result in an anomaly network-based IDS. A Deep Belief Networks (DBNs) approach which projected deep learning as a swift upsurge of machine learning (ML) was proposed in [12,13]. Following this proposal, deep learning has realized greatly the extraction of high-level dormant features from dataset models. However, notwithstanding these huge successes, several problems related to IDS still exist—firstly, a high network data dimension. In many IDS models, the feature selection approach is first considered as one of the steps of the preprocessing [14]—for instance, the advancement of the Internet of Things (IoT) and the prevalent cloud-based services, in addition to the emergence of several new attacks. In the training dataset, several unidentified attacks do not appear. For instance, in the NSL-KDD dataset considered in [15,16], about 16.6% of the attack samples in the dataset tested did not appear in the training dataset. This implies that mostly all conventional IDS typically achieve poor performance. However, for an anomaly network-based IDS (A-NIDS), the authors in [17,18] proposed a primal dependable hybrid approach that incorporates the Adaboost meta-algorithm and artificial bee colony (ABC). This is intended to achieve optimal detection rate (DR) at a minimized false positive rate (FPR) [19]. In the study by [20], the ABC algorithm is implemented for selection of features, while the Adaboost meta-algorithm is used for feature classification and evaluation. The Adaboost meta-algorithm was implemented to tackle the unbalanced data based on the actual plan, while the ABC was used for the IDS problem optimization. Incorporating both the redesigned density peak clustering algorithm (MDPCA) and the deep belief networks (DBNs) resulted in a novel fuzzy aggregation approach which was proposed in [21]. The MDPCA section of the algorithm splits the primal training dataset into numerous minor subsets based on the similarity of the training samples feature. On the other hand, the results of the entire sub-DBNs classifiers are combined according to the weights of the fuzzy membership. The objective of [22] was to design a system that has to have the capacity for accurate traffic classification of classes into normal and attack, measure up the huge datasets, and be able to acquire a lower false alarms rate. To achieve these, the authors leveraged on the Extreme Learning Machine (ELM) algorithm, which is an advanced ML algorithm. Although the ELM algorithm has proved to be more efficient in terms of performance against the Support Vector Machine (SVM) algorithm, it operates, however, at high frequency while sustaining adequate classification ability. The authors further attempted to enhance the performance ELM algorithm by including a redesigned kind of Huang’s Kernel-based ELM and combined this with the Multiple Kernel Boost (MKBoost) framework which was earlier introduced by [3]. A novel approach based on the combination of discretization, filtering, and classification methods using a KDD Cup 99 dataset is presented in [23]. The focus of the research was to drastically minimize the number of features while classifier performance is absolutely maintained, or even improved. The approach makes use of filters because of their high-speed characteristics and based on their high suitability for large datasets. Deep learning models were applied as classifiers. Bearing in mind the importance of the temporary data classification of network attacks, the Long Short Term Memory (LSTM) network, a modification of frequent networks, was used in classifying the KDD’s dataset attacks [24]. Several works in the literature of [25,26] motivated the development of our proposed approach. A scheme of nested binary trees was used in [26]; the scheme realized a good performance when tested with minor UCI datasets, but the computational difficulty of this scheme amplified swiftly with the increase at the number of instances. The recent study of [25] integrated both the oversampling and binarization with boosting, and indicated that the proposed approach realized improved performance than the multiclass learners and one-versus-all (OVA) framework. Even though information about the runtime was voided in the study, the use of oversampling enhances substantial computational difficulty; hence, this method failed to scale proficiently for an application to IDS datasets, which encompasses a higher number of samples. On the other hand, the authors in [26] implemented random undersampling (RUS) in their method because it can realize similar performance when used for all the datasets while dealing with class imbalance mitigation.

Several studies on the use of binary classifiers set to the detection of intrusion have been established. A good number of these studies engaged the use of classifiers based on SVM. Authors presented a simple decision tree–based OVA model which populates a decision tree structure using a set of class probabilities [27]. An OVA method in [28] was also incorporated into a least-squares SVM technique and analyzed on the KDD dataset. The output showed that, for each of the five classes of traffic, their attack detection rate was approximately 99%. Additionally, the authors observed in the method, the best model realized an average FPR of 0.28%. SVMs in a binary classification method was employed by [29]. Authors in [30] proposed a composite scheme architecture in which precise classifiers were allocated the task of detecting precise classes. For example, an SVM was allocated for the detection of DoS attacks, while an RBF-based neural network was allocated for the detection of U2R-based attacks. The results of the hybrid classifier were transferred to a different ensemble which was allocated for the detection of R2L and probe attacks. For this scenario, in advance, a definite architecture was defined. A weighting element was included in a scheme of binary SVMs in [31]. The binarization methods that were tested included one-versus-one (OVO), OVA, directed acyclic graphs, and ECOC. It was noticed that the OVA model distributes the best performance. It is observed by the authors that the weight which measures a prediction level of certainty was targeted at the unclassifiable areas in which the group of binary classifiers cannot approve on a single class prediction. using a precise subset of the KDDTest+ dataset, the model was assessed, but then the outputs proved that employing a weighting system with the model resulted in an improved general performance better than the model that did not include weighting scheme. Individual class performance on binarization approaches have been analyzed in all the above-mentioned works; however, the lowest FPR was realized in the recent works [32,33,34,35,36] while many other algorithm and DoS were considered by [37,38,39,40,41,42,43,44].

3. Proposed Methodology

This research has five phases according to our proposed methodology shown in Figure 1; the 1st phase is data collection. After data collection, the next phase is data pre-processing, which is phase 2. In data pre-processing, duplicate values inside the dataset are removed. Inconsistent values are also removed. Missing values were checked for its presence or not in the dataset. Data normalization was also done to bring down the whole dataset into one standard scale. Non-numeric values were converted to numeric by doing encoding. After data pre-processing, the 3rd phase is dimensionality reduction, which was done by using the CFS method. After dimensionality reduction, the next phase, which is the 4th phase, comes in the 4th phase machine learning ensemble classifiers Bagging, and Adaboost was used. The 5th phase is an evaluation phase; in this phase, this research work is compared with other state-of-the-art work that used the same approach.

3.1. Description

This research uses two datasets: the KDD99 dataset and the NSLKDD dataset.

3.1.1. KDD99 Dataset

KDD99 is one of the most famous and old data sets used in network security for intrusion detection systems. KDD99 is a derived version of the 1998 DARPA. The kdd99 dataset was developed in an MIT research lab, and it is used by IDS designers as a benchmark to evaluate various methodologies and techniques [40]. The kdd99 has 4,900,000 rows and 41 attributes, and one is class label. Twenty-two network attacks are listed in the KDD99 dataset [41]. In this research, we did binary classification as well as multiclass classification for kdd99 and nslkdd datasets. We named all the attacks as an anomaly and normal traffic and then performed experiments. Class labels consist of four major attacks like DoS, Probe, U2R, R2L, and Normal class. We did further classification in DoS, Probe, U2R, and R2L, in order to detect the categories of these attacks.

Table 1 represents the total number of normal and anomaly packets that contain the KDD99 dataset used in this research. 97,277 and 396,731 packets were used for anomaly and normal classes to develop ensemble machine learning classifiers upon which training and testing can be performed. In addition, 70% of the KDD99 dataset was used for training and validation purposes, and the rest of the 30% dataset was used for testing and validation, respectively. The samples for KDD99 Training and Testing are present in Table 2.

Table 1.

KDD99 dataset binary classifications total packets.

Packets Details	Packets Count
Normal Packets	97,277
Anomaly Packets	396,731
Total Size	494,008

Attack Name	Category	Count
Smurf	DoS	280,790
Neptune	DoS	107,200
Normal	Normal	97,277
Back	DoS	2203
Satan	Probe	1589
Ipsweep	Probe	1247
Portsweep	Probe	1040
Warezclient	R2L	1020
Teardrop	DoS	979
Pod	DoS	264
Nmap	Probe	231
Guess passwd	R2L	53
Buffer overflow	U2R	30
Land	DoS	21
Warezmaster	R2L	20
Imap	R2L	12
Loadmodule	U2R	9
Ftp_write	R2L	8
Multihop	R2L	7
Phf	R2L	4
Perl	U2R	3

Attack Name	Count
Normal	77,054
Neptune	45,871
Satan	4368
Ipsweep	3740
Smurf	3311
Portsweep	3088
Nmap	1566
Back	1315
Guess_passwd	1284
Mscan	996
Warezmaster	964
Teardrop	904
Warezclient	890
Apache2	737
Processtable	685
Snmpguess	331
Saint	319
Mailbomb	293
Pod	242
Snmpgetattack	178
Httptunnel	133

S.No.	Feature Name	Feature Type	S.No.	Feature Name	Feature Type
1	Duration	Number	2	Protocol Type	Non-Numeric
3	Service	Non-Numeric	4	Flag	Non-Numeric
5	Source Bytes	Number	6	Destination Bytes	Number
7	Land	Non-Numeric	8	Wrong Fragment	Number
9	Urgent	Number	10	Hot	Number
11	Number of failed logins	Number	12	logged in	Non-Numeric
13	Number Access Files	Number	14	Root Shell	Number
15	Su_Attemped	Number	16	Number Root	Number
17	Number of File Creations	Number	18	Number Shells	Number
19	Number Access Files	Number	20	number outbound Commands	Number
21	Is Host Login	Non-Numeric	22	Is Guest Login	Non-Numeric
23	Count	Number	24	Service Count	Number
25	Serror Rate	Number	26	Service Error Rate	Number
27	Rerror Rate	Number	28	Service RError Rate	Number
29	Same Service Rate	Number	30	Different Service Rate	Number
31	Service Different Host Rate	Number	32	Dst_host_count	Number
33	Dst_host_srv_count	Number	34	Dst_host_same_srv_rate	Number
35	Dst_host_diff_srv_rate	Number	36	Dst_host_same_src_port_rate	Number
37	Dst_host_srv_diff_host_rate	Number	38	Dst_host_serror_rate	Number
39	Dst_host_srv_serror_rate	Number	40	Dst_host_rerror_rate	Number
41	Dst_host_srv_rerror_rate	Number	42	Class Label Type	Non-Numeric

Dataset	Selected Features Using CFS
KDD99 (For 2 Attacks)	6, 12, 23, 31, 32
KDD99 (For 21 Attacks)	2, 3, 4, 5, 6, 7, 8, 14, 23, 30, 36
nslkdd (For 2 Attacks)	1, 3, 4, 5, 7, 8, 11, 12, 13, 30, 35, 36, 37
nslkdd (For 21 Attacks)	1, 3, 4, 5, 7, 8, 11, 12, 13, 30, 35, 36, 37

	TP Rate	FP Rate	Precision	Recall	F1-Score	ROC Area
Normal	99.10	0.60	97.40	99.10	98.30	99.90
Anomaly	99.40	0.90	99.80	99.40	99.60	99.90

	TP Rate	FP Rate	Precision	Recall	F1-Score	ROC Area
Normal	99.20	0.60	97.80	99.20	98.50	99.80
Anomaly	99.40	0.80	99.80	99.40	99.60	100.00

	TP Rate	FP Rate	Precision	Recall	F1 Score	ROC Area
Normal	98.70	0.60	97.40	98.70	98.10	99.50
Anomaly	99.40	1.30	99.70	99.40	99.50	100.00

	TP Rate	FP Rate	Precision	Recall	F1-Score	ROC Area
Normal	99.30	0.60	97.70	99.30	98.50	99.70
Anomaly	99.40	0.70	99.80	99.40	99.60	100.00

S.No.	Class	TP Rate	FP Rate	Precision	Recall	F1-Score	ROC Area
1	Normal	99.70	0.00	99.80	99.70	99.80	100.00
2	Buffer-overflow	46.20	0.00	100.00	46.20	63.20	99.70
3	Loadmodule	0.00	0.00	0.00	0.00	0.00	99.10
4	Perl	100.00	0.00	33.33	100.00	50.00	100.00
5	Neptune	100.00	0.00	100.00	100.00	100.00	100.00
6	Smurf	100.00	0.00	100.00	100.00	100.00	100.00
7	Guess_passwd	100.00	0.00	93.80	100.00	96.80	100.00
8	Pod	100.00	0.00	100.00	100.00	100.00	100.00
9	Teardrop	100.00	0.00	100.00	100.00	100.00	100.00
10	Portsweep	99.30	0.00	95.30	99.30	97.30	100.00
11	Ipsweep	97.90	0.10	81.60	99.90	89.00	99.20
12	Land	100.00	0.00	83.30	100.00	90.90	100.00
13	Ftp_write	0.00	0.00	0.00	0.00	0.00	100.00
14	Back	99.70	0.00	99.80	99.70	99.80	100.00
15	Imap	50.00	0.00	100.00	50.00	66.70	100.00
16	Satan	98.50	0.00	99.10	98.50	98.80	100.00
17	Phf	0.00	0.00	0.00	0.00	0.00	100.00
18	Nmap	55.60	0.00	97.20	55.60	70.70	99.80
19	Multihop	50.00	0.00	33.33	50.00	40.00	81.70
20	Warezmaster	60.00	0.00	100.00	60.00	75.00	71.70
21	Warezclient	93.00	0.00	97.10	93.00	95.00	99.10
22	Weighted Avg	99.90	0.00	99.90	99.90	99.90	100.00

	TP Rate	FP Rate	Precision	Recall	F1-Score	ROC Area
Normal	99.10	1.10	99.00	99.10	99.00	99.90
Anomaly	98.90	0.90	99.00	98.90	98.90	99.90

	Normal	Anomaly
Normal	28,934	271
Anomaly	759	118,238

	Normal	Anomaly
Normal	28,934	271
Anomaly	759	118,238

	Normal	Anomaly
Normal	28,975	230
Anomaly	658	118,339

	Normal	Anomaly
Normal	28,838	367
Anomaly	772	118,225

S.No.	Class	TP Rate	FP Rate	Precision	Recall	F1-Score	ROC Area
1	Normal	99.00	1.20	98.90	99.00	99.00	99.99
2	Neptune	100.00	0.00	99.90	100.00	99.90	100.00
3	Warezclient	90.20	0.00	95.60	90.20	92.80	99.80
4	Ipsweep	90.50	0.00	99.50	90.50	94.80	99.90
5	Portsweep	97.90	0.10	97.10	97.90	97.50	99.90
6	Teardrop	100.00	0.00	96.30	100.00	98.10	100.00
7	Nmap	96.20	0.30	78.20	96.20	86.30	99.90
8	Satan	97.20	0.30	91.40	97.20	94.20	99.80
9	Smurf	99.50	0.20	93.30	99.50	94.40	100.00
10	Pod	98.40	0.00	95.30	98.40	96.80	100.00
11	Back	100.00	0.00	99.80	100.00	99.90	100.00
12	Guess_passwd	96.80	0.00	96.50	96.80	96.70	99.50
13	Warezmaster	92.40	0.00	98.10	92.40	95.10	99.20
14	Saint	0.00	0.00	0.00	0.00	0.00	95.40
15	Mscan	95.70	0.00	94.80	95.70	95.20	99.80
16	Apache2	99.10	0.00	100.00	99.10	99.50	99.80
17	Snmpgetattack	1.80	0.00	100.00	1.80	3.40	98.80
18	Processtable	99.50	0.00	99.50	99.50	99.50	100.00
19	Httptunnel	95.00	0.00	90.50	95.00	92.70	97.50
20	Snmpguess	40.00	0.10	55.90	46.60	47.20	99.30
21	Mailbomb	88.00	0.00	95.70	88.00	91.70	99.30
22	Weighted Avg	98.40	0.60	98.30	98.40	98.30	99.90

KDD99 Experiment Average Results
S.No.	Proposed Models	TP Rate	FP Rate	Precision	Recall	F1-Score	ROC Area
1	Adaboost j48	99.90	0.00	99.90	99.90	99.90	100.00
2	Adaboost random forest	99.90	0.00	99.90	99.90	99.90	100.00
3	Adaboostreptree	99.90	0.00	99.90	99.90	99.90	100.00
4	Bagging j48	99.90	0.00	99.90	99.90	99.90	100.00
5	Bagging random forest	99.90	0.00	99.90	99.90	99.90	100.00
6	Bagging reptree	99.90	0.00	99.90	99.90	99.90	100.00
NSLKDD Experiment Average Results
S.No.	Proposed Models	TP Rate	FP Rate	Precision	Recall	F1-Score	ROC Area
1	Adaboost j48	98.40	0.60	98.30	98.40	98.30	99.90
2	Adaboost random forest	98.50	0.60	98.30	98.50	98.40	99.80
3	Adaboostreptree	98.20	0.80	97.90	98.20	98.00	99.90
4	Bagging j48	98.50	0.60	98.40	98.50	98.30	99.90
5	Bagging random forest	98.60	0.50	98.40	98.60	98.40	99.90
6	Bagging reptree	98.20	0.70	98.00	98.20	98.10	99.90

Method	Accuracy Detection Rate (%)	FR Rate (%)
DAR Ensemble [52]	78.88	N/A
Naive Bayes-KNN-CF [53]	82.00	05.43
Feature Selection + SVM [54]	82.37	15.00
GAR Forest + Symmatrixal Uncertainity [55]	85.00	12.20
Bagging j48 [56]	84.25	02.79
PCA+PSO [57]	99.40	0.60
Propose Model Bagging Random Forest (KDD99 dataset)	99.90	0.00
Propose Model Bagging Random Forest (NSLKDD dataset)	98.60	0.50

PERMALINK

The Use of Ensemble Models for Multiple Class and Binary Class Classification for Improving Intrusion Detection Systems

Celestine Iwendi

Suleman Khan

Joseph Henry Anajemba

Mohit Mittal

Mamdouh Alenezi

Mamoun Alazab

Abstract

1. Introduction

2. Literature Review

3. Proposed Methodology

Figure 1.

3.1. Description

3.1.1. KDD99 Dataset

Table 1.

Table 2.

Table 3.

3.1.2. NSLKDD Dataset

Table 4.

Table 5.

Table 6.

Table 7.

3.2. Pre-Processing

3.2.1. Normalization

3.2.2. Data Encoding

3.3. Feature Selection

Correlation-Based Feature Selection (CFS)

Figure 2.

Table 8.

3.4. Bagging Classifier

3.5. Adaboost Classifier

3.6. Evaluation Matrixs

4. Experiments

4.1. Binary Class Experiment Results for KDD99

Table 9.

Table 10.

Table 11.

Table 12.

Table 13.

Table 14.

Table 15.

Table 16.

Table 17.

Table 18.

Table 19.

Table 20.

Table 21.

Figure 3.

Table 22.

Figure 4.

Table 23.

Figure 5.

Table 24.

Figure 6.

Table 25.

Figure 7.

Table 26.

Figure 8.

4.2. Binary Class Experiment Results for NSLKDD

Table 27.

Table 28.

Table 29.

Table 30.

Table 31.

Table 32.

Table 33.

Table 34.

Table 35.

Table 36.

Table 37.

Table 38.

Table 39.

Figure 9.

Table 40.

Figure 10.

Table 41.

Figure 11.

Table 42.

Figure 12.