An Experimental Analysis of Attack Classification Using Machine Learning in IoT Networks

Andrew Churcher; Rehmat Ullah; Jawad Ahmad; Sadaqat ur Rehman; Fawad Masood; Mandar Gogate; Fehaid Alqahtani; Boubakr Nour; William J Buchanan

doi:10.3390/s21020446

. 2021 Jan 10;21(2):446. doi: 10.3390/s21020446

An Experimental Analysis of Attack Classification Using Machine Learning in IoT Networks

Andrew Churcher ¹, Rehmat Ullah ^2,^*, Jawad Ahmad ¹, Sadaqat ur Rehman ³, Fawad Masood ⁴, Mandar Gogate ¹, Fehaid Alqahtani ⁵, Boubakr Nour ⁶, William J Buchanan ¹

PMCID: PMC7827441 PMID: 33435202

Abstract

In recent years, there has been a massive increase in the amount of Internet of Things (IoT) devices as well as the data generated by such devices. The participating devices in IoT networks can be problematic due to their resource-constrained nature, and integrating security on these devices is often overlooked. This has resulted in attackers having an increased incentive to target IoT devices. As the number of attacks possible on a network increases, it becomes more difficult for traditional intrusion detection systems (IDS) to cope with these attacks efficiently. In this paper, we highlight several machine learning (ML) methods such as k-nearest neighbour (KNN), support vector machine (SVM), decision tree (DT), naive Bayes (NB), random forest (RF), artificial neural network (ANN), and logistic regression (LR) that can be used in IDS. In this work, ML algorithms are compared for both binary and multi-class classification on Bot-IoT dataset. Based on several parameters such as accuracy, precision, recall, F1 score, and log loss, we experimentally compared the aforementioned ML algorithms. In the case of HTTP distributed denial-of-service (DDoS) attack, the accuracy of RF is 99%. Furthermore, other simulation results-based precision, recall, F1 score, and log loss metric reveal that RF outperforms on all types of attacks in binary classification. However, in multi-class classification, KNN outperforms other ML algorithms with an accuracy of 99%, which is 4% higher than RF.

Keywords: Internet of Things (IoT), IoT attacks, security, intrusion detection systems, privacy, machine learning, ML models, multi-class classification

1. Introduction

The Internet of Things (IoT) offers a vision where devices with the help of sensors can understand the context and through networking functions can connect with each other [1]. The devices in the IoT network can be employed for collecting information based on the use cases. These include retail, healthcare, and manufacturing industries that use IoT devices for tasks such as tracking purchased items, remote patient monitoring, and fully autonomous warehouses. It is reported that the amount of IoT devices has been growing every year with the predicted amount of devices by 2025 reaching 75.44 billion [2]. Such a massive surge of IoT devices ultimately results in more attackers to target IoT networks. Reports state that most of the attack traffic generated on IoT networks is automated through various means such as scripts and malware [3]. The increase in attacks combined with the autonomous nature of the attacks is a problem for IoT networks as the devices are mostly used in a fire and forget fashion for years without any human interaction. This combined with the limitations of IoT devices including limited processing power and bandwidth means that providing adequate security can be difficult, which can result in network layer attacks such as denial of service (DoS). Therefore, it is important to research ways to identify this kind of traffic on networks which can be used in intrusion detection and prevention systems.

Machine learning (ML) methods can be exploited to detect malicious traffic in intrusion detection and prevention systems. ML is a subset of artificial intelligence (AI) that involves using algorithms to learn from data and make predictions based on the data provided [4]. ML has many applications including in retail, healthcare, and finance where AI algorithms may be applied for predicting customer spending habits, predicting medical problems in patients, and detecting bank fraud, respectively [5].

Due to the large yearly increases in cyberattacks that are being seen on a yearly basis, ML methods are being incorporated to help tackle the increasing threats of cyberattacks. ML has several uses within the field of cybersecurity, such as network threat analysis, which can be defined as the act of analyzing threats to the network [6]. ML can be beneficial in this task as it is able to monitor incoming and outgoing traffic to identify potentially suspicious traffic [7]. This area of research is known as intrusion detection and is a widely known research area. ML can be applied to intrusion detection systems (IDS) to help improve the systems ability to run autonomously and increase the accuracy of the system when raising the alarm on a suspected attack [8]. To this end, our primary role is to identify the best ML methods for detecting attacks on IoT networks, using a state-of-the-art dataset by utilizing both binary and multi-class classification testing.

The main contributions of this paper can be summarized as follows:

We conduct an in-depth and comprehensive survey on the role of various ML methods and attack detection specifically in regards to IoT networks.
We evaluate and compare the state-of-the-art ML algorithms in terms of various performance metrics such as confusion matrix, accuracy, precision, recall, F1 score, log loss, ROC AUC, and Cohen’s kappa coefficient (CKC).
We evaluate the results comparing binary class testing as well as examining the results of the multi-class testing.

The rest of the paper is organized as follows: Table 1 lists all the abbreviations used in the paper. Section 2 is devoted to a literature review involving investigating IoT intrusion detection techniques as well as ML methods and how they are being used to aid intrusion detection efforts specifically in regards to IoT networks. Details of various attacks that can occur in IoT networks are also showcased with an explanation of how the various ML methods and performance metrics work. Section 3 explains the performance evaluation, which also includes an in-depth examination of the data used in the datasets. The models are compared against each other for both binary and multi-class classification with an overall best model being selected. Finally, Section 4 draws a conclusion.

Table 1.

Abbreviations and their explanations.

Acronym	Explanation	Acronym	Explanation
IDS	Intrusion Detection Systems	ANN	Artificial Neural Network
ML	Machine Learning	KNN	K-nearest Neighbour
SVM	Support Vector Machine	DT	Decision Tree
NB	Naive Bayes	RF	Random Forest
LR	Logistic Regression	DDoS	Distributed Denial-of-Service
IoT	Internet of Things	CKC	Cohen’s Kappa Coefficient
TP	True Positive	TN	True Negative
FP	False Positive	FN	False Negative
TPR	True Positive Rate	FPR	False Positive Rate

	Actual Label
Predicted label	No attack	Attack
No attack	True negative	False negative
Attack	False positive	True positive

	Actual Label
Predicted label	Class 1	Class 2	Class 3
Class 1	C	W	W
Class 2	W	C	W
Class 3	W	W	C

Features	Description
Stime	Record start time
Sport	Port that data is being sent from
Dport	Port that data is being received from
Pkts	Total number of packets transferred
Bytes	Total number of bytes transferred
Ltime	Record last time
Seq	Sequence number
Dur	Record total duration
Mean	Average duration of aggregated records
Sum	Total duration of aggregated records
Min	Minimum duration of aggregated records
Max	Maximum duration of aggregated records
Spkts	Source to destination packet count
Dpkts	Destination to source packet count
Sbytes	Source to destination byte count
Dbytes	Destination to source byte count
Rate	Total packets per second in transaction
Srate	Source to destination packets per second
Drate	Destination to source packets per second

Dataset	No Attack Data	Attack Data	Total
Data exfiltration	24	118	142
DDoS HTTP	55	19771	19826
DDoS TCP	32	1048543	1048575
DDoS UDP	36	1048539	1048575
Key logging	164	1469	1633
OS Scan	3949	358275	362224
Service scan	1993	1046582	1048575
DoS HTTP	56	29706	29762
DoS TCP	106	1048469	1048575
DoS UDP	37	1048538	1048575

Classes	Training Data	Test Data	Total
No attack	1398	335	1733
Data exfiltration	22	7	29
DDoS HTTP	4209	1015	5224
DDoS TCP	221638	56377	278015
DDoS UDP	222728	55302	278030
Key logging	314	81	395
OS Scan	75877	18907	94784
Service scan	221745	55768	277509
DoS HTTP	6343	1475	7818
DoS TCP	223555	55236	278791
DoS UDP	222171	55501	277672

Module Name	Description
numpy	Used to store the dataset in an array
pandas	Used to read the dataset CSV file
preprocessing	Used to normalize feature data
model_selection	Used for splitting the training and test data
random	Used to randomize the multi-class dataset
metrics	Contains the performance metrics used in testing the model
neighbors	Contains KNN model
SVM [57]	Contains the SVM model
tree	Contains the DT model
naive bayes	Contains the NB model
ensemble	Contains the RF model
linear model	Contains the LR model
models	Contains the ANN model
layers	Contains ANN layers
utils	Contains class weight for ANN

Algorithms Used	Accuracy	Precision	Recall	F1 Score	Log Loss	ROC AUC
KNN [14]	0.86	0.95	0.87	0.91	0.19	0.83
SVM [57]	0.89	0.95	0.91	0.93	0.27	0.85
DT [19]	1.0	1.0	1.0	1.0	9.99	1.0
NB [58]	0.89	1.0	0.87	0.93	3.57	0.93
RF [24]	1.0	1.0	1.0	1.0	0.059	1.0
ANN [25]	0.82	0.82	1.0	0.90	2.57	0.5
LR [31]	0.89	0.95	0.91	0.93	0.22	0.85

Test Amount	Accuracy	Precision	Recall	F1 Score	Log Loss	ROC AUC
20	1.0	1.0	1.0	1.0	0.059	1.0
30	1.0	1.0	1.0	1.0	0.043	1.0
40	0.98	0.97	1.0	0.98	0.042	0.94
50	0.97	0.96	1.0	0.98	0.083	0.9
60	0.94	0.97	0.95	0.96	0.089	0.89

Algorithms Used	Accuracy	Precision	Recall	F1 Score	Log Loss	ROC AUC
KNN [14]	0.99	0.99	1.0	0.99	0.0095	0.83
SVM [57]	0.99	0.99	1.0	0.99	0.0093	0.77
DT [19]	1.0	1.0	1.0	1.0	7.25	1.0
NB [58]	0.99	0.99	0.99	0.99	0.063	0.66
RF [24]	0.99	0.99	1.0	0.99	0.0021	0.88
ANN [25]	0.99	0.99	1.0	0.99	0.044	0.5
LR [31]	0.99	0.99	1.0	0.99	0.0069	0.77

Algorithms Used	Accuracy	Precision	Recall	F1 Score	Log Loss	ROC AUC
KNN [14]	1.0	1.0	1.0	1.0	4.56	1.0
SVM [57]	0.99	0.99	1.0	0.99	8.93	0.92
DT [19]	1.0	1.0	1.0	1.0	9.99	1.0
NB [58]	0.99	1.0	0.99	0.99	0.00098	0.99
RF [24]	0.99	0.99	1.0	0.99	5.71	0.92
ANN [25]	0.99	0.99	1.0	0.99	5.30	0.5
LR [31]	0.99	0.99	1.0	0.99	7.77	0.78

Algorithms Used	Accuracy	Precision	Recall	F1 Score	Log Loss	ROC AUC
KNN [14]	n/a	n/a	n/a	n/a	n/a	n/a
SVM [57]	1.0	1.0	1.0	1.0	2.84	1.0
DT [19]	0.99	1.0	0.99	0.99	0.000011	0.99
NB [58]	n/a	n/a	n/a	n/a	n/a	n/a
RF [24]	1.0	1.0	1.0	1.0	0.0020	1.0
ANN [25]	0.99	0.99	1.0	0.99	5.30	0.5
LR [31]	0.99	1.0	0.99	0.99	0.00028	0.99

Dataset	Best Model
Dataset	No Weighted Classes	Weighted Classes
Data Exfiltrantion	RF	RF
DDoS HTTP	DT	ANN
DDoS TCP	RF	RF
DDoS UDP	KNN	RF
Keylogging	DT	DT
OS Scan	RF	ANN
Service scan	RF	ANN
DoS HTTP	DT	ANN
DoS TCP	KNN	DT
DoS UDP	NB	SVM
Most Occurrences	RF	ANN

					Predicted
True	0	1	2	3	4	5	6	7	8	9	10
0	172	0	0	2	0	1	107	50	0	2	1
1	1	4	0	0	0	2	0	0	0	0	0
2	0	0	965	4	2	0	0	0	43	1	0
3	0	0	1	56368	3	0	0	0	0	2	3
4	0	0	0	4	55296	0	0	0	0	0	2
5	0	1	0	0	0	80	0	0	0	0	0
6	48	0	0	0	0	0	18294	565	0	0	0
7	12	0	0	0	0	0	395	55357	0	0	0
8	0	0	38	6	3	0	0	0	1427	1	0
9	0	0	2	9	1	0	0	0	4	55218	2
10	0	0	0	4	1	0	0	0	0	1	55496

					Predicted
True	0	1	2	3	4	5	6	7	8	9	10
0	10	0	0	2	3	0	183	111	6	15	5
1	0	0	0	0	4	0	0	0	2	1	0
2	0	0	0	296	6	0	0	0	79	630	4
3	0	0	0	19626	17561	0	0	0	55	17778	1357
4	0	0	0	429	54506	0	0	0	0	2	365
5	0	0	0	0	7	0	0	0	72	2	0
6	0	0	0	1	0	0	13056	5779	1	41	29
7	0	0	0	1	0	0	3097	52658	0	8	0
8	0	0	0	512	17	0	0	0	56	885	5
9	0	0	0	1804	442	0	0	0	48	52933	9
10	0	0	0	4021	5139	0	0	0	0	22	46319

					Predicted
True	0	1	2	3	4	5	6	7	8	9	10
0	3	0	0	1	20	0	98	227	0	1	4
1	0	0	0	0	3	0	0	0	0	0	0
2	0	0	0	0	1084	0	0	0	0	0	0
3	0	0	0	55648	0	0	0	0	0	0	0
4	0	0	0	0	55460	0	0	0	0	0	0
5	0	0	0	0	84	0	0	0	0	0	0
6	0	0	0	0	0	0	9620	9474	0	0	0
7	0	0	0	0	0	0	0	55504	0	0	0
8	0	0	0	0	0	0	0	0	1597	0	0
9	1	0	0	0	0	0	0	0	0	55784	0
10	0	0	0	0	0	0	0	0	0	0	55387

					Predicted
True	0	1	2	3	4	5	6	7	8	9	10
0	297	2	0	1	3	10	0	21	0	10	3
1	0	6	0	0	3	0	0	0	0	0	0
2	0	0	0	0	1055	0	0	0	0	0	0
3	0	0	0	55716	0	0	0	0	0	0	0
4	0	0	0	0	55324	0	0	0	0	0	0
5	0	0	0	0	0	70	0	0	0	0	0
6	1037	0	0	0	0	0	17965	0	0	0	0
7	1442	0	0	0	0	0	0	54099	0	0	0
8	0	0	0	0	0	0	0	0	0	0	1520
9	0	0	0	0	0	0	0	0	0	55745	0
10	0	0	0	0	0	0	0	0	0	0	55674

					Predicted
True	0	1	2	3	4	5	6	7	8	9	10
0	33	1	0	1	0	9	224	62	1	5	0
1	0	7	0	0	0	0	0	0	0	0	0
2	2	0	1008	0	0	0	0	7	0	0	0
3	29	0	0	56296	0	0	0	29	23	0	0
4	2	0	0	0	55298	0	0	2	0	0	0
5	0	0	0	0	0	81	0	0	0	0	0
6	67	0	0	0	0	0	18199	641	0	0	0
7	362	0	0	0	0	0	15754	39648	0	0	0
8	0	0	0	0	0	0	0	0	1475	0	0
9	1	0	0	0	0	0	0	55	0	55180	0
10	1	0	0	0	0	0	0	1	0	0	55499

				Predicted
True	1	2	3	4	5	6	7	8	9	10
0	0	0	16	2	0	31	100	0	172	14
1	0	0	5	2	0	0	0	0	0	0
2	0	0	955	20	0	0	0	0	0	0
3	0	0	56377	0	0	0	0	0	0	0
4	0	0	95	55207	0	0	0	0	0	0
5	0	0	77	4	0	0	0	0	0	0
6	0	0	1	0	0	9238	9514	0	151	3
7	0	0	0	0	0	0	55716	0	47	1
8	0	0	1422	0	0	0	0	0	0	53
9	0	0	0	0	0	0	0	0	55229	7
10	0	0	10	0	0	0	0	0	0	55491

					Predicted
True	0	1	2	3	4	5	6	7	8	9	10
0	250	1	1	0	1	11	8	32	10	12	9
1	0	7	0	0	0	0	0	0	0	0	0
2	0	0	1013	0	0	2	0	0	0	0	0
3	0	0	231	37544	18602	0	0	0	0	0	0
4	0	0	0	76	55226	0	0	0	0	0	0
5	0	4	1	0	0	76	0	0	0	0	0
6	180	11	0	0	0	0	17748	514	443	11	0
7	333	0	0	0	0	0	16281	38936	131	83	0
8	0	0	0	0	0	0	2	0	1473	0	0
9	3758	0	0	0	0	0	0	0	25	50448	1005
10	1	0	0	0	0	0	0	0	0	0	55500

					Predicted
True	0	1	2	3	4	5	6	7	8	9	10
0	4	0	1	14	0	0	219	86	4	6	1
1	0	0	0	7	0	0	0	0	0	0	0
2	0	0	981	34	0	0	0	0	0	0	0
3	0	0	27	56349	1	0	0	0	0	0	0
4	0	0	0	4	55298	0	0	0	0	0	0
5	0	0	0	81	0	0	0	0	0	0	0
6	0	0	0	9	0	0	15312	3586	0	0	0
7	0	0	0	0	0	0	2701	53063	0	0	0
8	0	0	0	59	0	0	0	0	1415	0	1
9	0	0	0	55	0	0	19	0	1	55161	0
10	0	0	0	2	0	0	0	0	0	0	55499

PERMALINK

An Experimental Analysis of Attack Classification Using Machine Learning in IoT Networks

Andrew Churcher

Rehmat Ullah

Jawad Ahmad

Sadaqat ur Rehman

Fawad Masood

Mandar Gogate

Fehaid Alqahtani

Boubakr Nour

William J Buchanan

Abstract

1. Introduction

Table 1.

2. Background and Related Work

2.1. Intrusion Detection System

2.2. IoT Intrusion Detection Using Machine Learning

2.2.1. K-Nearest Neighbor

2.2.2. Support Vector Machine

2.2.3. Decision Tree

2.2.4. Random Forest

2.2.5. Naive Bayes

2.2.6. ANN

2.2.7. Logistic Regression

2.3. Internet of Things Attacks

2.3.1. Data Exfiltration

2.3.2. DoS and DDoS

2.3.3. Keylogging

2.3.4. OS Scan and Service Scan

3. Performance Evaluation

3.1. Benchmark Data

3.2. Performance Evaluation Metrics

3.2.1. Confusion Matrix

Table 2.

Table 3.

3.2.2. Accuracy

3.2.3. Precision

3.2.4. Recall

3.2.5. F1 Score

3.2.6. Log Loss

3.2.7. ROC AUC

3.2.8. Cohen’s Kappa Coefficient

3.3. Dataset Description

Table 4.

Table 5.

Table 6.

3.4. Implementation

3.4.1. Tools Used

Table 7.

3.4.2. Feature Extraction

3.4.3. Feature Scaling

3.4.4. Multi-Class Dataset

3.4.5. Training Data

3.4.6. Test Data

3.5. Results and Discussion

3.5.1. Binary Classification

Table 8.

Table 9.

Table 10.

Table 11.

Table 12.

Table 13.

Table 14.

Table 15.

Table 16.

Table 17.

Table 18.

Table 19.

Table 20.

Table 21.

Table 22.

Table 23.

Table 24.

Table 25.

Table 26.

Table 27.

Table 28.

Table 29.

Table 30.

Table 31.

					Predicted
					Predicted
True	0	1	2	3	4	5	6	7	8	9	10
0	0	0	1	1	0	12	228	81	4	6	2
1	0	0	0	0	7	0	0	0	0	0	0
2	0	0	1010	0	5	0	0	0	0	0	0
3	0	0	0	56377	0	0	0	0	0	0	0
4	1	0	0	2	55299	0	0	0	0	0	0
5	0	0	0	0	0	81	0	0	0	0	0
6	0	0	0	0	0	0	16950	1956	0	1	0
7	0	0	0	0	0	0	4195	51569	0	0	0
8	0	0	0	0	0	0	0	0	1473	0	2
9	0	0	0	0	0	0	0	0	4	55232	0
10	0	0	0	0	0	0	0	0	0	1	55500

					Predicted
True	0	1	2	3	4	5	6	7	8	9	10
0	0	0	0	11	2	0	200	101	0	9	12
1	0	0	0	7	0	0	0	0	0	0	0
2	0	0	0	93	6	0	0	0	308	601	7
3	0	0	0	16470	6901	0	0	0	44	24490	8472
4	0	0	0	145	49111	0	0	0	0	71	5947
5	0	0	0	81	0	0	0	0	0	0	0
6	2	0	0	0	0	0	14690	4186	0	0	29
7	0	0	0	0	0	0	3713	52049	0	0	2
8	0	0	0	139	18	0	9	0	482	819	8
9	0	0	0	274	453	0	2	0	435	54035	37
10	0	0	0	10658	9195	0	0	0	0	15	35633

					Predicted
					Predicted
True	0	1	2	3	4	5	6	7	8	9	10
0	291	4	0	0	2	7	10	14	0	6	1
1	0	7	0	0	0	0	0	0	0	0	0
2	0	9	692	0	2	0	0	0	306	2	4
3	0	105	432	14010	7516	0	0	0	1004	23636	9674
4	0	44	50	186	52423	0	0	0	34	71	2494
5	0	3	0	0	0	78	0	0	0	0	0
6	3570	0	0	0	0	0	13753	1582	2	0	0
7	4353	0	0	0	0	0	9652	41758	0	0	1
8	0	15	710	0	8	0	0	0	736	4	2
9	0	0	954	424	453	0	11	0	2479	50907	8
10	0	177	197	10008	9850	0	0	0	216	16	35037