Deep belief network based intrusion detection techniques: A survey

doi:10.1016/j.eswa.2020.114170

Expert Systems with Applications

Volume 167, 1 April 2021, 114170

https://doi.org/10.1016/j.eswa.2020.114170

Highlights

• Overview of the data set and the performance metric in intrusion detection.
• Description of Deep Belief Network technology.
• Comparative analysis of DBN-IDS models proposed from 2013 to 2019.

Abstract

With the recent growth in the number of IoT devices, the amount of personal, sensitive, and important data flowing through the global network has grown rapidly. At the same time, malicious attempts to access important information or damage the network have become more complex and advanced. Cybersecurity has therefore become an important issue in the evolution toward future networks that can react to and counter such threats. Intrusion detection is an important part of cybersecurity technology, with the goal of monitoring and analyzing network traffic from various resources and detecting malicious activities. In recent years, deep learning based deep neural network (DNN) techniques have been utilized as a key solution for detecting malicious attacks, and among the many DNNs, the deep belief network (DBN) has been the most influential technique. There have been many attempts to survey a wide range of intrusion detection research based on machine learning and deep learning techniques, including DBN, but these surveys fail to provide a complete review of all aspects related to DBN based intrusion detection models. Unlike previous survey papers, we first provide basic concepts on the data set, the performance metric, and restricted Boltzmann machines to help readers understand the basic DBN based intrusion detection model. Finally, a complete review and analysis of previously published works on DBN based IDS models is provided.

Introduction

Due to ever-increasing connections between digital devices related to smart homes, transportation, manufacturing, healthcare, and monitoring, society is experiencing increased productivity powered by Internet of Things (IoT) technology (Park et al., 2016; Kar and Sanyal, 2018; Gyamfi et al., 2019; Nord et al., 2019). However, the massive volume of data flowing through the network between IoT devices contains personal, sensitive, and important information. Thus, cybersecurity is the key to the success of IoT. It is defined as the set of technologies designed to identify, protect, detect, respond, and recover with respect to malicious attacks on computing devices, networks, software, and data. Among these five core elements of cybersecurity, an intrusion detection system (IDS) addresses the first and third by identifying hostile activities within normal traffic data and by detecting or classifying known attacks or zero-day attacks.

Since Anderson introduced the concept of intrusion detection in 1980 (Anderson, 1980), machine learning (ML) technologies such as neural networks (NN), k-nearest neighbor (KNN), support vector machine (SVM), and decision tree (DT) have been the key players in IDS research. However, due to the drastic increase in the volume and complexity of network traffic data, traditional ML based IDS with shallow structures are unsuited to the IoT era with its billions of devices. Thus, deep learning (DL) techniques have been applied to the conventional NN architecture under the name of deep neural network (DNN). The major DNN models are the deep belief network (DBN), stacked autoencoder (SAE), convolutional neural network (CNN), and recurrent neural network (RNN). A DBN is a DNN composed of multiple restricted Boltzmann machines (RBMs) that are trained in an unsupervised manner and then fine-tuned with the back-propagation algorithm. DBN is the most important and most frequently used technology in state-of-the-art IDS models and is the main topic of this paper. SAE has a structure similar to DBN, with multiple layers of autoencoders (AEs). An AE consists of an encoder that is trained to represent the input with reduced dimension and a decoder that approximately reconstructs the input data. Mohammadi and Namadchian (2017) proposed an IDS model based on AE and a memetic algorithm, Farahnakian and Heikkonen (2018) presented another IDS model based on AE with softmax, and Ieracitano, Adeel, Morabito, and Hussain (2020) proposed a different AE-IDS model based on statistical analysis of the input data. CNN is another popular DNN, with a hierarchical structure similar to that of digital images. The basic components of a CNN are the convolutional layer, the pooling layer, and the classification layer. McLaughlin, Martinez del Rincon, Kang, and Yerima (2017) proposed a CNN based malware detection model, and Nix and Zhang (2017) presented a different detection model based on CNN for sequence classification.

RNN has a different architecture from DBN, SAE, and CNN because of its cyclic connections, which allow past network activation states to be used in the current state to better represent time-dependent signals. Kim, Kim, Thu, and Kim (2016) investigated the use of long short-term memory (LSTM) in RNN based IDS, and Yin, Zhu, Fei, and He (2017) proposed a DL approach based on RNN for binary and multiclass classification.
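
As a rough illustration of the DBN construction described above (a stack of RBMs pre-trained without labels), the following is a minimal pure-Python sketch and not the implementation of any surveyed work. It assumes binary units and one-step contrastive divergence (CD-1) for the unsupervised phase. The names `RBM`, `cd1_update`, and `pretrain_stack`, the learning rate, and the layer sizes are hypothetical, and the supervised back-propagation fine-tuning step is omitted.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


class RBM:
    """Binary restricted Boltzmann machine (illustrative, assumed CD-1 training)."""

    def __init__(self, n_visible, n_hidden, seed=0):
        self.rng = random.Random(seed)
        self.n_visible = n_visible
        self.n_hidden = n_hidden
        # w[i][j] links visible unit i to hidden unit j; an RBM has no
        # visible-visible or hidden-hidden connections.
        self.w = [[self.rng.gauss(0.0, 0.01) for _ in range(n_hidden)]
                  for _ in range(n_visible)]
        self.b = [0.0] * n_visible  # visible biases
        self.c = [0.0] * n_hidden   # hidden biases

    def hidden_probs(self, v):
        return [sigmoid(self.c[j] + sum(v[i] * self.w[i][j]
                                        for i in range(self.n_visible)))
                for j in range(self.n_hidden)]

    def visible_probs(self, h):
        return [sigmoid(self.b[i] + sum(h[j] * self.w[i][j]
                                        for j in range(self.n_hidden)))
                for i in range(self.n_visible)]

    def sample(self, probs):
        return [1.0 if self.rng.random() < p else 0.0 for p in probs]

    def cd1_update(self, v0, lr=0.1):
        """Apply one CD-1 update for a single example; return squared reconstruction error."""
        ph0 = self.hidden_probs(v0)     # positive phase
        h0 = self.sample(ph0)
        pv1 = self.visible_probs(h0)    # one Gibbs step (negative phase)
        ph1 = self.hidden_probs(pv1)
        for i in range(self.n_visible):
            for j in range(self.n_hidden):
                self.w[i][j] += lr * (v0[i] * ph0[j] - pv1[i] * ph1[j])
        for i in range(self.n_visible):
            self.b[i] += lr * (v0[i] - pv1[i])
        for j in range(self.n_hidden):
            self.c[j] += lr * (ph0[j] - ph1[j])
        return sum((a - r) ** 2 for a, r in zip(v0, pv1))


def pretrain_stack(data, layer_sizes, epochs=50, lr=0.1):
    """Greedy layer-wise pre-training: each RBM learns from the layer below."""
    rbms, inputs = [], data
    for k, n_hidden in enumerate(layer_sizes):
        rbm = RBM(len(inputs[0]), n_hidden, seed=k)
        for _ in range(epochs):
            for v in inputs:
                rbm.cd1_update(v, lr)
        rbms.append(rbm)
        # Hidden activations become the "visible" data of the next RBM.
        inputs = [rbm.hidden_probs(v) for v in inputs]
    return rbms
```

For example, `pretrain_stack([[1, 1, 0, 0], [0, 0, 1, 1]], [3, 2])` would return two RBMs of sizes 4-3 and 3-2. In a full DBN classifier, an output layer would then be added on top and all weights fine-tuned with labeled data.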

Since the survey by Nguyen and Armitage (2008), one of the first major overviews of traffic classification based on ML techniques, most survey works on IDS have focused on ML techniques. Xin, Kong, Liu, and Chen (2018) present literature surveys on ML and DNN with a detailed description of the data sets used with the techniques, but do not cover the basic ML concepts that would help readers understand ML based IDS techniques. Mishra, Varadharajan, Tupakula, and Pilli (2019) present a detailed investigation of ML based intrusion detection techniques with a focus on attack features, but do not include DL based techniques. Mahdavifar and Ghorbani (2019) provide a survey on intrusion detection, malware detection, and phishing/spam detection based on DL, but do not give the data set and performance metric information used in DL based IDS models. Berman, Buczak, Chavis, and Corbett (2019) provide basic concepts on DL methods with a survey of application works in DNN-IDS, but do not contain a detailed comparative analysis of the surveyed IDS works. Aldweesh, Derhab, and Emam (2020) survey DL based IDS models from 2014 to 2018, but do not analyze important IDS models that optimize data features and system hyperparameters.

In contrast to past surveys on DL based IDS that cover a wide range of ML techniques, this paper focuses on one specific technique, the one most popular among the many DL methods: the DBN technology. To the best of our knowledge, this paper is the first survey to present the basic concepts of the data set, the performance metric, and the DBN, and to provide a detailed review of the most important works on DBN based IDS from 2013 to 2020. Furthermore, we evaluate and compare the various DBN based detection models in terms of the data set, structure, optimization algorithm, and applications utilized in the surveyed DBN based IDS research works.

The remainder of the paper is organized as follows. Section 2 presents basic concepts related to the data set and performance metrics used in intrusion detection research. Section 3 provides an overview of basic concepts related to the restricted Boltzmann machine and the deep belief network. In Section 4, we review the different methods proposed for DBN based IDS models, and in Section 5 we analyze them based on various criteria. Finally, concluding remarks are given in Section 6.

Section snippets

Data set

One of the most important factors in building an intrusion detection system (IDS) is the selection of the data set. The chosen data set is used not only to train an IDS model, but also to evaluate the effectiveness of a proposed IDS model. Due to the difficulty of directly collecting real-time attack and normal network traffic, publicly available standard data sets, such as KDD Cup 99, NSL-KDD, UNSW-NB15, and ADFA, are commonly used in the intrusion detection research community for comparative

Restricted Boltzmann Machine

Restricted Boltzmann machines (RBMs) (Hinton, 2012) are Boltzmann machines (BMs) (Aarts & Korst, 1989) without connections between visible units in the visible layer or between hidden units in the hidden layer, as shown in Fig. 1. BMs are probabilistic graph models that consist of visible units, representing observations, and hidden units, representing hidden features. Based on the visible variables v = (v₁, v₂, …, v_n) and hidden variables h = (h₁, h₂, …, h_m), the joint distribution of a RBM’s

Comparative analysis of DBN based IDS methods

In this section, we present research works on intrusion detection based on the deep belief network (DBN). The goal of this section is to describe and compare the key algorithm, training method, data set used, and performance results reported by the authors. Fiore, Palmieri, Castiglione, and De Santis (2013) presented an intrusion detection system (IDS) model that is one of the first works to apply DBN to IDS. The proposed IDS consists of a discriminative RBM (DRBM) that is trained in

Discussions

In this section, we study the general framework of DBN-IDS based on the 16 important research works, published from 2013 to 2020, that were presented in the previous section. To make the general framework easier to understand, we divide it into the aspects shown in Fig. 3: a training data preprocessor, a DBN classifier, a DBN optimizer, and a fine-tuning algorithm.
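
To make the first of these aspects concrete, the sketch below shows what a training data preprocessor might do before records reach a DBN. This snippet of the paper does not say which steps the surveyed works use, so the choice of one-hot encoding for a categorical field and min-max scaling for numeric fields is an assumption, and the function names and sample records are hypothetical.

```python
def one_hot(value, categories):
    """Encode a categorical value as a 0/1 vector over the known categories."""
    return [1.0 if value == c else 0.0 for c in categories]


def min_max_scale(column):
    """Rescale a numeric column to [0, 1]; a constant column maps to all zeros."""
    lo, hi = min(column), max(column)
    if hi == lo:
        return [0.0 for _ in column]
    return [(x - lo) / (hi - lo) for x in column]


def preprocess(records, categorical_index, numeric_indices):
    """Assumed preprocessing: one-hot one categorical field, scale the numeric ones."""
    categories = sorted({r[categorical_index] for r in records})
    scaled = {i: min_max_scale([float(r[i]) for r in records])
              for i in numeric_indices}
    rows = []
    for n, r in enumerate(records):
        row = one_hot(r[categorical_index], categories)
        row += [scaled[i][n] for i in numeric_indices]
        rows.append(row)
    return rows, categories


# Made-up records in the spirit of connection logs: (protocol, count, bytes).
records = [("tcp", 0, 50), ("udp", 2, 0), ("icmp", 0, 100)]
rows, categories = preprocess(records, categorical_index=0, numeric_indices=[1, 2])
```

Scaling inputs to the unit interval is one way to make them compatible with RBM visible units that model values in [0, 1]. Other encodings and normalizations are equally plausible.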

Conclusions

To provide a complete review of DBN based IDS models from the past to the present, and to help readers understand the basic architecture of the proposed models, we started the paper with an overview of the data sets and the performance metrics used in the intrusion detection research community. The data sets introduced in this paper were KDD Cup 99, NSL-KDD, UNSW-NB15, and ADFA, which are publicly available standard data sets. Among many performance metrics used in intrusion

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgments

This research was supported by the Basic Science Research Program through the National Research Foundation of Korea (NRFK) funded by the Ministry of Education (2018R1D1A1B07041981).

References (54)

A.A. Diro et al. Distributed attack detection scheme using deep learning approach for Internet of Things. Future Generation Computer Systems (2018)
S.M. Erfani et al. High-dimensional and large-scale anomaly detection using a linear one-class SVM with deep learning. Pattern Recognition (2016)
U. Fiore et al. Network anomaly detection with the restricted Boltzmann machine. Neurocomputing (2013)
K.S. Gyamfi et al. Heartbeat design for energy-aware IoT: Are your sensors alive? Expert Systems with Applications (2019)
S. Huda et al. Defending unknown attacks on cyber-physical systems by semi-supervised approach and available unlabeled data. Information Sciences (2017)
S. Huda et al. A malicious threat detection model for cloud assisted internet of things (CoT) based industrial control system (ICS) networks using deep belief network. Journal of Parallel and Distributed Computing (2018)
C. Ieracitano et al. A novel statistical analysis and autoencoder driven intelligent intrusion detection approach. Neurocomputing (2020)
U.N. Kar et al. An overview of device-to-device communication in cellular networks. ICT Express (2018)
S. Mahdavifar et al. Application of deep learning to cybersecurity: A survey. Neurocomputing (2019)
J.H. Nord et al. The Internet of Things: Review and theoretical framework. Expert Systems with Applications (2019)
H. Park et al. Recent advancements in the Internet-of-Things related standards: A oneM2M perspective. ICT Express (2016)
L. Zhou et al. An approach for overlapping and hierarchical community detection in social networks based on coalition formation game theory. Expert Systems with Applications (2015)
E. Aarts et al. Simulated annealing and Boltzmann machines: A stochastic approach to combinatorial optimization and neural computing (1989)
ADFA dataset....
A. Aldweesh et al. Deep learning approaches for anomaly-based intrusion detection systems: A survey, taxonomy, and open issues. Knowledge-Based Systems (2020)
Z. Alom et al. Intrusion detection using deep belief networks
Anderson, J. P. (1980) Computer security threat monitoring and surveillance. Technical Report, James P. Anderson...
D.S. Berman et al. A survey of deep learning methods for cyber security. Information (2019)
G. Creech et al. Generation of a new IDS test dataset: Time to retire the KDD collection
Y. Ding et al. Application of Deep Belief Networks for opcode based malware detection
W. Elmasry et al. Evolving deep learning architectures for network intrusion detection using a double PSO metaheuristic. Computer Networks (2020)
F. Farahnakian et al. A deep auto-encoder based approach for intrusion detection system
N. Gao et al. An intrusion detection model based on deep belief networks
A.M. Hay. The derivation of global estimates from a confusion matrix. Remote Sensing Letters (1988)
G.E. Hinton. Training products of experts by minimizing contrastive divergence. Neural Computation (2002)
G.E. Hinton et al. A fast learning algorithm for deep belief nets. Neural Computation (2006)
Hinton, G. E. (2012). A practical guide to training restricted Boltzmann machines. In Neural networks: Tricks of the...
