Intrusion detection in cyber-physical systems using a generic and domain specific deep autoencoder model

doi:10.1016/j.compeleceng.2021.107044

Computers & Electrical Engineering

Volume 91, May 2021, 107044

Abstract

The rapid growth of network-related services in the last decade has produced a huge amount of sensitive data on the internet. But networks are highly prone to intrusions, where unauthorized users attempt to access sensitive information and even disrupt the system. Building a competent network intrusion detection system (IDS) is necessary to prevent such attacks. IDSs generally use machine learning algorithms to classify the attacks, but the features used for classification are not always suitable or sufficient. Besides, the number of intrusions is much smaller than the number of non-intrusions, so naive approaches may fail to provide acceptable performance because of this class imbalance. To counter this problem, we propose a model that extracts useful features from the given features and then uses a deep learning algorithm to classify the intrusions. Note that the underlying data points cannot be treated as samples from a single distribution; rather, they come from two different distributions: one generic to all network intrusions, and the other specific to the domain. Keeping this in mind, we propose a unique Generic-Specific autoencoder architecture in which the generic autoencoder learns the features that are common across all forms of network intrusions, and the specific ones learn features that pertain only to a given domain. The model has been evaluated on the CICIDS2017 dataset, which is the largest dataset of this type available online, and we have set new benchmark results on this dataset. Source code of this work is available at: https://github.com/SoumyadeepThakur/Intrusion-AE

Introduction

The use of network-related services has increased over the years, and with it the amount of sensitive data on the internet. Networks are prone to intrusions in which unauthorized users with malicious intent gain entry to a system on a network and attempt to obtain sensitive information or sabotage the system. Despite numerous network security methods, cyber-attacks still occur. Network intrusion is, therefore, a prime concern, and network intrusion detection systems (NIDSs) are necessary to prevent such attacks. The information obtained by analyzing the packet data during attacks may help to detect such attacks in the future. Moreover, if after detecting an intrusion we can further classify its type, for example Denial of Service (DoS) or Cross-Site Scripting (XSS), more effective countermeasures can be taken to prevent such attacks. One important factor to keep in mind while building a NIDS is that an intrusion wrongly classified as a non-intrusion is more dangerous than a false alarm (a non-intrusion classified as an intrusion).
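The asymmetry between missed intrusions and false alarms can be made concrete with a cost-weighted error measure. The following is a minimal sketch, not part of the proposed model: the function names and the 10:1 cost ratio are illustrative assumptions, chosen only to show how a missed intrusion can be penalized more heavily than a false alarm.

```python
def detection_rates(y_true, y_pred):
    """Return (miss_rate, false_alarm_rate) for binary labels, 1 = intrusion."""
    missed = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    alarms = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    positives = sum(y_true)
    negatives = len(y_true) - positives
    miss_rate = missed / positives if positives else 0.0
    false_alarm_rate = alarms / negatives if negatives else 0.0
    return miss_rate, false_alarm_rate


def weighted_cost(y_true, y_pred, miss_cost=10.0, alarm_cost=1.0):
    """Combine both error rates; the cost values are hypothetical, not from the paper."""
    miss_rate, false_alarm_rate = detection_rates(y_true, y_pred)
    return miss_cost * miss_rate + alarm_cost * false_alarm_rate


# Toy ground truth: two intrusions followed by four benign flows.
y_true = [1, 1, 0, 0, 0, 0]
misses_one = [0, 1, 0, 0, 0, 0]  # one intrusion goes undetected
alarms_one = [1, 1, 1, 0, 0, 0]  # one benign flow raises an alarm

print(weighted_cost(y_true, misses_one))
print(weighted_cost(y_true, alarms_one))
```

Under these assumed costs, the detector that misses one intrusion scores far worse than the one that raises one false alarm, even though each makes a single mistake.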

Cyber-physical systems use these network services to integrate computation with physical processes through feedback loops. They consist of sensors, actuators, and other components that communicate with each other over a network. At the lower levels, this communication network uses the same protocols as a computer network, such as TCP/IP and wireless protocols. Hence, cyber-physical systems are prone to the same intrusions as an ordinary computer network. However, cyber-physical systems are safety-critical, and a sudden failure, whether caused by a cyber-attack or otherwise, can severely damage the physical systems being controlled and harm the people who depend on them. Thus, it is of prime importance that such attacks are prevented. A cyber-physical system (CPS) combines computing resources and physical processes so that useful control is performed through computation and communication with the connected devices [1]. It enables remote access to and control of systems, devices, and machines, and is thus essential in many industrial environments. Nevertheless, the extensive deployment of CPSs brings various security threats, which can severely damage the controlled physical objects and harm the users who rely on them. Hence, a NIDS must be deployed on such systems so that preventive action can be taken before these attacks cause irreparable damage.

Although network monitoring has been used extensively for security, forensics, etc., recent advancements in technology have raised many new challenges [2]. Some of the most pressing issues are:

• Volume: The volume of data continues to drastically grow owing to the increasing popularity of the Internet of Things, cloud-based services, etc. New techniques have to be developed to efficiently and effectively analyze such huge quantities of data.
• Accuracy: To provide performance with the required levels of accuracy, greater levels of granularity and contextual understanding are needed to get a more holistic and comprehensive view.
• Diversity: This is caused by the number of new and different protocols and the vast variety of data traveling through modern networks. This makes it difficult to learn appropriate features that can distinguish between normal and abnormal traffic.
• Low-frequency attacks: These attacks lead to an imbalance in the training set for artificial intelligence approaches, leading to poor precision of detection when they occur.

Moreover, most IDSs use machine learning algorithms to classify the attacks. This requires extracting good features for the different intrusions, which can then be used for supervised learning to identify the attacks. However, sufficient and appropriate traffic data to facilitate proper feature learning is often unavailable. Further, the number of intrusions is much smaller than the number of non-intrusions, which makes training more difficult.
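One common way to soften such class imbalance during training is to weight each class inversely to its frequency. The sketch below illustrates this "balanced" heuristic, n_samples / (n_classes * class_count); it is an assumption made for illustration and is not claimed to be the strategy used in this work. The label names and counts are made up.

```python
from collections import Counter


def balanced_class_weights(labels):
    """Weight each class by n_samples / (n_classes * class_count)."""
    counts = Counter(labels)
    n_samples, n_classes = len(labels), len(counts)
    return {cls: n_samples / (n_classes * count) for cls, count in counts.items()}


# Hypothetical, heavily imbalanced label set: benign traffic dominates.
labels = ["benign"] * 90 + ["dos"] * 8 + ["web"] * 2
weights = balanced_class_weights(labels)

for cls, weight in sorted(weights.items()):
    print(cls, round(weight, 3))
```

With these weights, every class contributes the same total weight to the loss, so the rare attack classes are not drowned out by benign traffic.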

To handle these issues, network traffic data can be collected from different sources, and unsupervised feature learning can be applied to learn appropriate feature representations for these data. These features can then be used to train a classifier on a labeled (and, if convenient, smaller) dataset comprising both benign and anomalous traffic. The traffic data for the labeled dataset can be collected in a confined, isolated, and private network environment. In this paper, we propose a model that identifies only the useful feature attributes from the given feature vector and then uses machine learning algorithms to classify the intrusions using these features. Our proposed model has been evaluated on the CICIDS2017 [3] dataset.
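The two-stage idea described above (unsupervised feature learning on unlabeled traffic, followed by a classifier trained on a smaller labeled set) can be sketched as follows. This is a toy illustration under stated assumptions, not the architecture of this paper: it uses a tiny tied-weight linear autoencoder trained with plain gradient descent and a nearest-centroid classifier as a stand-in, and all function names and data values are hypothetical.

```python
import random


def encode(W, x):
    """Hidden features h = W x."""
    return [sum(w * v for w, v in zip(row, x)) for row in W]


def decode(W, h):
    """Reconstruction with tied weights, r = W^T h."""
    return [sum(W[k][j] * h[k] for k in range(len(W))) for j in range(len(W[0]))]


def reconstruction_error(W, rows):
    """Mean squared reconstruction error over all rows."""
    total = 0.0
    for x in rows:
        r = decode(W, encode(W, x))
        total += sum((a - b) ** 2 for a, b in zip(r, x))
    return total / len(rows)


def train_autoencoder(rows, hidden, epochs=200, lr=0.05, seed=0):
    """Minimise 0.5 * ||W^T W x - x||^2 per sample; labels are never used."""
    rng = random.Random(seed)
    dim = len(rows[0])
    W = [[rng.uniform(-0.5, 0.5) for _ in range(dim)] for _ in range(hidden)]
    for _ in range(epochs):
        for x in rows:
            h = encode(W, x)
            err = [r - v for r, v in zip(decode(W, h), x)]
            grads = []
            for k in range(hidden):
                back = sum(W[k][i] * err[i] for i in range(dim))
                grads.append([h[k] * err[j] + back * x[j] for j in range(dim)])
            for k in range(hidden):
                for j in range(dim):
                    W[k][j] -= lr * grads[k][j]
    return W


def fit_centroids(W, rows, labels):
    """Average the encoded features of each class in the labeled set."""
    sums, counts = {}, {}
    for x, y in zip(rows, labels):
        h = encode(W, x)
        acc = sums.setdefault(y, [0.0] * len(h))
        for i, v in enumerate(h):
            acc[i] += v
        counts[y] = counts.get(y, 0) + 1
    return {y: [v / counts[y] for v in acc] for y, acc in sums.items()}


def predict(W, centroids, x):
    """Assign x to the class whose centroid is nearest in feature space."""
    h = encode(W, x)
    return min(centroids, key=lambda y: sum((a - b) ** 2 for a, b in zip(h, centroids[y])))


# Stage 1: unlabeled toy traffic (three made-up, already scaled features).
unlabeled = [[0.1, 0.8, 0.45], [0.2, 0.9, 0.55], [0.1, 0.7, 0.4],
             [0.9, 0.1, 0.5], [0.8, 0.2, 0.5], [1.0, 0.1, 0.55]]
W = train_autoencoder(unlabeled, hidden=2)

# Stage 2: a smaller labeled set trains the classifier on the learned features.
labeled_rows = [[0.1, 0.8, 0.45], [0.2, 0.9, 0.55], [0.9, 0.1, 0.5], [0.8, 0.2, 0.5]]
labeled_y = ["benign", "benign", "attack", "attack"]
centroids = fit_centroids(W, labeled_rows, labeled_y)

print(predict(W, centroids, [0.95, 0.1, 0.525]))
```

In practice, the autoencoder would be deep and nonlinear and the classifier far stronger; the point here is only the separation between label-free feature learning and supervised classification.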

The rest of the paper is organized as follows. Section 2 surveys the literature, Section 3 describes the proposed method, Section 4 presents the dataset description, the experimental results, and a comparative study of the results, and Section 5 concludes the paper.

Related work

Network intrusion detection is an active field of research, and various machine learning based approaches have been proposed in the literature over the years. In this regard, note that CICIDS2017 is a relatively new dataset, and therefore not many approaches have been tried on it. In this section, first, a general overview of the methods that have been used for IDS is provided, which includes popular approaches such as feature selection, ensemble paradigms, deep learning

Proposed method

Network intrusions vary widely, ranging from simple brute force attacks to very complex attacks such as Distributed DoS. These intrusions can be grouped into domains (i.e., generalized groups) such as DoS, Web attacks, etc. For example, attacks such as SQL injection, Cross-Site Scripting, and Brute Force can be grouped into the more general domain of Web Attacks, as all of them are carried out by exploiting vulnerabilities in a web application.
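The grouping of attack types into domains can be sketched as a simple lookup that builds one pooled training set for a generic model and one training set per domain for the specific models. This is a minimal illustration: the label strings, the assignment of Distributed DoS to the DoS domain, and the choice to skip unmapped labels are assumptions made here, not details taken from the paper.

```python
from collections import defaultdict

# Attack label -> domain. The Web Attack grouping follows the example in the text;
# placing DDoS under DoS is an assumption made for this sketch.
ATTACK_DOMAIN = {
    "SQL Injection": "Web Attack",
    "XSS": "Web Attack",
    "Brute Force": "Web Attack",
    "DoS": "DoS",
    "DDoS": "DoS",
}


def split_generic_specific(records):
    """Split (features, attack_label) pairs into a pooled set and per-domain sets."""
    generic = []
    specific = defaultdict(list)
    for features, label in records:
        domain = ATTACK_DOMAIN.get(label)
        if domain is None:
            continue  # labels without a known domain are skipped in this sketch
        generic.append(features)           # every intrusion feeds the generic model
        specific[domain].append(features)  # and the model of its own domain
    return generic, dict(specific)


# Made-up feature vectors, only to exercise the function.
records = [
    ([0.2, 0.7], "SQL Injection"),
    ([0.3, 0.6], "XSS"),
    ([0.9, 0.1], "DoS"),
    ([0.8, 0.2], "DDoS"),
    ([0.5, 0.5], "Unknown"),
]
generic, specific = split_generic_specific(records)
print(len(generic), {domain: len(rows) for domain, rows in specific.items()})
```

A generic autoencoder would then be trained on the pooled set and one specific autoencoder on each per-domain set.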

Experimental results

In this section, we provide a detailed description of the dataset used, along with the data pre-processing procedure adopted here. The detailed structure of the model is also presented. Finally, a performance analysis of the model is provided followed by a comparative study with other classifiers and models found in the literature. The experiments are done on a Windows 10 – 64-bit PC with 12 GB RAM and CPU Intel(R) [email protected] GHz.

Conclusion

The problem of network intrusion detection can be defined as identifying unlawful access, misuse, and abuse of computer systems by insiders, outsiders, or both. Network intrusions vary widely, ranging from simple brute force attacks to very complex attacks. More specifically, the data points of a network intrusion dataset can be thought of as following two distinct distributions: one generic to all network intrusions, and the other specific to the domains. Based on this fact, we propose a

Declaration of Competing Interest

The authors declare that there is no conflict of interest.

Authors statement

The authors declare that there is no conflict of interest in the submission of the paper to this venue. All the authors agreed to the submission of the paper to this venue. The paper is submitted solely to this venue.

Soumyadeep Thakur is a post-graduate student in the Computer Science and Engineering Department of the Indian Institute of Technology, Bombay. He completed his undergraduate studies in the Department of Computer Science and Engineering of Jadavpur University in 2019. His research interests lie in Deep Learning, Natural Language Processing and Reinforcement Learning.

References (25)

S. Han et al. Intrusion detection in cyber-physical systems: techniques and challenges. IEEE Syst J (2014)
N. Shone et al. A deep learning approach to network intrusion detection. IEEE Trans Emerg Top Comput Intell (2018)
I. Sharafaldin et al. Towards a reliable intrusion detection benchmark dataset. Softw Netw (2017)
B.M. Aslahi-Shahri. A hybrid method consisting of GA and SVM for intrusion detection system. Neural Comput Appl (2016)
S. Sun et al. Wrapper feature selection based on lightning attachment procedure optimization and support vector machine for intrusion detection
S. Mukkamala et al. Intrusion detection using an ensemble of intelligent paradigms. J Netw Comput Appl (2005)
B. Dong and X. Wang, “Comparison deep learning method to traditional methods using for network intrusion detection,”...
A. McDole et al. Analyzing CNN based behavioural malware detection techniques on cloud IaaS. Cloud Comput CLOUD (2020)
M. Gupta et al. Secure V2V and V2I communication in intelligent transportation using cloudlets. IEEE Trans Serv Comput (2020)
D. Gupta et al. Access Control Model for Google Cloud IoT
F. Farahnakian and J. Heikkonen, “A deep auto-encoder based approach for intrusion detection system,” 2018, doi:...
Y. Mirsky, T. Doitshman, Y. Elovici, and A. Shabtai, “Kitsune: an ensemble of autoencoders for online network intrusion...

Anuran Chakraborty received his B.E. degree in Computer Science and Engineering from Jadavpur University, Kolkata, India in 2020. His research interests are Image Processing, Machine Learning and Deep Learning.

Rajonya De received his B.E. degree in Computer Science and Engineering from Jadavpur University, Kolkata, India in 2020. His research interests are Image Processing, Machine Learning and Deep Learning.

Neeraj Kumar was a WoS highly cited researcher in 2019 and 2020 and has published more than 400 research papers in top-cited journals and conferences. His research is supported by funding from various agencies across the globe. His research areas are Green computing and Network management, IoT, Big Data Analytics, Deep learning and Cyber-security. He serves as an editor of various journals.

Ram Sarkar received his Bachelor's degree from the University of Calcutta in 2003. He received his Master's and PhD degrees from Jadavpur University (JU) in 2005 and 2012 respectively. He is an Associate Professor at JU. He was a Fulbright-Nehru Fellow and worked at the University of Maryland, USA during 2014–15. His research interests are Machine and Deep Learning and Image Processing.

This paper is for the CAEE special section VSI-aicps. Reviews were processed and recommended for publication to the Editor-in-Chief by Guest Editor Dr. Ali Dehghantanha.
