Abstract
One of the most effective methods of training a model for intrusion detection requires a very good selection of features from the data and efficient and robust training algorithms to facilitate a better prediction model. Choosing features scoring above a certain threshold allows for the removal of unrelated features following which makes the job of a prediction model easier. The study aims to identify and select the highly correlated features after feature reduction for training the model and then employ various machine learning algorithms to make the classifications with tree-based ensemble learning techniques and non-linear SVM. The dataset from NSL-KDD which is a version derived from the KDD’99 Cup dataset is considered. Implementation is carried out in Python 3 using the scikit-learn machine learning library which is built upon SciPy. Further, the performances of various machine learning classifiers will be evaluated to test for and compare the detection metrics.









Similar content being viewed by others
Data availability
The dataset used and analyzed during the current study are available from the corresponding author on reasonable request.
References
Levin I. KDD-99 classifier learning contest: LLSoft’s results overview. SIGKDD Explor. 2000;1:67–75.
Kohavi R, Brodley CE, Frasca B, Mason L, Zheng Z. KDD-Cup 2000 organizers’ report: peeling the onion. SIGKDD Explor Newsl. 2000;2:86–93.
NSL-KDD dataset, available on: https://www.unb.ca/cic/datasets/nsl.html. Accessed on Jan 2022.
Google Patents (2018): Method and system for network intrusion detection‟, available on : https://patents.google.com/patent/US20180241763A1/en, accessed on Apr 2021
Relan NG, Patil DR. Implementation of network intrusion detection system using variant of decision tree algorithm, International Conference on Nascent Technologies in the Engineering Field (ICNTE). 2015.
Dhanabal L, Shantharajah SP. A Study on NSL-KDD dataset for intrusion detection system based on classification algorithms. Int J Adv Res Comput Commun Eng. 2015;4(6):446–52.
Nkiama H, Zainudeen S, Saidu M. A subset feature elimination mechanism for intrusion detection system. Int J Adv Comput Sci Appl. 2016;7(4):148–57.
Senthilnayaki B, Venkatalakshmi K, Kannan A. Intrusion detection using optimal genetic feature selection and SVM based classifier. In: 3rd International Conference on Signal Processing, Communication and Networking (ICSCN). 2015.
Osanaiye O, Cai H, Raymond Choo K-K, Dehghantanha A, Xu Z, Dlodlo M. Ensemble-based multi-filter feature selection method for DDoS detection in cloud computing. EURASIP J Wirel Commun Netw. 2016;130.
Thaseen IS, Aswani Kumar C. Intrusion detection model using fusion of chi- square feature selection and multi class SVM. J King Saud Univ Comput Inf Sci. 2017;29(4):462–72.
Al Mehedi Hasan Md, Nasser M, Ahmad S, Islam Molla K. Feature selection for intrusion detection using random forest. J Inf Secur. 2016;7(3):129–40.
Singh KJ, De T. Efficient classification of DDoS attacks using an ensemble feature selection algorithm. J Intell Syst. 2020;29(1):71–83.
Farnaaz N, Jabbar MA. Random forest modelling for network intrusion detection system. In: 12th International Conference on Information Processing, 2016; p. 213–17
Aziz ASA, EL-Ola Hanafi S, Ella-Hassanien A. Comparison of classification techniques applied for network intrusion detection and classification. J Appl Logic. 2017;24:109–18.
Sultana A, Jabbar MA. Intelligent network intrusion detection system using data mining techniques. In: 2nd International Conference on Applied and TheoriticalComputing and Communication Technology, 2016, p. 329–33
Kang SH, Kim KJ. A feature selection approach to find optimal feature subsets for the network intrusion detection system. Clust Comput. 2016;19:325–33.
Abdollahi J, Davari N, Panahi Y, Gardaneh M. Detection of metastatic breast cancer from whole-slide pathology images using an ensemble deep-learning method: detection of breast cancer using deep-learning. Arch Breast Cancer. 2022;9(3):364–76.
Abdollahi J, Keshandehghan A, Gardaneh M, Panahi Y, Gardaneh M. Accurate detection of breast cancer metastasis using a hybrid model of artificial intelligence algorithm. Arch Breast Cancer. 2020. https://doi.org/10.32768/abc.20207118-24.
Amani F, Abdollahi J, Mohammadnia A, Amani P, Fattahzadeh-Ardalani G. Using Stacking methods based Genetic Algorithm to predict the time between symptom onset and hospital arrival in stroke patients and its related factors. JBE. 2022;8(1):8–23.
Abdollahi J, Moghaddam BN, Parvar ME. Improving diabetes diagnosis in smart health using genetic-based Ensemble learning algorithm. Approach to IoT Infrastructure. Future Gen Distrib Syst J. 2019;1:23–30.
Abdollahi J, Nouri-Moghaddam B & Ghazanfari M. Deep Neural Network Based Ensemble learning Algorithms for the healthcare system (diagnosis of chronic diseases). arXiv preprint arXiv:2103.08182. 2021.
Abdollahi J, Nouri-Moghaddam B. Hybrid stacked ensemble combined with genetic algorithms for diabetes prediction. Iran J Comput Sci. 2022;5:205–20. https://doi.org/10.1007/s42044-022-00100-1.
Abdollahi J, Nouri-Moghaddam B. A hybrid method for heart disease diagnosis utilizing feature selection based ensemble classifier model generation. Iran J Comput Sci. 2022;5:229–46. https://doi.org/10.1007/s42044-022-00104-x.
Scikit learn, available on: https://scikit-learn.org/stable, accessed on Jan 2022.
Singh KJ, De T. Analysis of application layer DDoS Attack Detection parameters using statistical classifiers. Internetworking Indones J. 2017;9(2):23–31.
Mukherjee S, Sharma N. Intrusion detection using naive bayes classifier with feature reduction, 2012. Procedia Technol. 2012;4:119–28.
Parsazad S, Saboori E, Allahyar. Fast Feature Reduction in intrusion detection datasets. In: MIPRO 2012 Proceedings of the 35th International Convention, 2012; p. 1023–9.
Alazab A, Hobbs M, Abawajy J and Alazab M. Using feature selection for intrusion detection system, (2012). In: International Symposium on Communications and Information Technologies (ISCIT), 2012; p. 296–301.
Zhang F and Wang D. An effective feature selection approach for network intrusion detection. In: 2013 IEEE Eighth International Conference on Networking, Architecture and Storage, 2013; p. 307–11.
Funding
There is no funding for the project.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of Interest
On behalf of all authors, the corresponding author states that there is no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
This article is part of the topical collection “Research Trends in Computational Intelligence” guest edited by Anshul Verma, Pradeepika Verma, Vivek Kumar Singh and S. Karthikeyan.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Johnson Singh, K., Maisnam, D. & Chanu, U.S. Intrusion Detection System with SVM and Ensemble Learning Algorithms. SN COMPUT. SCI. 4, 517 (2023). https://doi.org/10.1007/s42979-023-01954-3
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s42979-023-01954-3