Combining Feature Selection and Local Modelling in the KDD Cup 99 Dataset

Porto-Díaz, Iago; Martínez-Rego, David; Alonso-Betanzos, Amparo; Fontenla-Romero, Oscar

doi:10.1007/978-3-642-04274-4_85

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 5768)

Included in the following conference series:

International Conference on Artificial Neural Networks

Abstract

This work introduces a new approach to intrusion detection in computer networks. The proposed method combines feature selection methods with a novel local classification method called FVQIT (Frontier Vector Quantization using Information Theory). FVQIT uses a modified clustering algorithm to split the feature space into several local models, and the classification task is performed independently within each of them. The method is evaluated on the KDD Cup 99 benchmark dataset, with the aim of improving on the performance achieved by previous authors. The experimental results indicate the adequacy of the proposed approach.
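The general shape of such a pipeline (select features, partition the feature space, fit one classifier per region) can be sketched as follows. This is only an illustrative sketch under stated assumptions, not the authors' FVQIT algorithm: a class-mean-difference filter stands in for the feature selection methods, plain k-means with a deterministic initialisation stands in for the modified clustering step, and a nearest-class-centroid rule stands in for each local model. All function names and the toy data are hypothetical.

```python
def dist2(a, b):
    """Squared Euclidean distance between two points."""
    return sum((p - q) ** 2 for p, q in zip(a, b))

def centroid(rows):
    """Component-wise mean of a non-empty list of points."""
    return [sum(col) / len(rows) for col in zip(*rows)]

def select_features(X, y, k):
    """Stand-in filter: keep the k features whose class means differ most (labels 0/1)."""
    scores = []
    for j in range(len(X[0])):
        v0 = [row[j] for row, lab in zip(X, y) if lab == 0]
        v1 = [row[j] for row, lab in zip(X, y) if lab == 1]
        scores.append(abs(sum(v1) / len(v1) - sum(v0) / len(v0)))
    ranked = sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)
    return sorted(ranked[:k])

def project(row, selected):
    """Keep only the selected feature columns of one sample."""
    return [row[j] for j in selected]

def nearest(row, centers, allowed=None):
    """Index of the closest center, optionally restricted to some indices."""
    idx = range(len(centers)) if allowed is None else allowed
    return min(idx, key=lambda c: dist2(row, centers[c]))

def kmeans(X, n_clusters, iters=20):
    """Plain k-means (a stand-in for the modified clustering used by FVQIT)."""
    # Deterministic initialisation: evenly spaced training points.
    centers = [X[i * len(X) // n_clusters] for i in range(n_clusters)]
    for _ in range(iters):
        groups = [[] for _ in centers]
        for row in X:
            groups[nearest(row, centers)].append(row)
        centers = [centroid(g) if g else centers[i] for i, g in enumerate(groups)]
    return centers

def fit_local_models(X, y, centers):
    """One independent model per region: a centroid for each class seen there."""
    members = [{} for _ in centers]
    for row, lab in zip(X, y):
        members[nearest(row, centers)].setdefault(lab, []).append(row)
    return [{lab: centroid(rows) for lab, rows in m.items()} for m in members]

def predict(raw_row, selected, centers, models):
    """Route the sample to its region, then apply that region's local model."""
    row = project(raw_row, selected)
    usable = [c for c in range(len(centers)) if models[c]]  # skip empty regions
    model = models[nearest(row, centers, usable)]
    return min(model, key=lambda lab: dist2(row, model[lab]))

# Toy data (hypothetical): two regions along feature 0, each with its own
# decision boundary on feature 1; feature 2 carries no class information.
X_raw = [
    [0.0, 0.0, 0.5], [0.2, 0.1, 0.4], [0.0, 2.0, 0.5], [0.1, 2.2, 0.4],
    [10.0, 5.0, 0.5], [10.2, 5.1, 0.4], [10.0, 7.0, 0.5], [10.1, 7.2, 0.4],
]
y = [0, 0, 1, 1, 0, 0, 1, 1]

selected = select_features(X_raw, y, k=2)
X = [project(row, selected) for row in X_raw]
centers = kmeans(X, n_clusters=2)
models = fit_local_models(X, y, centers)

train_predictions = [predict(r, selected, centers, models) for r in X_raw]
print("selected features:", selected)
print("training predictions:", train_predictions)
```

In the toy data the class boundary on feature 1 differs between the two regions, which is the situation where independent local models can do better than a single global rule. A univariate filter like the one above is a deliberately crude placeholder and would miss interacting features.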

This work was supported in part by the Spanish Ministerio de Ciencia e Innovación under Project Code TIN 2006-02402, partially supported by the European Union ERDF, and by the Xunta de Galicia under Project Code 08TIC012105PR.

Author information

Authors and Affiliations

Department of Computer Science, University of A Coruña, Spain
Iago Porto-Díaz, David Martínez-Rego, Amparo Alonso-Betanzos & Oscar Fontenla-Romero

Editor information

Editors and Affiliations

Dipartimento di Elettronica, Politecnico di Milano, Piazza L. da Vinci 32, 20133, Milano, Italy
Cesare Alippi
Department of Electrical and Computer Engineering, University of Cyprus, 75 Kallipoleos Street, 1678, Nicosia, Cyprus
Marios Polycarpou, Christos Panayiotou & Georgios Ellinas

Cite this paper

Porto-Díaz, I., Martínez-Rego, D., Alonso-Betanzos, A., Fontenla-Romero, O. (2009). Combining Feature Selection and Local Modelling in the KDD Cup 99 Dataset. In: Alippi, C., Polycarpou, M., Panayiotou, C., Ellinas, G. (eds) Artificial Neural Networks – ICANN 2009. ICANN 2009. Lecture Notes in Computer Science, vol 5768. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04274-4_85

DOI: https://doi.org/10.1007/978-3-642-04274-4_85
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-04273-7
Online ISBN: 978-3-642-04274-4
eBook Packages: Computer Science, Computer Science (R0)