Self-Learning IP Traffic Classification Based on Statistical Flow Characteristics

Zander, Sebastian; Nguyen, Thuy; Armitage, Grenville

doi:10.1007/978-3-540-31966-5_26

Sebastian Zander¹⁷,
Thuy Nguyen¹⁷ &
Grenville Armitage¹⁷

Part of the book series: Lecture Notes in Computer Science ((LNCCN,volume 3431))

Included in the following conference series:

International Workshop on Passive and Active Network Measurement

2147 Accesses
56 Citations

Abstract

A number of key areas in IP network engineering, management and surveillance greatly benefit from the ability to dynamically identify traffic flows according to the applications responsible for their creation. Currently such classifications rely on selected packet header fields (e.g. destination port) or application layer protocol decoding. These methods have a number of shortfalls e.g. many applications can use unpredictable port numbers and protocol decoding requires high resource usage or is simply infeasible in case protocols are unknown or encrypted. We propose a framework for application classification using an unsupervised machine learning (ML) technique. Flows are automatically classified based on their statistical characteristics. We also propose a systematic approach to identify an optimal set of flow attributes to use and evaluate the effectiveness of our approach using captured traffic traces.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Sen, S., Spatscheck, O., Wang, D.: Accurate, Scalable In-Network Identification of P2P Traffic Using Application Signatures. In: WWW 2004, New York, USA (May 2004)
Google Scholar
Frank, J.: Machine Learning and Intrusion Detection: Current and Future Directions. In: Proceedings of the National 17th Computer Security Conference (1994)
Google Scholar
Roughan, M., Sen, S., Spatscheck, O., Duffield, N.: Class-of-Service Mapping for QoS: A statistical signature-based approach to IP traffic classification. In: ACM SIGCOMM Internet Measurement Workshop 2004, Taormina, Sicily, Italy,
Google Scholar
McGregor, A., Hall, M., Lorier, P., Brunskill, J.: Flow Clustering Using Machine Learning Techniques. In: Passive & Active Measurement Workshop 2004, France (April 2004)
Google Scholar
Lan, K., Heidemann, J.: On the correlation of Internet flow characteristics, Technical Report ISI-TR-574, USC/Information Sciences Institute (July 2003)
Google Scholar
Claffy, K., Braun, H.-W., Polyzos, G.: Internet Traffic Profiling, CAIDA, San Diego Supercomputer Center outreach/papers/1994/itf/ (1994), http://www.caida.org/
Dunnigan, T., Ostrouchov, G.: Flow Characterization for Intrusion Detection, Oak Ridge National Laboratory, Tech Report (November 2000), http://www.csm.ornl.gov/~ost/id/tm.ps
NetMate as of, (January 2005), http://sourceforge.net/projects/netmate-meter/
Cheeseman, P., Stutz, J.: Bayesian Classification (Autoclass): Theory and Results. In: Advances in Knowledge Discovery and Data Mining, AAAI/MIT Press, USA (1996)
Google Scholar
Dempster, A., Laird, N., Rubin, D.: Maximum Likelihood from Incomplete Data via the EM Algorithm. Journal of Royal Statistical Society, Series B 30(1) (1977)
Google Scholar
NLANR traces as of, (January 2005), http://pma.nlanr.net/Special/

Download references

Author information

Authors and Affiliations

Centre for Advanced Internet Architectures (CAIA), Swinburne University of Technology, Melbourne, Australia
Sebastian Zander, Thuy Nguyen & Grenville Armitage

Authors

Sebastian Zander
View author publications
You can also search for this author in PubMed Google Scholar
Thuy Nguyen
View author publications
You can also search for this author in PubMed Google Scholar
Grenville Armitage
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

College of Computing, Georgia Institute of Technology, 30332, Atlanta, Georgia
Constantinos Dovrolis

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zander, S., Nguyen, T., Armitage, G. (2005). Self-Learning IP Traffic Classification Based on Statistical Flow Characteristics. In: Dovrolis, C. (eds) Passive and Active Network Measurement. PAM 2005. Lecture Notes in Computer Science, vol 3431. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-31966-5_26

Download citation

DOI: https://doi.org/10.1007/978-3-540-31966-5_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-25520-8
Online ISBN: 978-3-540-31966-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics