Skip to main content

Empirical Analysis of Application-Level Traffic Classification Using Supervised Machine Learning

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNCCN,volume 5297))

Abstract

Accurate application traffic classification and identification are important for network monitoring and analysis. The accuracy of traditional Internet application traffic classification approaches is rapidly decreasing due to the diversity of today’s Internet application traffic, such as ephemeral port allocation, proprietary protocol, and traffic encryption. This paper presents an empirical evaluation of application-level traffic classification using supervised machine learning techniques. Our results indicate that we cannot achieve high accuracy with a simple feature set. Even if a simple feature set shows good performance in application category-level classification, more sophisticated feature selection methods and other techniques are necessary for performance enhancement.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Sen, S., Spatscheck, O., Wang, D.: Accurate, Scalable In-Network Identification of P2P Traffic Using Application Signatures. In: WWW 2004 Conference (2004)

    Google Scholar 

  2. Karagiannis, T., Broido, A., Faloutsos, M., Claffy, K.C.: Transport layer identification of p2p traffic. In: Internet Measurement Conference (IMC) (2004)

    Google Scholar 

  3. Park, B.-C., Won, Y.J., Kim, M.-S., Hong, J.W.-K.: Towards Automated Application Signature Generation for Traffic Identification. In: Proceedings of the IEEE/IFIP Network Operations and Management Symposium (NOMS 2008), Salvador, Brazil, April 2008, pp. 160–167 (2008)

    Google Scholar 

  4. Erman, J., Arlitt, M., Mahanti, A.: Traffic Classification Using Clustering Algorithms. In: SIGCOMM 2006 Workshops, Pisa, Italy, September 2006, pp. 281–286 (2006)

    Google Scholar 

  5. Zander, S., Nguyen, T., Armitage, G.: Automated Traffic Classification and Application Identification using Machine Learning. In: Proceedings of the IEEE Conference on Local Computer Networks, Sydney, Australia, November 2005, pp. 250–257 (2005)

    Google Scholar 

  6. Thuy, T., Nguyen, T., Armitage, G.: Training on multiple Sub-Flows to optimize the use of Machine Learning classifiers in real-world IP Networks. In: IEEE Conference on Local Computer Networks, Tampa, Florida, USA, November 2006, pp. 369–376 (2006)

    Google Scholar 

  7. Park, J., Tyan, H.-R., Kuo, C.-C.J.: GA-Based Internet Traffic Classification Technique for QoS Provisioning. In: International Conference on Intelligent Information Hiding and Multimedia, Pasadena, California, USA, December 2006, pp. 251–254 (2006)

    Google Scholar 

  8. Moore, A.W., Zuev, D.: Internet Traffic Classification Using Bayesian Analysis Techniques. In: SIGMETRICS 2005, Banff, Alberta, Canada, June 2005, pp. 50–60 (2005)

    Google Scholar 

  9. Erman, J., Mahanti, A., Arlitt, M.: Internet Traffic Identification using Machine Learning. In: IEEE Global Telecommunications Conference, California, USA, November-December 2006, pp. 1–6 (2006)

    Google Scholar 

  10. Williams, N., Zander, S., Armitage, G.: A Preliminary Performance Comparison of Five Machine Learning Algorithms for Practical IP Traffic Flow Classification. In: SIGCOMM Computer Communication Review, October 2006, pp. 7–15 (2006)

    Google Scholar 

  11. Battiti, R.: Using mutual information for selecting features in supervised neural net learning. IEEE Transactions on Neural Networks 5(4) (July 1994)

    Google Scholar 

  12. Guyon, I., Elisseeff, A.: An introduction to variable and feature selection. Journal of Machine Learning Research (2003)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Park, B., Won, Y.J., Choi, MJ., Kim, MS., Hong, J.W. (2008). Empirical Analysis of Application-Level Traffic Classification Using Supervised Machine Learning. In: Ma, Y., Choi, D., Ata, S. (eds) Challenges for Next Generation Network Operations and Service Management. APNOMS 2008. Lecture Notes in Computer Science, vol 5297. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-88623-5_55

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-88623-5_55

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-88622-8

  • Online ISBN: 978-3-540-88623-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics