Skip to main content

LCGT: A Low-Cost Continuous Ground Truth Generation Method for Traffic Classification

  • Conference paper
Book cover Management Enabling the Future Internet for Changing Business and New Computing Services (APNOMS 2009)

Part of the book series: Lecture Notes in Computer Science ((LNCCN,volume 5787))

Included in the following conference series:

Abstract

Recently, with the progress of research on accurate traffic classification (TC), the major obstacle to achieving accurate TC is the lack of an efficient ground truth (GT) generation method. A firm GT is important for exploring the underlying characteristics of network traffic, building the traffic model, and verifying the classification result, etc. However, current existing GT generation methods can only be made manually or with additional high-cost DPI (deep packet inspection) devices. They are neither too complicated nor too expensive for research community. In response to this problem, we present LCGT, a low-cost continuous GT generation method for TC. Based on LCGT, we propose a novel updateable TC system, which can always reflect the features of up-to-date traffic. While we have found LCGT to be very useful in our own research, we seek to initiate a broader discussion to guide the refinement of the tools. LCGT is located on: http://code.google.com/p/traclassy

China 973 Programme (No. 2009CB320505), Project 60811140347 supported by NSFC-KOSEF, Specialized Research Fund for the Doctoral Program of Higher Education(200800130014), Project 60772111 supported by National Natural Science Foundation of China.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Ubicom Inc., Solving Performance Problems with Interactive Applications in a Broadband Environment using StreamEngine Technology, August 14 (2007)

    Google Scholar 

  2. http://www.iana.org/assignments/port-numbers

  3. Williams, N., Zander, S., Armitage, G.: A preliminary performanc comparison of five machine learning algorithms for practical IP traffic flow classification. Special Interest Group on Data Communication (SIGCOMM) Computer Communication Review 36(5), 5–16 (2006)

    Google Scholar 

  4. Andrew, W.M., Denis, Z.: Internet traffic classification using Bayesian analysis techniques. In: ACM International Conference on Measurement and Modeling of Computer Systems (SIGMETRICS), Banff, Alberta, Canada (June 2005)

    Google Scholar 

  5. Livadas, C., Walsh, R., Lapsley, D.: Using Machine Learning Techniques to Identify Botnet Traffic. IEEE, 967–974 (2006)

    Google Scholar 

  6. Auld, T., Moore, A.W., Gull, S.F., et al.: Bayesian Neural Networks for Internet Traffic Classification. IEEE Transactions on Neural Networks (1), 223–239 (2007)

    Google Scholar 

  7. Bernaille, L., Teixeira, R., Salamatian, K.: Early Application Classification. In: The 2nd ADETTI/ISCTE CoNEXT Conference, Lisboa, Portugal (December 2006)

    Google Scholar 

  8. McGreor, A., Hall, M., Lorier, P., et al.: Flow clustering Using Machine Learning Techniques. Passive&Active Measurement workshop 3015(4), 205–214 (2004)

    Google Scholar 

  9. Pietrzyk, M., Urvoy-Keller, G., Costeux, J.-L.: Revealing the Unknown ADSL Traffic Using Statistical Methods. In: Proc. 1st COST TMA Workshop, pp. 75–83 (2009)

    Google Scholar 

  10. Iliofotou, M., Kim, H., Pappu, P., Faloutsos, M., Mitzenmacher, M., Varghese, G.: Graph-based P2P Traffic Classification at the Internet Backbone. In: IEEE 12th Global Internet Symposium (in Conjunction with IEEE INFOCOM 2009) (April 2009)

    Google Scholar 

  11. Moore, A.W., Zuev, D., Crogan, M.: Discriminators for use in flow-based classification, Technical Report, RR-05-13, Department of Computer Science, Queen Mary, University of London (August 2005)

    Google Scholar 

  12. Canini, M., Li, W., Moore, A.W., Bolla, R.: GTVS: Boosting the Collection of Application Traffic Ground Truth. In: Proc. 1st COST TMA Workshop (2009)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Tian, X., Huang, X., Sun, Q. (2009). LCGT: A Low-Cost Continuous Ground Truth Generation Method for Traffic Classification. In: Hong, C.S., Tonouchi, T., Ma, Y., Chao, CS. (eds) Management Enabling the Future Internet for Changing Business and New Computing Services. APNOMS 2009. Lecture Notes in Computer Science, vol 5787. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04492-2_22

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-04492-2_22

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-04491-5

  • Online ISBN: 978-3-642-04492-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics