skip to main content
10.1145/2435264.2435307acmconferencesArticle/Chapter ViewAbstractPublication PagesfpgaConference Proceedingsconference-collections
research-article

High throughput and programmable online trafficclassifier on FPGA

Authors Info & Claims
Published:11 February 2013Publication History

ABSTRACT

Machine learning (ML) algorithms have been shown to be effective in classifying the dynamic internet traffic today. Using additional features and sophisticated ML techniques can improve accuracy and can classify a broad range of application classes. Realizing such classifiers to meet high data rates is challenging. In this paper, we propose two architectures to realize complete online traffic classifier using flow-level features. First, we develop a traffic classifier based on C4.5 decision tree algorithm and Entropy-MDL discretization algorithm. It achieves an accuracy of 97.92% when classifying a traffic trace consisting of eight application classes. Next, we accelerate our classifier using two architectures on FPGA. One architecture stores the classifier in on-chip distributed RAM. It is designed to sustain a high throughput. The other architecture stores the classifier in block RAM. It is designed to operate with small hardware footprint and thus built at low hardware cost. Experimental results show that our high throughput architecture can sustain a throughput of $550$ Gbps assuming 40 Byte packet size. Our low cost architecture demonstrates a 22% better resource efficiency than the high throughput design. It can be easily replicated to achieve $449$ Gbps while supporting 160 input traffic streams concurrently. Both architectures are parameterizable and programmable to support any binary-tree-based traffic classifier. We develop a tool which allows users to easily map a binary-tree-based classifier to hardware. The tool takes a classifier as input and automatically generates the Verilog code for the corresponding hardware architecture.

References

  1. R. Alshammari and A. N. Zincir-Heywood. Machine learning based encrypted traffic classification: identifying ssh and skype. In the proc. of CISDA, pages 289-296, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. D. Angevine and A. Zincir-Heywood. A preliminary investigation of skype traffic classification using a minimalist feature set. In the proc. of ARES, pages 1075-1079, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. L. Bernaille, R. Teixeira, I. Akodkenou, A. Soule, and K. Salamatian. Traffic classification on the fly. ACM SIGCOMM Computer Communication Review, 36:23-26, April 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. D. Bonfiglio, M. Mellia, M. Meo, D. Rossi, and P. Tofanelli. Revealing skype traffic: when randomness plays with you. In the proc. of SIGCOMM, pages 37-48, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. J. Erman, M. Arlitt, and A. Mahanti. Traffic classification using clustering algorithms. In the proc. of MineNet, pages 281-286, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. U. M. Fayyad and K. B. Irani. Multi-interval discretization of continuous valued attributes for classification learning. In the proc. of IJCAI, pages 1022-1027, 1993.Google ScholarGoogle Scholar
  7. M. Hall, E. Frank, G. Holmes, B. Pfahringer, P. Reutemann, and I. H. Witten. The weka data mining software: an update. SIGKDD Explor. Newsl., 11(1):10-18, Nov. 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. M.-H. Ho, Y.-Q. Ai, T. C.-P. Chau, S. C. L. Yuen, C.-S. Choy, P. H. W. Leong, and K.-P. Pun. Architecture and design flow for a highly efficient structured asic. VLSI, PP(99):1, 2012.Google ScholarGoogle Scholar
  9. W. Jiang and M. Gokhale. Real-time classification of multimedia traffic using fpga. In the proc. of FPL, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. T. Karagiannis, A. Broido, M. Faloutsos, and K. Claffy. Transport layer identification of p2p traffic. In the proc. of IMC, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. T. Karagiannis, K. Papagiannaki, and M. Faloutsos. Blinc: multilevel traffic classification in the dark. In the proc. of SIGCOMM, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. R. Kohavi. A study of cross-validation and bootstrap for accuracy estimation and model selection. In the proc. of IJCAI, pages 1137-1143. Morgan Kaufmann, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Y.-S. Lim, H.-C. Kim, J. Jeong, C.-K. Kim, T. T. Kwon, and Y. Choi. Internet traffic classification demystified: on the sources of the discriminative power. In the Proc. of ACM Co-NEXT, '10, pages 9:1-9:12, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Y. Luo, K. Xiang, and S. Li. Acceleration of decision tree searching for ip traffic classification. In the proc. of ANCS, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. A. Monemi, R. Zarei, M. Marsono, and M. Khalil-Hani. Parameterizable decision tree classifier on netfpga. Advances in Intelligent Systems and Computing, 182:119-128, 2013.Google ScholarGoogle ScholarCross RefCross Ref
  16. A. W. Moore and K. Papagiannaki. Toward the accurate identification of network applications. LNCS, 3431/2005:41-54, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. J. R. Quinlan. C4.5: programs for machine learning. Morgan Kaufmann Publishers Inc., 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. S. Sen, O. Spatscheck, and D. Wang. Accurate, scalable innetwork identification of p2p traffic using application signatures. In the proc. of WWW, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. T. Traces. http://tstat.tlc.polito.it/traces.shtml.Google ScholarGoogle Scholar
  20. N. Williams, S. Zander, and G. Armitage. A preliminary performance comparison of five machine learning algorithms for practical ip traffic flow classification. SIGCOMM Comput. Commun. Rev., 36(5):5-16, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. S. Zander, T. Nguyen, and G. Armitage. Automated traffic classification and application identification using machine learning. In the proc. of LCN, pages 250-257, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. High throughput and programmable online trafficclassifier on FPGA

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        FPGA '13: Proceedings of the ACM/SIGDA international symposium on Field programmable gate arrays
        February 2013
        294 pages
        ISBN:9781450318877
        DOI:10.1145/2435264

        Copyright © 2013 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 11 February 2013

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        Overall Acceptance Rate125of627submissions,20%

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader