skip to main content
10.1145/3341216.3342213acmconferencesArticle/Chapter ViewAbstractPublication PagescommConference Proceedingsconference-collections
research-article

Towards a Profiling View for Unsupervised Traffic Classification by Exploring the Statistic Features and Link Patterns

Published: 14 August 2019 Publication History

Abstract

In this paper, we study the network traffic classification task. Different from existing supervised methods that rely heavily on the labeled statistic features in a long period (e.g., several hours or days), we adopt a novel view of unsupervised profiling to explore the flow features and link patterns in a short time window (e.g., several seconds), dealing with the zero-day traffic problem. Concretely, we formulate the traffic identification task as a graph co-clustering problem with topology and edge attributes, and proposed a novel Hybrid Flow Clustering (HFC) model. The model can potentially achieve high classification performance, since it comprehensively leverages the available information of both features and linkage. Moreover, the two information sources integrated in HFC can also be utilized to generate the profiling for each flow category, helping to reveal the deep knowledge and semantics of network traffic. The effectiveness of the model is verified in the extensive experiments on several real datasets of various scenarios, where HFC achieves impressive results and presents powerful application ability.

Supplementary Material

MP4 File (p50-qin.mp4)

References

[1]
Laurent Bernaille, Renata Teixeira, Ismael Akodkenou, Augustin Soule, and Kave Salamatian. 2006. Traffic classification on the fly. ACM SIGCOMM Computer Communication Review 36, 2 (2006), 23.
[2]
C. Boutsidis and E. Gallopoulos. 2008. SVD based initialization: A head start for nonnegative matrix factorization. Pattern Recognition 41, 4 (2008), 1350--1362.
[3]
Tianxiang Gao and Chris Chu. 2018. DID: Distributed Incremental Block Coordinate Descent for Nonnegative Matrix Factorization. In AAAI Conference on Artificial Intelligence. 2991--2998.
[4]
Bin Hu and Yi Shen. 2012. Machine learning based network traffic classification: a survey. JOURNAL OF INFORMATION & COMPUTATIONAL SCIENCE 9, 11 (2012), 3161--3170.
[5]
Brunet Jean-Philippe, Tamayo Pablo, Todd R Golub, and Jill P Mesirov. 2004. Metagenes and molecular pattern discovery using matrix factorization. Proc Natl Acad Sci USA 101, 12 (2004), 4164--4169.
[6]
Yu Jin, Nick Duffield, Patrick Haffner, Subhabrata Sen, and Zhi Li Zhang. 2010. Inferring applications at the network layer using collective traffic statistics. ACM SIGMETRICS Performance Evaluation Review 38, 1 (2010), 351--352.
[7]
Yu Jin, Esam Sharafuddin, and Zhi Li Zhang. 2009. Unveiling core network-wide communication patterns through application traffic activity graph decomposition. ACM SIGMETRICS Performance Evaluation Review 37, 1 (2009), 49--60.
[8]
Jingu Kim and Haesun Park. 2008. Sparse nonnegative matrix factorization for clustering. Technical Report. Georgia Institute of Technology.
[9]
Mohammad Lotfollahi, Ramin Shirali Hossein Zade, Mahdi Jafari Siavoshani, and Mohammdsadegh Saberian. 2017. Deep packet: A novel approach for encrypted traffic classification using deep learning. arXiv preprint arXiv:1709.02656 (2017).
[10]
Laszlo Lovasz and Michael Plummer. 2009. Matching Theory. Annals of Discrete Mathematics 367 (2009).
[11]
Qin Meng, Jin Di, Lei Kai, Gabrys Bogdan, and Musial Katarzyna. 2018. Adaptive Community Detection Incorporating Topology and Content in Social Networks. Knowledge-Based Systems 161 (2018), 342--356.
[12]
Cui Peng, Wang Xiao, Pei Jian, and Wenwu Zhu. 2017. A Survey on Network Embedding. IEEE Transactions on Knowledge & Data Engineering PP, 99 (2017), 1--1.
[13]
Guo Jun Qi, Charu C.Aggarwal, and Thomas Huang. 2012. Community Detection with Edge Content in Social Media Networks. In IEEE International Conference on Data Engineering. 534--545.
[14]
Muhammad Shafiq, Xiangzhan Yu, Asif Ali Laghari, Lu Yao, Nabin Kumar Karn, and Foudil Abdessamia. 2016. Network Traffic Classification techniques and comparative analysis using Machine Learning algorithms. In ICCC. 2451--2455.
[15]
Muhammad Shafiq, Xiangzhan Yu, and Dawei Wang. 2017. Network Traffic Classification Using Machine Learning Algorithms. In Advances in Intelligent Systems and Interactive Applications. 621--627.
[16]
Hongtao Shi, Hongping Li, Zhang Dan, Chaqiu Cheng, and Xuanxuan Cao. 2018. An Efficient Feature Generation Approach based on Deep Learning and Feature Selection Techniques for traffic classification. Computer Networks 132 (2018).
[17]
Guanglu Sun, Lili Liang, Chen Teng, Xiao Feng, and Lang Fei. 2018. Network traffic classification based on transfer learning. Computers & Electrical Engineering (2018), S004579061732829X.
[18]
Guanglu Sun, Chen Teng, Yangyang Su, and Chenglong Li. 2018. Internet Traffic Classification Based on Incremental Support Vector Machines. Mobile Networks & Applications 14 (2018), 1--8.
[19]
Jun Zhang, Xiao Chen, Yang Xiang, Wanlei Zhou, and Jie Wu. 2015. Robust Network Traffic Classification. IEEE/ACM Transactions on Networking 23, 4 (2015), 1257--1270.
[20]
Jun Zhang, Xiang Yang, Wang Yu, Wanlei Zhou, Xiang Yong, and Guan Yong. 2012. Network Traffic Classification Using Correlation Information. IEEE Transactions on Parallel & Distributed Systems 24, 1 (2012), 104--117.

Cited By

View all
  • (2024)Pre-train and Refine: Towards Higher Efficiency in K-Agnostic Community Detection without Quality DegradationProceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3637528.3671686(2467-2478)Online publication date: 25-Aug-2024
  • (2023)Towards a Better Tradeoff between Quality and Efficiency of Community Detection: An Inductive Embedding Method across GraphsACM Transactions on Knowledge Discovery from Data10.1145/359660517:9(1-34)Online publication date: 15-Jun-2023
  • (2023)Scalable Deep Reinforcement Learning-Based Online Routing for Multi-Type Service RequirementsIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2023.328465134:8(2337-2351)Online publication date: Aug-2023
  • Show More Cited By

Index Terms

  1. Towards a Profiling View for Unsupervised Traffic Classification by Exploring the Statistic Features and Link Patterns

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      NetAI'19: Proceedings of the 2019 Workshop on Network Meets AI & ML
      August 2019
      96 pages
      ISBN:9781450368728
      DOI:10.1145/3341216
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 14 August 2019

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. Network traffic classification
      2. attributed networks
      3. co-clustering
      4. flow profiling
      5. non-negative matrix factorization

      Qualifiers

      • Research-article
      • Research
      • Refereed limited

      Funding Sources

      • Shenzhen Key Lab

      Conference

      SIGCOMM '19
      Sponsor:
      SIGCOMM '19: ACM SIGCOMM 2019 Conference
      August 23, 2019
      Beijing, China

      Acceptance Rates

      NetAI'19 Paper Acceptance Rate 13 of 38 submissions, 34%;
      Overall Acceptance Rate 13 of 38 submissions, 34%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)16
      • Downloads (Last 6 weeks)2
      Reflects downloads up to 01 Mar 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)Pre-train and Refine: Towards Higher Efficiency in K-Agnostic Community Detection without Quality DegradationProceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3637528.3671686(2467-2478)Online publication date: 25-Aug-2024
      • (2023)Towards a Better Tradeoff between Quality and Efficiency of Community Detection: An Inductive Embedding Method across GraphsACM Transactions on Knowledge Discovery from Data10.1145/359660517:9(1-34)Online publication date: 15-Jun-2023
      • (2023)Scalable Deep Reinforcement Learning-Based Online Routing for Multi-Type Service RequirementsIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2023.328465134:8(2337-2351)Online publication date: Aug-2023
      • (2023)RaftGP: Random Fast Graph Partitioning2023 IEEE High Performance Extreme Computing Conference (HPEC)10.1109/HPEC58863.2023.10363495(1-7)Online publication date: 25-Sep-2023
      • (2023)Batch classifier with adaptive update for backbone traffic classificationComputer Communications10.1016/j.comcom.2023.02.013202:C(57-72)Online publication date: 15-Mar-2023
      • (2023)Clustering unknown network traffic with dual-path autoencoderNeural Computing and Applications10.1007/s00521-022-08138-9Online publication date: 7-Jan-2023
      • (2022)Performance Profiling of Operating Modes via Multi-view Analysis Using Non-negative Matrix FactorisationRecent Advancements in Multi-View Data Analytics10.1007/978-3-030-95239-6_11(289-316)Online publication date: 21-May-2022
      • (2021)BCAC: Batch Classifier based on Agglomerative Clustering for traffic classification in a backbone network2021 IEEE/ACM 29th International Symposium on Quality of Service (IWQOS)10.1109/IWQOS52092.2021.9521310(1-10)Online publication date: 25-Jun-2021
      • (2021)DRL-OR: Deep Reinforcement Learning-based Online Routing for Multi-type Service RequirementsIEEE INFOCOM 2021 - IEEE Conference on Computer Communications10.1109/INFOCOM42981.2021.9488736(1-10)Online publication date: 10-May-2021
      • (2020)Cross-domain Network Traffic Classification Using Unsupervised Domain Adaptation2020 International Conference on Information Networking (ICOIN)10.1109/ICOIN48656.2020.9016470(245-250)Online publication date: Jan-2020
      • Show More Cited By

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media