Abstract
Online support vector machine (SVM) is an effective learning method in real-time network traffic classification tasks. However, due to its geometric characteristics, the traditional online SVMs are sensitive to noise and class imbalance. In this paper, a scalable kernel convex hull online SVM called SKCHO-SVM is proposed to solve this problem. SKCHO-SVM involves two stages: (1) offline leaning stage, in which the noise points are deleted and initial pin-SVM classifier is built; (2) online updating stage, in which the classifier is updated with newly arrived data points, while carrying out the classification task. The noise deleting strategy and pinball loss function ensure SKCHO-SVM insensitive to noise data flows. Based on the scalable kernel convex hull, a small amount of convex hull vertices are dynamically selected as the training data points in each class, and the obtained scalable kernel convex hull can relieve class imbalance. Theoretical analysis and numerical experiments show that SKCHO-SVM has the distinctive ability of training time and classification performance.



Similar content being viewed by others
References
China Internet Network Information Center (2019) Statistical Report on Internet Development in China. http://cnnic.com.cn/IDR/ReportDownloads/.[Online; Accessed 2-28-2019]
Nguyen T, Armitage G (2009) A survey of techniques for internet traffic classification using machine learning. IEEE Commun Surv Tutorials 10(4):56–76
Moore AW, Papagiannaki K (2005) Toward the accurate identification of network applications. In: Dovrolis C (ed) Passive and active network measurement. PAM 2005. Lecture Notes in Computer Science, vol 3431. Springer, Berlin, Heidelberg, pp 41–54. https://doi.org/10.1007/978-3-540-31966-5_4
Dainotti A, Pescape A, Claffy KC (2012) Issues and future directions in traffic classification. IEEE Netw 26(1):35–40
Bujlow T, Carela-Español V, Barlet-Ros P (2015) Independent comparison of popular DPI tools for traffic classification. Comput Netw 76(1):75–89
Surati S, Jinwala DC, Garg S (2017) A survey of simulators for P2P overlay networks with a case study of the P2P tree overlay using an event-driven simulator. Eng Sci Technol 20(2):705–720
Rezaei S, Liu X (2019) Deep learning for encrypted traffic classification: an overview. IEEE Commun Mag 57(5):76–81
Sun G, Chen T, Su Y, Li C (2018) Internet traffic classification based on incremental support vector machines. Mob Netw Applic 23(4):789–796
Gu XQ, Chung FL, Wang ST (2019) Extreme vector machine for fast training on large data. Int J Mach Learn Cybern 11:33–53. https://doi.org/10.1007/s13042-019-00936-3
Cao J, Fang Z, Qu G, Sun H, Zhang D (2017) An accurate traffic classification model based on support vector machines. Int J Netw Manag 27(1):1–15
Zhang J, Chen X, Xiang Y, Zhou W, Wu J (2015) Robust network traffic classification. IEEE/ACM Trans Networking 23(4):1257–1270
Divakaran DM, Su L, Liau YS, Thing VL (2015) SLIC: self-learning intelligent classifier for network traffic. Comput Netw 91(11):283–297
Ni TG, Gu XQ, Wang J, Zheng YH, Wang HY (2018) Scalable transfer support vector machine with group probabilities. Neurocomputing 273(1):570–582
Gu XQ, Chung FL, Wang ST (2018) Fast convex-hull vector machine for training on large-scale ncRNA data classification tasks. Knowl-Based Syst 151(6):149–164
Finamore A, Mellia M, Meo M, Rossi D (2010) Kiss: stochastic packet inspection classifier for udp traffic. IEEE/ACM Trans Netw 18(5):1505–1515
Gu C, Zhang S, Xue X (2011) Internet traffic classification based on fuzzy kernel K-means clustering. Int J Advancements in Comput Technol 3(3):199–209
Ertekin S, Bottou L, Giles CL (2011) Nonconvex online support vector machines. IEEE Trans Pattern Anal Mach Intell 33(2):368–381
Wang T, Chen J, Zhou Y, Snoussi H (2013) Online least squares one-class support vector machines-based abnormal visual event detection. Sensors 13(12):17130–17155
Wang D, Qiao H, Zhang B, Wang M (2013) Online support vector machine based on convex hull vertices selection. IEEE Trans Neural Netw Learn Syst 24(4):593–609
Wang J, Zhao P, Hoi SCH (2014) Cost-sensitive online classification. IEEE Trans Knowl Data Eng 26(10):2425–2438
Labovitz C, Johnson S, Oberheide J, Jahanian F, McPherson D (2010) Internet inter-domain traffic. In: Proceedings of the ACM SIGCOMM 2010 conference on applications, technologies, architectures, and protocols for computer communications. New Delhi, India, pp 75–86. https://doi.org/10.1145/1851182.1851194
Huang XL, Shi L, Suykens JAK (2014) Asymmetric least squares support vector machine classifiers. Comput Stat Data Anal 70(2):395–405
Huang XL, Shi L, Pelckmansb K, Suykens JAK (2014) Asymmetric ν-tube support vector regression. Comput Stat Data Anal 77(9):371–382
Huang XL, Shi L, Suykens JAK (2014) Support vector machine classifier with pinball loss. IEEE Trans Pattern Anal Mach Intell 36(5):984–997
Nandan M, Khargonekar PP, Talathi SS (2014) Fast SVM training using approximate extreme points. J Mach Learn Res 15(1):59–98
Almasi ON, Rouhani M (2016) Fast and de-noise support vector machine training method based on fuzzy clustering method for large real world datasets. Turk J Electr Eng Comput Sci 24(1):219–233
Ding S, Nie X, Qiao H, Zhang B (2013) A fast algorithm of convex hull vertices selection for online classification. IEEE Trans Neural Netw Learn Syst 29(4):792–806
David MJT (2004) Support vector data description. J Mach Learn Res 54(1):45–66
Network traffic data Moore datasets, https://www.cl.cam.ac.uk/research/srg/netos/projects/brasil/data/index.html [Online; Accessed 12-22-2018]
Li W, Canini M, Moore AW, Bolla R (2009) Efficient application identification and the temporal and spatial stability of classification schema. Comput Netw 53(6):790–809
Wang ST, Wang J, Chung F (2014) Kernel density estimation, kernel methods, and fast learning in large data sets. IEEE Trans Cybern 44(1):1–20
Bordes A, Ertekin S, Weston J, Bottou L (2005) Fast kernel classifiers with online and active learning. J Mach Learn Res 6(10):1579–1619
Acknowledgments
This work was supported in part by the National Natural Science Foundation of China under Grants 61976028 and 61806026 and by the Natural Science Foundation of Jiangsu Province under Grant BK 20180956.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Gu, X., Ni, T., Fan, Y. et al. Scalable kernel convex hull online support vector machine for intelligent network traffic classification. Ann. Telecommun. 75, 471–486 (2020). https://doi.org/10.1007/s12243-020-00767-2
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12243-020-00767-2