Privacy Preserving Classification Based on Perturbation for Network Traffic

Lu, Yue; Tian, Hui; Shen, Hong; Xu, Dongdong

doi:10.1007/978-981-13-5907-1_13

Part of the book series: Communications in Computer and Information Science (CCIS, volume 931)

Included in the following conference series:

International Conference on Parallel and Distributed Computing: Applications and Technologies

Abstract

Network traffic classification is important to many network applications, and machine learning is regarded as one of the most effective techniques for classifying network traffic. In this paper, we adopt the fast correlation-based filter algorithm to remove redundant attributes from network traffic data. The attributes selected by this algorithm reduce classification complexity and help achieve high classification accuracy. Because traffic attributes contain a large amount of information about users' behavior, users' privacy may be revealed and illegally exploited by malicious parties. It is therefore desirable to classify traffic while protecting the segments of frames that contain privacy-related information, so that the classification results do not disclose private information yet can still be used for data analysis. To this end, we propose a random perturbation algorithm based on the order relationships among different data attributes, which protects the privacy of these attributes and thus ensures data security during classification. Experimental results demonstrate that data perturbed by our algorithm can be classified with high accuracy while retaining data utility.
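
The attribute-filtering step named in the abstract can be illustrated with a minimal sketch of the fast correlation-based filter (FCBF) as it is generally described in the literature: attributes are ranked by their symmetric uncertainty with the class label, and an attribute is dropped when an already selected attribute is at least as correlated with it as the class is. This is not the authors' implementation. The function names, the `threshold` parameter, and the toy attributes (`a`, `a_copy`, `noise`) are assumptions made for illustration, and the sketch assumes attributes have already been discretized.

```python
import math
from collections import Counter


def entropy(values):
    """Shannon entropy (in bits) of a sequence of discrete values."""
    n = len(values)
    return -sum((c / n) * math.log2(c / n) for c in Counter(values).values())


def symmetric_uncertainty(xs, ys):
    """SU(X, Y) = 2 * I(X; Y) / (H(X) + H(Y)), a value in [0, 1]."""
    hx, hy = entropy(xs), entropy(ys)
    if hx + hy == 0:
        return 0.0  # both variables are constant
    hxy = entropy(list(zip(xs, ys)))  # joint entropy H(X, Y)
    mutual_info = hx + hy - hxy
    return 2.0 * mutual_info / (hx + hy)


def fcbf(features, labels, threshold=0.0):
    """Return names of selected attributes.

    features: dict mapping a (hypothetical) attribute name to a list of
              discrete values, one per traffic flow.
    labels:   list of class labels, one per traffic flow.
    """
    # Relevance of each attribute to the class.
    su_class = {name: symmetric_uncertainty(col, labels)
                for name, col in features.items()}
    # Keep attributes above the relevance threshold, most relevant first.
    ranked = sorted((n for n in su_class if su_class[n] > threshold),
                    key=lambda n: su_class[n], reverse=True)
    selected = []
    for name in ranked:
        # An attribute is redundant if some already selected attribute is
        # at least as correlated with it as the class label is.
        redundant = any(
            symmetric_uncertainty(features[kept], features[name]) >= su_class[name]
            for kept in selected
        )
        if not redundant:
            selected.append(name)
    return selected


# Toy data (made up for illustration, not taken from the paper).
labels = [0, 0, 0, 0, 1, 1, 1, 1]
features = {
    "a": [0, 0, 0, 1, 1, 1, 1, 1],       # informative attribute
    "a_copy": [0, 0, 0, 1, 1, 1, 1, 1],  # exact duplicate of "a"
    "noise": [0, 1, 0, 1, 0, 1, 0, 1],   # independent of the class
}
selected = fcbf(features, labels)
```

On this toy input the duplicate attribute is filtered as redundant and the independent attribute is filtered as irrelevant, so only `a` remains in `selected`.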

Acknowledgement

This work was supported by the Research Initiative Grant of Australian Research Council Discovery Projects funding DP150104871, Beijing Natural Science Foundation Grant No. 4172045, and National Science Foundation of China Grant No. 61501025.

Author information

Authors and Affiliations

School of Electronics and Information Engineering, Beijing Jiaotong University, Beijing, China
Yue Lu, Hui Tian & Dongdong Xu
School of Information and Communication Technology, Griffith University, Southport, Australia
Hui Tian
School of Computer Science, University of Adelaide, Adelaide, Australia
Hong Shen
School of Data and Computer Science, Sun Yat-Sen University, Guangzhou, China
Hong Shen

Corresponding author

Correspondence to Hui Tian.

Editor information

Editors and Affiliations

Department of Computer Science and Engineering, Seoul National University of Science and Technology, Seoul, Korea (Republic of)
Jong Hyuk Park
School of Computer Science, University of Adelaide, Adelaide, SA, Australia
Hong Shen
Department of Multimedia Engineering, Dongguk University, Seoul, Korea (Republic of)
Yunsick Sung
School of ICT, Griffith University, Gold Coast, Australia
Hui Tian

About this paper

Cite this paper

Lu, Y., Tian, H., Shen, H., Xu, D. (2019). Privacy Preserving Classification Based on Perturbation for Network Traffic. In: Park, J., Shen, H., Sung, Y., Tian, H. (eds) Parallel and Distributed Computing, Applications and Technologies. PDCAT 2018. Communications in Computer and Information Science, vol 931. Springer, Singapore. https://doi.org/10.1007/978-981-13-5907-1_13

DOI: https://doi.org/10.1007/978-981-13-5907-1_13
Published: 08 February 2019
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-5906-4
Online ISBN: 978-981-13-5907-1
eBook Packages: Computer Science, Computer Science (R0)
