Performance Improvement of Classification Model Based on Adversarial Sample Generation

Jiang, Qian; Kang, Jie; Wu, Zhendong

doi:10.1007/978-3-031-20099-1_6

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 13656))

Included in the following conference series:

International Conference on Machine Learning for Cyber Security

792 Accesses

Abstract

The bad information in the network is often filtered by the neural network model, which is easy to be attacked by various adversarial samples. In order to improve the text filtering ability of the neural network model, it is necessary to make the filtering model learn more bad text feature information, especially the feature information that is not recognized by the filtering model at present. Therefore, having more abundant and diverse high-quality data set is one of the ideal methods to improve the accuracy of neural network filtering model. First of all, aiming at the generation of Chinese adversarial samples, we propose a method to generate semantically similar adversarial samples based on GPT2 model. At the same time, we put forward the mutation strategy by using three kinds of mutation methods (homophonic substitution, visual replacement and letters replaced), in order to extend the data set. So that we can improve the performance of classification models. After retraining the classifier with the expanded data set, the accuracy of the LSTM model of the classifier is improved from 82% to 93%.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018)
DU Xiaohu, W.H., YI Zibo, L.S., MA Ju, N.Y.J.: Adversarial text attack and defense:a review. J. Chin. Inf. Process. 35(08), 1–15 (2021)
Google Scholar
Ebrahimi, J., Lowd, D., Dou, D.: On adversarial examples for character-level neural machine translation. arXiv preprint arXiv:1806.09030 (2018)
Ebrahimi, J., Rao, A., Lowd, D., Dou, D.: Hotflip: white-box adversarial examples for text classification. arXiv preprint arXiv:1712.06751 (2017)
Gao, J., Lanchantin, J., Soffa, M.L., Qi, Y.: Black-box generation of adversarial text sequences to evade deep learning classifiers. In: 2018 IEEE Security and Privacy Workshops (SPW), pp. 50–56. IEEE (2018)
Google Scholar
Goodfellow, I., et al.: Generative adversarial nets. Adv. Neural Inf. Process. Syst. 27 (2014)
Google Scholar
Goodfellow, I.J., Shlens, J., Szegedy, C.: Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 (2014)
Jia, R., Liang, P.: Adversarial examples for evaluating reading comprehension systems. arXiv preprint arXiv:1707.07328 (2017)
Li, L., Shao, Y., Song, D., Qiu, X., Huang, X.: Generating adversarial examples in Chinese texts using sentence-pieces. arXiv preprint arXiv:2012.14769 (2020)
Liang, B., Li, H., Su, M., Bian, P., Li, X., Shi, W.: Deep text classification can be fooled. arXiv preprint arXiv:1704.08006 (2017)
Radford, A., Narasimhan, K., Salimans, T., Sutskever, I.: Improving language understanding by generative pre-training (2018)
Google Scholar
Radford, A., Wu, J., Child, R., Luan, D., Amodei, D., Sutskever, I., et al.: Language models are unsupervised multitask learners. OpenAI Blog 1(8), 9 (2019)
Google Scholar
Szegedy, C., et al.: Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 (2013)
Tong, X., Wang, L., Wang, R., Wang, J.: A generation method of word-level adversarial samples for Chinese text classification. Netinfo Secur. 20(9), 12–16 (2020)
Google Scholar
Tsai, Y.T., Yang, M.C., Chen, H.Y.: Adversarial attack on sentiment classification. In: Proceedings of the 2019 ACL Workshop BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP, pp. 233–240 (2019)
Google Scholar
Wang, C.L., Yang, Y.H., Deng, F., Lai, H.Y.: A review of text similarity approaches. Inf. Sci. 37(3), 158–168 (2019)
Google Scholar
Wang, W., Wang, R., Wang, L., Tang, B.: Adversarial examples generation approach for tendency classification on Chinese texts. Ruan Jian Xue Bao/J. Softw. 30, 1–14 (2019). in Chinese
Google Scholar
Wu, Z., Tian, L., Li, P., Wu, T., Jiang, M., Wu, C.: Generating stable biometric keys for flexible cloud computing authentication using finger vein. Inf. Sci. 433, 431–447 (2018)
Article Google Scholar
Wu, Z., Kang, J., Jiang, Q.: Semantic key generation based on natural language. Int. J. Intell. Syst. 37(7), 4041–4064 (2021)
Article Google Scholar
Wu, Z., Lv, Z., Kang, J., Ding, W., Zhang, J.: Fingerprint bio-key generation based on a deep neural network. Int. J. Intell. Syst. (2021)
Google Scholar

Download references

Acknowledgements

This research is funded by National Key R &D Program of China (No.2018YFB0804102), Key Projects of NSFC Joint Fund of China (No. U1866209), National Natural Science Foundation of China (No. 61772162).

Author information

Authors and Affiliations

Hangzhou Dianzi University, Hangzhou, China
Qian Jiang, Jie Kang & Zhendong Wu

Authors

Qian Jiang
View author publications
You can also search for this author in PubMed Google Scholar
Jie Kang
View author publications
You can also search for this author in PubMed Google Scholar
Zhendong Wu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Zhendong Wu .

Editor information

Editors and Affiliations

School of Computing and Informatics, University of Louisiana at Lafayette, Lafayette, IN, USA
Yuan Xu
Institute of Artificial Intelligence and Blockchain, Guangzhou University, Guangzhou, China
Hongyang Yan
Institute of Artificial Intelligence and Blockchain, Guangzhou University, Guangzhou, China
Huang Teng
Guangdong Polytechnic Normal University, Guangzhou, China
Jun Cai
Institute of Artificial Intelligence and Blockchain, Guangzhou University, Guangzhou, China
Jin Li

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Jiang, Q., Kang, J., Wu, Z. (2023). Performance Improvement of Classification Model Based on Adversarial Sample Generation. In: Xu, Y., Yan, H., Teng, H., Cai, J., Li, J. (eds) Machine Learning for Cyber Security. ML4CS 2022. Lecture Notes in Computer Science, vol 13656. Springer, Cham. https://doi.org/10.1007/978-3-031-20099-1_6

Download citation

DOI: https://doi.org/10.1007/978-3-031-20099-1_6
Published: 13 January 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-20098-4
Online ISBN: 978-3-031-20099-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Performance Improvement of Classification Model Based on Adversarial Sample Generation