Research on Evasion and Detection of Malicious JavaScript Code

Ma, Yujie; Wu, Haokai; Tan, Yu-an; Li, Yuanzhang

doi:10.1007/978-981-97-2458-1_8

Yujie Ma⁹,
Haokai Wu⁹,
Yu-an Tan⁹ &
…
Yuanzhang Li⁹

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 14541))

Included in the following conference series:

International Conference on Machine Learning for Cyber Security

37 Accesses

Abstract

This thesis analyzes the malicious essence of malicious JavaScript and the implementation of malicious functions. Then, this thesis combines the result with the taint analysis technology in the field of software vulnerability analysis, and proposes a new malicious JavaScript detection method based on taint analysis. This method defines the taint source and taint sink point according to the implementation of malicious code functions, and then performs taint propagation on the abstract syntax tree of the code to obtain the characteristics of the code. After forming a feature vector through the process, this thesis finally uses machine learning models to complete detection. Experimental results show that the method can well complete the binary classification of malicious and benign samples, and the detection effect on the obfuscated samples is significantly better than mainstream online anti-malware engines. Code obfuscation can hardly affect detection results of this method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 119.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Bhatia, T., Kaushal, R.: Malware detection in android based on dynamic analysis. In: 2017 International Conference on Cyber Security and Protection of Digital Services (Cyber Security), pp. 1–6. IEEE (2017)
Google Scholar
Chen, P., Gong, M.L.: The vulnerability detection method based on compression coding of abstract syntax tree. J. Inf. Secur. Res. (2022)
Google Scholar
Cui, Z., Xue, F., Cai, X., Cao, Y., Wang, G.G., Chen, J.: Detection of malicious code variants based on deep learning. IEEE Trans. Ind. Inf. 14(7), 3187–3196 (2018)
Google Scholar
Ding, Y., Xia, X., Chen, S., Li, Y.: A malware detection method based on family behavior graph. Comput. Secur. 73, 73–86 (2018)
Article Google Scholar
Edwards, M., Xie, X.: Graph based convolutional neural network. arXiv preprint arXiv:1609.08965 (2016)
Enck, W., et al.: TaintDroid: an information-flow tracking system for realtime privacy monitoring on smartphones. ACM Trans. Comput. Syst. (TOCS) 32(2), 1–29 (2014)
Article Google Scholar
Jayasundara, V., Bui, N.D.Q., Jiang, L., Lo, D.: TreeCaps: tree-structured capsule networks for program source code processing. arXiv preprint arXiv:1910.12306 (2019)
Jindal, C., Salls, C., Aghakhani, H., Long, K., Kruegel, C., Vigna, G.: Neurlux: dynamic malware analysis without feature engineering. In: Proceedings of the 35th Annual Computer Security Applications Conference, pp. 444–455 (2019)
Google Scholar
Karim, R., Tip, F., Sochrková, A., Sen, K.: Platform-independent dynamic taint analysis for JavaScript. IEEE Trans. Softw. Eng. 46(12), 1364–1379 (2018)
Article Google Scholar
Kreindl, J., Bonetta, D., Stadler, L., Leopoldseder, D., Mössenböck, H.: Multi-language dynamic taint analysis in a polyglot virtual machine. In: Proceedings of the 17th International Conference on Managed Programming Languages and Runtimes, pp. 15–29 (2020)
Google Scholar
Li: Research on JavaScript malicious code detection model based on anti-obfuscated technology. Master’s thesis, Beijing University of Posts and Telecommunications (2019)
Google Scholar
Liang, B., Pang, S., Yue, Z.: A malware detection method based on hybrid learning. Acta Electron. Sin. 49(2), 286 (2021)
Google Scholar
Likarish, P., Jung, E., Jo, I.: Obfuscated malicious JavaScript detection using classification techniques. In: 2009 4th International Conference on Malicious and Unwanted Software (MALWARE), pp. 47–54. IEEE (2009)
Google Scholar
Ming, J., Wu, D., Xiao, G., Wang, J., Liu, P.: TaintPipe: pipelined symbolic taint analysis. In: 24th USENIX Security Symposium (USENIX Security 15), pp. 65–80 (2015)
Google Scholar
Mou, L., Li, G., Zhang, L., Wang, T., Jin, Z.: Convolutional neural networks over tree structures for programming language processing. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 30 (2016)
Google Scholar
Narvekar, A.N., Joshi, K.K.: Security sandbox model for modern web environment. In: 2017 International Conference on Nascent Technologies in Engineering (ICNTE), pp. 1–6. IEEE (2017)
Google Scholar
Nawrocki, M., Wählisch, M., Schmidt, T.C., Keil, C., Schönfelder, J.: A survey on honeypot software and data analysis. arXiv preprint arXiv:1608.06249 (2016)
Rathi, D., Jindal, R.: DroidMark: a tool for android malware detection using taint analysis and Bayesian network. arXiv preprint arXiv:1805.06620 (2018)
Wang, J., Xue, Y., Liu, Y., Tan, T.H.: JSDC: a hybrid approach for JavaScript malware detection and classification. In: Proceedings of the 10th ACM Symposium on Information, Computer and Communications Security, pp. 109–120 (2015)
Google Scholar
Yu, H., Lam, W., Chen, L., Li, G., Xie, T., Wang, Q.: Neural detection of semantic code clones via tree-based convolution. In: 2019 IEEE/ACM 27th International Conference on Program Comprehension (ICPC), pp. 70–80. IEEE (2019)
Google Scholar
Zhang, S.W.: Multi-granularity android malware fast detection based on opcode. Chin. J. Netw. Inf. Secur. 5(6), 85–94 (2019)
Google Scholar

Download references

Author information

Authors and Affiliations

Beijing Institute of Technology, Beijing, 100081, China
Yujie Ma, Haokai Wu, Yu-an Tan & Yuanzhang Li

Authors

Yujie Ma
View author publications
You can also search for this author in PubMed Google Scholar
Haokai Wu
View author publications
You can also search for this author in PubMed Google Scholar
Yu-an Tan
View author publications
You can also search for this author in PubMed Google Scholar
Yuanzhang Li
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yuanzhang Li .

Editor information

Editors and Affiliations

University of Queensland, St Lucia, QLD, Australia
Dan Dongseong Kim
RMIT University, Melbourne, VIC, Australia
Chao Chen

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ma, Y., Wu, H., Tan, Ya., Li, Y. (2024). Research on Evasion and Detection of Malicious JavaScript Code. In: Kim, D.D., Chen, C. (eds) Machine Learning for Cyber Security. ML4CS 2023. Lecture Notes in Computer Science, vol 14541. Springer, Singapore. https://doi.org/10.1007/978-981-97-2458-1_8

Download citation

DOI: https://doi.org/10.1007/978-981-97-2458-1_8
Published: 23 April 2024
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-2457-4
Online ISBN: 978-981-97-2458-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Research on Evasion and Detection of Malicious JavaScript Code