Conferences >2019 7th International Worksh...

Automatic Malware Clustering using Word Embeddings and Unsupervised Learning

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

Malware has been established as one of the major threats in the cyberspace. Current mitigation efforts are focused in suspicious files disclosure, omitting key aspects in...Show More

Metadata

Abstract:

Malware has been established as one of the major threats in the cyberspace. Current mitigation efforts are focused in suspicious files disclosure, omitting key aspects in detection, such as category clustering. While state-of-the-art provides significant advances in machine learning-based malware classification, most works solve binary classification problems. In this article, a methodology for automatic clustering of malware using NLP and unsupervised learning techniques is proposed. The latter is done by identifying malicious system calls (syscalls) from different binaries; then modelled in a textually manner to extract the most relevant features employing a statistical technique named TF-IDF. Then, a semantic and contextual representation of each syscall is computed by Word2Vec, a well-known word embedding algorithm. Weighted syscalls are subjected to KNN algorithm to find latent malware categories. A case study proves it is possible to cluster at least 60 new malware categories.

Published in: 2019 7th International Workshop on Biometrics and Forensics (IWBF)

Date of Conference: 02-03 May 2019

Date Added to IEEE Xplore: 21 June 2019

ISBN Information:

DOI: 10.1109/IWBF.2019.8739186

Conference Location: Cancun, Mexico

Contents

References is not available for this document.

Automatic Malware Clustering using Word Embeddings and Unsupervised Learning

Abstract:

Metadata

Abstract:

References

IEEE Account

Purchase Details

Profile Information

Need Help?

Automatic Malware Clustering using Word Embeddings and Unsupervised Learning

Alerts

Abstract:

Metadata

Abstract:

References

IEEE Account

Purchase Details

Profile Information

Need Help?