A Method of Machine Learning for Social Bot Detection Combined with Sentiment Analysis

Authors:

Linglin Xia

MLNLP '22: Proceedings of the 2022 5th International Conference on Machine Learning and Natural Language Processing

Pages 239 - 244

https://doi.org/10.1145/3578741.3578790

Published: 06 March 2023

Abstract

Social bots are widespread on major social networks. Malicious actors use them to steer public opinion, steal private user data, and spread rumors, which seriously undermines the security of social networks. Previous approaches mainly extracted large amounts of content but ignored the sentiment features of bots' text, and it is hard to detect social bots based on content alone. In response, this paper proposes a malicious social bot detection method that incorporates sentiment features. It trains a Bidirectional Long Short-Term Memory (Bi-LSTM) model with an attention mechanism to compute sentiment for the online text of social accounts, then analyzes each account's sentiment fluctuations to derive new sentiment features. These new features are combined with metadata features and fed into different machine learning models for analysis and comparison. With this method, the different machine learning detection models achieve higher detection accuracy once sentiment features are added.
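
The feature-combination step described in the abstract can be illustrated with a minimal sketch. It assumes per-post sentiment scores are already available as floats (in the paper these would come from the Bi-LSTM with attention; here they are simply given). The specific fluctuation statistics and the function names `sentiment_fluctuation_features` and `combine_features` are hypothetical choices for illustration; the abstract does not state which fluctuation measures the paper uses.

```python
from statistics import mean, pstdev


def sentiment_fluctuation_features(scores):
    """Summarize how an account's per-post sentiment scores vary.

    Assumption: `scores` is a list of floats, one per post, in posting order.
    The four statistics below are illustrative, not the paper's feature set.
    """
    if not scores:
        return [0.0, 0.0, 0.0, 0.0]
    # Absolute change in sentiment between consecutive posts.
    deltas = [abs(b - a) for a, b in zip(scores, scores[1:])]
    return [
        mean(scores),                      # average sentiment
        pstdev(scores),                    # spread of sentiment
        max(scores) - min(scores),         # sentiment range
        mean(deltas) if deltas else 0.0,   # mean post-to-post change
    ]


def combine_features(metadata, scores):
    """Concatenate account metadata features with the sentiment features."""
    return list(metadata) + sentiment_fluctuation_features(scores)
```

The combined vector could then be passed to any standard classifier for comparison with a metadata-only baseline, which is the kind of comparison the abstract describes.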

Cited By

Hassan M, Hussain M, Maab I, Habib U, Khan M, Masood A. (2024). Detection of Sarcasm in Urdu Tweets Using Deep Learning and Transformer Based Hybrid Approaches. IEEE Access 12, 61542-61555. Online publication date: 2024
https://doi.org/10.1109/ACCESS.2024.3393856

Index Terms

A Method of Machine Learning for Social Bot Detection Combined with Sentiment Analysis
1. Computing methodologies
  1. Artificial intelligence
    1. Natural language processing
      1. Information extraction
  2. Machine learning
    1. Machine learning algorithms
      1. Feature selection

Published In

MLNLP '22: Proceedings of the 2022 5th International Conference on Machine Learning and Natural Language Processing

December 2022

406 pages

ISBN:9781450399067

DOI:10.1145/3578741

Copyright © 2022 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

Research-article
Research
Refereed limited

Conference

MLNLP 2022

MLNLP 2022: 2022 5th International Conference on Machine Learning and Natural Language Processing

December 23 - 25, 2022

Sanya, China

Bibliometrics

Total Citations: 1
Total Downloads: 71
Downloads (Last 12 months): 31
Downloads (Last 6 weeks): 6

Reflects downloads up to 17 Feb 2025
