ABSTRACT
This paper presents a pioneering investigation into a novel form of scam advertising method on YouTube, termed "social scam bots'' (SSBs). These bots have evolved to emulate benign user behavior by posting comments and engaging with other users, oftentimes appearing prominently among the top rated comments. We analyzed the YouTube video comments and proposed a method to identify SSBs and extract the underlying scam domains. Our study revealed 1,134 SSBs promoting 72 scam campaigns responsible for infecting 31.73% of crawled videos. Further investigation revealed that SSBs exhibit advances that surpass traditional bots. Notably, they targeted specific audience by aligning scam campaigns with related video content, effectively leveraging the YouTube recommendation algorithm. We monitored these SSBs over a period of six months, enabling us to evaluate the effectiveness of YouTube's mitigation efforts. We also uncovered various strategies they use to evade mitigation attempts, including a novel strategy called "self-engagement," aimed at boosting their comment ranking. By shedding light on the phenomenon of SSBs and their evolving tactics, our study aims to raise awareness and contribute to the prevention of these malicious actors, ultimately fostering a safer online platform.
- [n. d.]. YOUTUBE USER STATISTICS 2022. https://www.globalmediainsight. com/blog/youtube-users-statistics/#stat. Accessed: 2022--11-01.Google Scholar
- 2023. YouTube policies - External links policy. https://support.google.com/ youtube/answer/9054257. Accessed: 2023-01-17.Google Scholar
- 2023. YouTube policies - Spam, deceptive practices, & scams policies. https: //support.google.com/youtube/answer/2801973. Accessed: 2023-01-17.Google Scholar
- Norah Abokhodair, Daisy Yoo, and David W McDonald. 2015. Dissecting a social botnet: Growth, content and influence in Twitter. In Proceedings of the 18th ACM conference on computer supported cooperative work & social computing. 839--851.Google ScholarDigital Library
- Faraz Ahmed and Muhammad Abulaish. 2013. A generic statistical approach for spam detection in online social networks. Computer Communications 36, 10--11 (2013), 1120--1129.Google ScholarCross Ref
- Túlio C Alberto, Johannes V Lochter, and Tiago A Almeida. 2015. Tubespam: Comment spam filtering on youtube. In 2015 IEEE 14th international conference on machine learning and applications (ICMLA). IEEE, 138--143.Google ScholarCross Ref
- Victor Benjamin and TS Raghu. 2022. Augmenting Social Bot Detection with Crowd-Generated Labels. Information Systems Research (2022).Google Scholar
- Elijah Bouma-Sims and Brad Reaves. 2021. A First Look at Scams on YouTube. arXiv preprint arXiv:2104.06515 (2021).Google Scholar
- Chiyu Cai, Linjing Li, and Daniel Zeng. 2017. Detecting social bots by jointly modeling deep behavior and content information. In Proceedings of the 2017 ACM on Conference on Information and Knowledge Management. 1995--1998.Google ScholarDigital Library
- Vidushi Chaudhary and Ashish Sureka. 2013. Contextual feature based one-class classifier approach for detecting video response spam on youtube. In 2013 Eleventh Annual Conference on Privacy, Security and Trust. IEEE, 195--204.Google ScholarCross Ref
- Rashid Chowdury, Md Nuruddin Monsur Adnan, GAN Mahmud, and Rashedur M Rahman. 2013. A data mining based spam detection system for youtube. In Eighth international conference on digital information management (ICDIM 2013). IEEE, 373--378.Google ScholarCross Ref
- Stefano Cresci. 2020. A decade of social bot detection. Commun. ACM 63, 10 (2020), 72--83.Google ScholarDigital Library
- Stefano Cresci, Roberto Di Pietro, Marinella Petrocchi, Angelo Spognardi, and Maurizio Tesconi. 2017. The paradigm-shift of social spambots: Evidence, theories, and tools for the arms race. In Proceedings of the 26th international conference on world wide web companion. 963--972.Google ScholarDigital Library
- Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2018. Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018).Google Scholar
- Martin Ester, Hans-Peter Kriegel, Jörg Sander, Xiaowei Xu, et al . 1996. A density- based algorithm for discovering clusters in large spatial databases with noise.. In kdd, Vol. 96. 226--231.Google Scholar
- European Commission. 2016. Regulation (EU) 2016/679 of the European Parliament and of the Council of 27 April 2016 on the protection of natural persons with regard to the processing of personal data and on the free movement of such data, and repealing Directive 95/46/EC (General Data Protection Regulation) (Text with EEA relevance). https://eur-lex.europa.eu/eli/reg/2016/679/ojGoogle Scholar
- Emilio Ferrara, Onur Varol, Clayton Davis, Filippo Menczer, and Alessandro Flammini. 2016. The rise of social bots. Commun. ACM 59, 7 (2016), 96--104.Google ScholarDigital Library
- Yuriy Gorodnichenko, Tho Pham, and Oleksandr Talavera. 2021. Social media, sentiment and public opinions: Evidence from# Brexit and# USElection. European Economic Review 136 (2021), 103772.Google ScholarCross Ref
- Christian Grimme, Dennis Assenmacher, and Lena Adam. 2018. Changing perspectives: Is it sufficient to detect social bots?. In International conference on social computing and social media. Springer, 445--461.Google ScholarDigital Library
- Nir Grinberg, Kenneth Joseph, Lisa Friedland, Briony Swire-Thompson, and David Lazer. 2019. Fake news on Twitter during the 2016 US presidential election. Science 363, 6425 (2019), 374--378.Google Scholar
- Maryam Heidari and James H Jones. 2020. Using bert to extract topic-independent sentiment features for social media bot detection. In 2020 11th IEEE Annual Ubiquitous Computing, Electronics & Mobile Communication Conference (UEMCON). IEEE, 0542--0547.Google Scholar
- Maryam Heidari, James H Jones, and Ozlem Uzuner. 2020. Deep contextualized word embedding for text-based online user profiling to detect social bots on twitter. In 2020 International Conference on Data Mining Workshops (ICDMW). IEEE, 480--487.Google ScholarCross Ref
- Maryam Heidari, James H Jones Jr, and Ozlem Uzuner. 2022. Online User Profiling to Detect Social Bots on Twitter. arXiv preprint arXiv:2203.05966 (2022).Google Scholar
- Roope Jaakonmäki, Oliver Müller, and Jan Vom Brocke. 2017. The impact of content, context, and creator on user engagement in social media marketing. In Proceedings of the Annual Hawaii International Conference on System Sciences, Vol. 50. IEEE Computer Society Press, 1152--1160.Google Scholar
- Yinhan Liu, Myle Ott, Naman Goyal, Jingfei Du, Mandar Joshi, Danqi Chen, Omer Levy, Mike Lewis, Luke Zettlemoyer, and Veselin Stoyanov. 2019. Roberta: A robustly optimized bert pretraining approach. arXiv preprint arXiv:1907.11692 (2019).Google Scholar
- Luca Luceri, Ashok Deb, Adam Badawy, and Emilio Ferrara. 2019. Red bots do it better: Comparative analysis of social bot partisan behavior. In Companion proceedings of the 2019 World Wide Web conference. 1007--1012.Google ScholarDigital Library
- Federico Maggi, Alessandro Frossi, Stefano Zanero, Gianluca Stringhini, Brett Stone-Gross, Christopher Kruegel, and Giovanni Vigna. 2013. Two years of short urls internet measurement: security threats and countermeasures. In proceedings of the 22nd international conference on World Wide Web. 861--872.Google ScholarDigital Library
- Michele Mazza, Marco Avvenuti, Stefano Cresci, and Maurizio Tesconi. 2022. Investigating the difference between trolls, social bots, and humans on Twitter. Computer Communications 196 (2022), 23--36.Google ScholarDigital Library
- Guanyi Mou and Kyumin Lee. 2020. Malicious bot detection in online social networks: arming handcrafted features with deep learning. In International Conference on Social Informatics. Springer, 220--236.Google ScholarDigital Library
- Yew Chuan Ong. 2020. Characterising and Detecting Social Bots. Ph. D. Disserta- tion. University of Sheffield.Google Scholar
- Nils Reimers and Iryna Gurevych. 2019. Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics. https://arxiv.org/abs/1908.10084Google ScholarCross Ref
- Chengcheng Shao, Giovanni Luca Ciampaglia, Onur Varol, Kai-Cheng Yang, Alessandro Flammini, and Filippo Menczer. 2018. The spread of low-credibility content by social bots. Nature communications 9, 1 (2018), 1--9.Google Scholar
- Nisha P Shetty, Balachandra Muniyal, Arshia Anand, and Sushant Kumar. 2022. An enhanced sybil guard to detect bots in online social networks. Journal of Cyber Security and Mobility (2022), 105--126.Google Scholar
- Aashish Singh. [n. d.]. YouTube users increasingly concerned with adult com- ments (sex bots) spamming replies on videos. https://piunikaweb.com/2022/12/19/ youtube-users-concerned-with-adult-comments-spamming-replies/. Accessed: 2022--11-01.Google Scholar
- Bongwon Suh, Lichan Hong, Peter Pirolli, and Ed H Chi. 2010. Want to be retweeted? large scale analytics on factors impacting retweet in twitter network. In 2010 IEEE second international conference on social computing. IEEE, 177--184.Google Scholar
- The YouTube Team. [n. d.]. More updates on our actions related to the safety of minors on YouTube. https://blog.youtube/news-and-events/more-updates-on- our-actions-related-to/. Accessed: 2022--11-01.Google Scholar
- Ashutosh Tripathi, Kusum Kumari Bharti, and Mohona Ghosh. 2019. A study on characterizing the ecosystem of monetizing video spams on youtube platform. In Proceedings of the 21st International Conference on Information Integration and Web-based Applications & Services. 222--231.Google ScholarDigital Library
- Onur Varol, Emilio Ferrara, Clayton Davis, Filippo Menczer, and Alessandro Flammini. 2017. Online human-bot interactions: Detection, estimation, and characterization. In Proceedings of the international AAAI conference on web and social media, Vol. 11. 280--289.Google ScholarCross Ref
- Teng Xu, Gerard Goossen, Huseyin Kerem Cevahir, Sara Khodeir, Yingyezhe Jin, Frank Li, Shawn Shan, Sagar Patel, David Freeman, and Paul Pearce. 2021. Deep entity classification: Abusive account detection for online social networks. In 30th {USENIX} Security Symposium ({USENIX} Security 21).Google Scholar
- Dong Yuan, Yuanli Miao, Neil Zhenqiang Gong, Zheng Yang, Qi Li, Dawn Song, Qian Wang, and Xiao Liang. 2019. Detecting fake accounts in online social networks at the time of registrations. In Proceedings of the 2019 ACM SIGSAC conference on computer and communications security. 1423--1438.Google ScholarDigital Library
- Savvas Zannettou, Sotirios Chatzis, Kostantinos Papadamou, and Michael Siriv- ianos. 2018. The good, the bad and the bait: Detecting and characterizing clickbait on youtube. In 2018 IEEE Security and Privacy Workshops (SPW). IEEE, 63--69Google ScholarCross Ref
Index Terms
- Evolving Bots: The New Generation of Comment Bots and their Underlying Scam Campaigns in YouTube
Recommendations
Detecting Bots Based on Keylogging Activities
ARES '08: Proceedings of the 2008 Third International Conference on Availability, Reliability and SecurityA bot is a piece of software that is usually installed on an infected machine without the user’s knowledge. A bot is controlled remotely by the attacker under a Command and Control structure. Recent statistics show that bots represent one of the fastest ...
Behavioural Correlation for Detecting P2P Bots
ICFN '10: Proceedings of the 2010 Second International Conference on Future NetworksIn the past few years, IRC bots, malicious programs which are remotely controlled by attackers through IRC servers, have become a major threat to the Internet and for users. These bots can be used in different malicious ways such as issuing distributed ...
Humans and bots in internet chat: measurement, analysis, and automated classification
The abuse of chat services by automated programs, known as chat bots, poses a serious threat to Internet users. Chat bots target popular chat networks to distribute spam and malware. In this paper, we first conduct a series of measurements on a large0 ...
Comments