research-article

Evolving Bots: The New Generation of Comment Bots and their Underlying Scam Campaigns in YouTube

Authors:
Seung Ho Na

KAIST, Daejeon, Republic of Korea

KAIST, Daejeon, Republic of Korea

0000-0003-0908-1233
View Profile

,
Sumin Cho

KAIST, Daejeon, Republic of Korea

KAIST, Daejeon, Republic of Korea

0009-0008-1506-5807
View Profile

,
Seungwon Shin

KAIST, Daejeon, Republic of Korea

KAIST, Daejeon, Republic of Korea

0000-0002-1077-5606
View Profile

IMC '23: Proceedings of the 2023 ACM on Internet Measurement ConferenceOctober 2023Pages 297–312https://doi.org/10.1145/3618257.3624822

Published:24 October 2023Publication History

IMC '23: Proceedings of the 2023 ACM on Internet Measurement Conference

Pages 297–312

ABSTRACT

This paper presents a pioneering investigation into a novel form of scam advertising method on YouTube, termed "social scam bots'' (SSBs). These bots have evolved to emulate benign user behavior by posting comments and engaging with other users, oftentimes appearing prominently among the top rated comments. We analyzed the YouTube video comments and proposed a method to identify SSBs and extract the underlying scam domains. Our study revealed 1,134 SSBs promoting 72 scam campaigns responsible for infecting 31.73% of crawled videos. Further investigation revealed that SSBs exhibit advances that surpass traditional bots. Notably, they targeted specific audience by aligning scam campaigns with related video content, effectively leveraging the YouTube recommendation algorithm. We monitored these SSBs over a period of six months, enabling us to evaluate the effectiveness of YouTube's mitigation efforts. We also uncovered various strategies they use to evade mitigation attempts, including a novel strategy called "self-engagement," aimed at boosting their comment ranking. By shedding light on the phenomenon of SSBs and their evolving tactics, our study aims to raise awareness and contribute to the prevention of these malicious actors, ultimately fostering a safer online platform.

References

[n. d.]. YOUTUBE USER STATISTICS 2022. https://www.globalmediainsight. com/blog/youtube-users-statistics/#stat. Accessed: 2022--11-01.Google Scholar
2023. YouTube policies - External links policy. https://support.google.com/ youtube/answer/9054257. Accessed: 2023-01-17.Google Scholar
2023. YouTube policies - Spam, deceptive practices, & scams policies. https: //support.google.com/youtube/answer/2801973. Accessed: 2023-01-17.Google Scholar
Norah Abokhodair, Daisy Yoo, and David W McDonald. 2015. Dissecting a social botnet: Growth, content and influence in Twitter. In Proceedings of the 18th ACM conference on computer supported cooperative work & social computing. 839--851.Google ScholarDigital Library
Faraz Ahmed and Muhammad Abulaish. 2013. A generic statistical approach for spam detection in online social networks. Computer Communications 36, 10--11 (2013), 1120--1129.Google ScholarCross Ref
Túlio C Alberto, Johannes V Lochter, and Tiago A Almeida. 2015. Tubespam: Comment spam filtering on youtube. In 2015 IEEE 14th international conference on machine learning and applications (ICMLA). IEEE, 138--143.Google ScholarCross Ref
Victor Benjamin and TS Raghu. 2022. Augmenting Social Bot Detection with Crowd-Generated Labels. Information Systems Research (2022).Google Scholar
Elijah Bouma-Sims and Brad Reaves. 2021. A First Look at Scams on YouTube. arXiv preprint arXiv:2104.06515 (2021).Google Scholar
Chiyu Cai, Linjing Li, and Daniel Zeng. 2017. Detecting social bots by jointly modeling deep behavior and content information. In Proceedings of the 2017 ACM on Conference on Information and Knowledge Management. 1995--1998.Google ScholarDigital Library
Vidushi Chaudhary and Ashish Sureka. 2013. Contextual feature based one-class classifier approach for detecting video response spam on youtube. In 2013 Eleventh Annual Conference on Privacy, Security and Trust. IEEE, 195--204.Google ScholarCross Ref
Rashid Chowdury, Md Nuruddin Monsur Adnan, GAN Mahmud, and Rashedur M Rahman. 2013. A data mining based spam detection system for youtube. In Eighth international conference on digital information management (ICDIM 2013). IEEE, 373--378.Google ScholarCross Ref
Stefano Cresci. 2020. A decade of social bot detection. Commun. ACM 63, 10 (2020), 72--83.Google ScholarDigital Library
Stefano Cresci, Roberto Di Pietro, Marinella Petrocchi, Angelo Spognardi, and Maurizio Tesconi. 2017. The paradigm-shift of social spambots: Evidence, theories, and tools for the arms race. In Proceedings of the 26th international conference on world wide web companion. 963--972.Google ScholarDigital Library
Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2018. Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018).Google Scholar
Martin Ester, Hans-Peter Kriegel, Jörg Sander, Xiaowei Xu, et al . 1996. A density- based algorithm for discovering clusters in large spatial databases with noise.. In kdd, Vol. 96. 226--231.Google Scholar
European Commission. 2016. Regulation (EU) 2016/679 of the European Parliament and of the Council of 27 April 2016 on the protection of natural persons with regard to the processing of personal data and on the free movement of such data, and repealing Directive 95/46/EC (General Data Protection Regulation) (Text with EEA relevance). https://eur-lex.europa.eu/eli/reg/2016/679/ojGoogle Scholar
Emilio Ferrara, Onur Varol, Clayton Davis, Filippo Menczer, and Alessandro Flammini. 2016. The rise of social bots. Commun. ACM 59, 7 (2016), 96--104.Google ScholarDigital Library
Yuriy Gorodnichenko, Tho Pham, and Oleksandr Talavera. 2021. Social media, sentiment and public opinions: Evidence from# Brexit and# USElection. European Economic Review 136 (2021), 103772.Google ScholarCross Ref
Christian Grimme, Dennis Assenmacher, and Lena Adam. 2018. Changing perspectives: Is it sufficient to detect social bots?. In International conference on social computing and social media. Springer, 445--461.Google ScholarDigital Library
Nir Grinberg, Kenneth Joseph, Lisa Friedland, Briony Swire-Thompson, and David Lazer. 2019. Fake news on Twitter during the 2016 US presidential election. Science 363, 6425 (2019), 374--378.Google Scholar
Maryam Heidari and James H Jones. 2020. Using bert to extract topic-independent sentiment features for social media bot detection. In 2020 11th IEEE Annual Ubiquitous Computing, Electronics & Mobile Communication Conference (UEMCON). IEEE, 0542--0547.Google Scholar
Maryam Heidari, James H Jones, and Ozlem Uzuner. 2020. Deep contextualized word embedding for text-based online user profiling to detect social bots on twitter. In 2020 International Conference on Data Mining Workshops (ICDMW). IEEE, 480--487.Google ScholarCross Ref
Maryam Heidari, James H Jones Jr, and Ozlem Uzuner. 2022. Online User Profiling to Detect Social Bots on Twitter. arXiv preprint arXiv:2203.05966 (2022).Google Scholar
Roope Jaakonmäki, Oliver Müller, and Jan Vom Brocke. 2017. The impact of content, context, and creator on user engagement in social media marketing. In Proceedings of the Annual Hawaii International Conference on System Sciences, Vol. 50. IEEE Computer Society Press, 1152--1160.Google Scholar
Yinhan Liu, Myle Ott, Naman Goyal, Jingfei Du, Mandar Joshi, Danqi Chen, Omer Levy, Mike Lewis, Luke Zettlemoyer, and Veselin Stoyanov. 2019. Roberta: A robustly optimized bert pretraining approach. arXiv preprint arXiv:1907.11692 (2019).Google Scholar
Luca Luceri, Ashok Deb, Adam Badawy, and Emilio Ferrara. 2019. Red bots do it better: Comparative analysis of social bot partisan behavior. In Companion proceedings of the 2019 World Wide Web conference. 1007--1012.Google ScholarDigital Library
Federico Maggi, Alessandro Frossi, Stefano Zanero, Gianluca Stringhini, Brett Stone-Gross, Christopher Kruegel, and Giovanni Vigna. 2013. Two years of short urls internet measurement: security threats and countermeasures. In proceedings of the 22nd international conference on World Wide Web. 861--872.Google ScholarDigital Library
Michele Mazza, Marco Avvenuti, Stefano Cresci, and Maurizio Tesconi. 2022. Investigating the difference between trolls, social bots, and humans on Twitter. Computer Communications 196 (2022), 23--36.Google ScholarDigital Library
Guanyi Mou and Kyumin Lee. 2020. Malicious bot detection in online social networks: arming handcrafted features with deep learning. In International Conference on Social Informatics. Springer, 220--236.Google ScholarDigital Library
Yew Chuan Ong. 2020. Characterising and Detecting Social Bots. Ph. D. Disserta- tion. University of Sheffield.Google Scholar
Nils Reimers and Iryna Gurevych. 2019. Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics. https://arxiv.org/abs/1908.10084Google ScholarCross Ref
Chengcheng Shao, Giovanni Luca Ciampaglia, Onur Varol, Kai-Cheng Yang, Alessandro Flammini, and Filippo Menczer. 2018. The spread of low-credibility content by social bots. Nature communications 9, 1 (2018), 1--9.Google Scholar
Nisha P Shetty, Balachandra Muniyal, Arshia Anand, and Sushant Kumar. 2022. An enhanced sybil guard to detect bots in online social networks. Journal of Cyber Security and Mobility (2022), 105--126.Google Scholar
Aashish Singh. [n. d.]. YouTube users increasingly concerned with adult com- ments (sex bots) spamming replies on videos. https://piunikaweb.com/2022/12/19/ youtube-users-concerned-with-adult-comments-spamming-replies/. Accessed: 2022--11-01.Google Scholar
Bongwon Suh, Lichan Hong, Peter Pirolli, and Ed H Chi. 2010. Want to be retweeted? large scale analytics on factors impacting retweet in twitter network. In 2010 IEEE second international conference on social computing. IEEE, 177--184.Google Scholar
The YouTube Team. [n. d.]. More updates on our actions related to the safety of minors on YouTube. https://blog.youtube/news-and-events/more-updates-on- our-actions-related-to/. Accessed: 2022--11-01.Google Scholar
Ashutosh Tripathi, Kusum Kumari Bharti, and Mohona Ghosh. 2019. A study on characterizing the ecosystem of monetizing video spams on youtube platform. In Proceedings of the 21st International Conference on Information Integration and Web-based Applications & Services. 222--231.Google ScholarDigital Library
Onur Varol, Emilio Ferrara, Clayton Davis, Filippo Menczer, and Alessandro Flammini. 2017. Online human-bot interactions: Detection, estimation, and characterization. In Proceedings of the international AAAI conference on web and social media, Vol. 11. 280--289.Google ScholarCross Ref
Teng Xu, Gerard Goossen, Huseyin Kerem Cevahir, Sara Khodeir, Yingyezhe Jin, Frank Li, Shawn Shan, Sagar Patel, David Freeman, and Paul Pearce. 2021. Deep entity classification: Abusive account detection for online social networks. In 30th {USENIX} Security Symposium ({USENIX} Security 21).Google Scholar
Dong Yuan, Yuanli Miao, Neil Zhenqiang Gong, Zheng Yang, Qi Li, Dawn Song, Qian Wang, and Xiao Liang. 2019. Detecting fake accounts in online social networks at the time of registrations. In Proceedings of the 2019 ACM SIGSAC conference on computer and communications security. 1423--1438.Google ScholarDigital Library
Savvas Zannettou, Sotirios Chatzis, Kostantinos Papadamou, and Michael Siriv- ianos. 2018. The good, the bad and the bait: Detecting and characterizing clickbait on youtube. In 2018 IEEE Security and Privacy Workshops (SPW). IEEE, 63--69Google ScholarCross Ref

Index Terms

Evolving Bots: The New Generation of Comment Bots and their Underlying Scam Campaigns in YouTube
1. Information systems
  1. World Wide Web

Recommendations

Detecting Bots Based on Keylogging Activities
ARES '08: Proceedings of the 2008 Third International Conference on Availability, Reliability and Security

A bot is a piece of software that is usually installed on an infected machine without the user’s knowledge. A bot is controlled remotely by the attacker under a Command and Control structure. Recent statistics show that bots represent one of the fastest ...
Read More
Behavioural Correlation for Detecting P2P Bots
ICFN '10: Proceedings of the 2010 Second International Conference on Future Networks

In the past few years, IRC bots, malicious programs which are remotely controlled by attackers through IRC servers, have become a major threat to the Internet and for users. These bots can be used in different malicious ways such as issuing distributed ...
Read More
Humans and bots in internet chat: measurement, analysis, and automated classification

The abuse of chat services by automated programs, known as chat bots, poses a serious threat to Internet users. Chat bots target popular chat networks to distribute spam and malware. In this paper, we first conduct a series of measurements on a large0 ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
IMC '23: Proceedings of the 2023 ACM on Internet Measurement Conference
October 2023
746 pages
ISBN:9798400703829
DOI:10.1145/3618257
General Chairs:
Marie-José Montpetit
McGill University, Canada
,
Aris Leivadeas
École de Technologie Supérieure, Canada
,
Program Chairs:
Steve Uhlig
Queen Mary University of London, United Kingdom
,
Mobin Javed
Lahore University of Management Sciences, Pakistan
Copyright © 2023 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 24 October 2023
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
bot
measurement
scam
web
youtube
Qualifiers
- research-article
Conference

Acceptance Rates
Overall Acceptance Rate277of1,083submissions,26%
Upcoming Conference
IMC '24

Sponsor:

sigcomm

sigcomm

ACM Internet Measurement Conference

November 4 - 6, 2024

Madrid , AA , Spain
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 0
  Total Citations
  View Citations
- 198
  Total Downloads
- Downloads (Last 12 months)198
- Downloads (Last 6 weeks)39
Other Metrics
View Author Metrics
Cited By
This publication has not been cited yet

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Evolving Bots: The New Generation of Comment Bots and their Underlying Scam Campaigns in YouTube

IMC '23: Proceedings of the 2023 ACM on Internet Measurement Conference

ABSTRACT

References

Cited By

Index Terms

Recommendations

Detecting Bots Based on Keylogging Activities

Behavioural Correlation for Detecting P2P Bots

Humans and bots in internet chat: measurement, analysis, and automated classification