Abstract
Twitter spam is used to describe any form of unwanted or unsolicited communications, tweets, or activities that users encounter on the social media platform Twitter. This can include things like spam messages from fake accounts, tweets with links to malicious or scam websites, or automated tweets that are sent out in large quantities. Training a model to detect patterns in tweets indicative of spam. The process typically involves collecting a large dataset of labeled tweets, and then using this dataset to train a machine learning model. The proposed model is divided into two parts that operate on the input dataset, which includes tweets and additional meta-data such as the number of followers and actions. The first part focuses on the text of the tweets, using the GLoVe language model to extract vocabulary features which are then used to detect spam using an LSTM deep learning model. The second part utilizes meta-data from the tweets and additional meta-heuristic features, such as the tweet length and number of question marks, and classifies them using a CNN model. The final decision is determined by combining the results from both the LSTM and CNN models.











Similar content being viewed by others
Data Availability
The dataset generated and analyzed during the current study are available from the corresponding author on reasonable request.
References
Pardo C, Pagani M, Savinien J. The strategic role of social media in business-to-business contexts. Ind Mark Manage. 2022;101:82–97.
Bayer JB, Anderson IA, Tokunaga R. Building and breaking social media habits. Curr Opin Psychol. 2022;45:101303.
Mahdikhani M. Predicting the popularity of tweets by analyzing public opinion and emotions in different stages of Covid-19 pandemic. Int J Inf Manag Data Insights. 2022;2(1): 100053.
Ahmed N, Amin R, Aldabbas H, Koundal D, Alouffi B, Shah T. Machine learning techniques for spam detection in email and IoT platforms: analysis and research challenges. Secur Commun Netw. 2022;2022:1–19.
Sahoo SR, Gupta BB, Peraković D, Peñalvo FJG, Cvitić I. Spammer detection approaches in online social network (OSNs): a survey. In: Sustainable Management of Manufacturing Systems in Industry 4.0. Cham: Springer International Publishing; 2022. p. 159–80.
Ghanem R, Erbay H. Spam detection on social networks using deep contextualized word representation. Multimed Tools Appl. 2022;82:3697–712.
Bacanin N, Zivkovic M, Stoean C, Antonijevic M, Janicijevic S, Sarac M, Strumberger I. Application of natural language processing and machine learning boosted with swarm intelligence for spam email filtering. Mathematics. 2022;10(22):4173.
Alsmadi I, Ahmad K, Nazzal M, Alam F, Al-Fuqaha A, Khreishah A, Algosaibi A. Adversarial nlp for social network applications: Attacks, defenses, and research directions. IEEE Transactions on Computational Social Systems; 2022.
Alom Z, Carminati B, Ferrari E. A deep learning model for Twitter spam detection. Online Soc Netw Media. 2020;18: 100079.
Alsaffar D, Alfahhad A, Alqhtani B, Alamri L, Alansari S, Alqahtani N, Alboaneen DA. Machine and deep learning algorithms for Twitter spam detection. In: Proceedings of the International Conference on Advanced Intelligent Systems and Informatics 2019. Springer International Publishing; 2020. p. 483–91.
Jain G, Sharma M, Agarwal B. Spam detection in social media using convolutional and long short term memory neural network. Ann Math Artif Intell. 2019;85(1):21–44.
Ahmad SB, Sheikh MR, Ghorabie SM. Spam detection on Twitter using a support vector machine and users’ features by identifying their interactions. Multim Tools Appl. 2021;80(8):11583–605.
Jacob WS. Multi-objective genetic algorithm and CNN-based deep learning architectural scheme for effective spam detection. Int J Intell Netw. 2022;3:9–15.
Vidya Kumari KR, Kavitha CR. Spam detection using machine learning in R. In: International Conference on Computer Networks and Communication Technologies: ICCNCT 2018. Singapore: Springer; 2019. p. 55–64.
Thomas M, Meshram BB. ChSO-DNFNet: Spam detection in Twitter using feature fusion and optimized Deep Neuro Fuzzy Network. Adv Eng Softw. 2023;175: 103333.
Kawintiranon K, Singh L, Budak C. Traditional and context-specific spam detection in low resource settings. Mach Learn. 2022;111(7):2515–36.
Elakkiya E, Selvakumar S, LeelaVelusamy R. TextSpamDetector: textual content based deep learning framework for social spam detection using conjoint attention mechanism. J Ambient Intell Humaniz Comput. 2021;12:9287–302.
Jain G, Sharma M, Agarwal B. Optimizing semantic LSTM for spam detection. Int J Inf Technol. 2019;11:239–50.
Mubarak H, Abdelali A, Hassan S, Darwish K. Spam detection on arabic twitter. In: Social Informatics: 12th International Conference, SocInfo 2020, Pisa, Italy, October 6–9, 2020, Proceedings 12. Springer International Publishing; 2020. p. 237–251.
Ghanem R, Erbay H. Context-dependent model for spam detection on social networks. SN Applied Sciences. 2020;2:1–8.
Acknowledgements
The authors acknowledged the Lovely Professional University, Phagwara, Punjab, India and Neil Gogte Institute of Technology, Hyderabad, India for supporting the research work by providing the facilities.
Funding
No funding received for this research.
Author information
Authors and Affiliations
Contributions
This research endeavor was made possible by the collaboration and contributions of all authors.
Corresponding author
Ethics declarations
Conflict of interest
No conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
This article is part of the topical collection “Advances in Computational Approaches for Image Processing, Wireless Networks, Cloud Applications and Network Security” guest edited by P. Raviraj, Maode Ma and Roopashree H R.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Manasa, P., Malik, A. & Batra, I. Detection of Twitter Spam Using GLoVe Vocabulary Features, Bidirectional LSTM and Convolution Neural Network. SN COMPUT. SCI. 5, 206 (2024). https://doi.org/10.1007/s42979-023-02518-1
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s42979-023-02518-1