Skip to main content
Log in

Detection of Twitter Spam Using GLoVe Vocabulary Features, Bidirectional LSTM and Convolution Neural Network

  • Original Research
  • Published:
SN Computer Science Aims and scope Submit manuscript

Abstract

Twitter spam is used to describe any form of unwanted or unsolicited communications, tweets, or activities that users encounter on the social media platform Twitter. This can include things like spam messages from fake accounts, tweets with links to malicious or scam websites, or automated tweets that are sent out in large quantities. Training a model to detect patterns in tweets indicative of spam. The process typically involves collecting a large dataset of labeled tweets, and then using this dataset to train a machine learning model. The proposed model is divided into two parts that operate on the input dataset, which includes tweets and additional meta-data such as the number of followers and actions. The first part focuses on the text of the tweets, using the GLoVe language model to extract vocabulary features which are then used to detect spam using an LSTM deep learning model. The second part utilizes meta-data from the tweets and additional meta-heuristic features, such as the tweet length and number of question marks, and classifies them using a CNN model. The final decision is determined by combining the results from both the LSTM and CNN models.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

Data Availability

The dataset generated and analyzed during the current study are available from the corresponding author on reasonable request.

References

  1. Pardo C, Pagani M, Savinien J. The strategic role of social media in business-to-business contexts. Ind Mark Manage. 2022;101:82–97.

    Article  Google Scholar 

  2. Bayer JB, Anderson IA, Tokunaga R. Building and breaking social media habits. Curr Opin Psychol. 2022;45:101303.

    Article  Google Scholar 

  3. Mahdikhani M. Predicting the popularity of tweets by analyzing public opinion and emotions in different stages of Covid-19 pandemic. Int J Inf Manag Data Insights. 2022;2(1): 100053.

    Google Scholar 

  4. Ahmed N, Amin R, Aldabbas H, Koundal D, Alouffi B, Shah T. Machine learning techniques for spam detection in email and IoT platforms: analysis and research challenges. Secur Commun Netw. 2022;2022:1–19.

    Article  Google Scholar 

  5. Sahoo SR, Gupta BB, Peraković D, Peñalvo FJG, Cvitić I. Spammer detection approaches in online social network (OSNs): a survey. In: Sustainable Management of Manufacturing Systems in Industry 4.0. Cham: Springer International Publishing; 2022. p. 159–80.

    Chapter  Google Scholar 

  6. Ghanem R, Erbay H. Spam detection on social networks using deep contextualized word representation. Multimed Tools Appl. 2022;82:3697–712.

    Article  Google Scholar 

  7. Bacanin N, Zivkovic M, Stoean C, Antonijevic M, Janicijevic S, Sarac M, Strumberger I. Application of natural language processing and machine learning boosted with swarm intelligence for spam email filtering. Mathematics. 2022;10(22):4173.

    Article  Google Scholar 

  8. Alsmadi I, Ahmad K, Nazzal M, Alam F, Al-Fuqaha A, Khreishah A, Algosaibi A. Adversarial nlp for social network applications: Attacks, defenses, and research directions. IEEE Transactions on Computational Social Systems; 2022.

    Google Scholar 

  9. Alom Z, Carminati B, Ferrari E. A deep learning model for Twitter spam detection. Online Soc Netw Media. 2020;18: 100079.

    Article  Google Scholar 

  10. Alsaffar D, Alfahhad A, Alqhtani B, Alamri L, Alansari S, Alqahtani N, Alboaneen DA. Machine and deep learning algorithms for Twitter spam detection. In: Proceedings of the International Conference on Advanced Intelligent Systems and Informatics 2019. Springer International Publishing; 2020. p. 483–91.

    Chapter  Google Scholar 

  11. Jain G, Sharma M, Agarwal B. Spam detection in social media using convolutional and long short term memory neural network. Ann Math Artif Intell. 2019;85(1):21–44.

    Article  Google Scholar 

  12. Ahmad SB, Sheikh MR, Ghorabie SM. Spam detection on Twitter using a support vector machine and users’ features by identifying their interactions. Multim Tools Appl. 2021;80(8):11583–605.

    Article  Google Scholar 

  13. Jacob WS. Multi-objective genetic algorithm and CNN-based deep learning architectural scheme for effective spam detection. Int J Intell Netw. 2022;3:9–15.

    Google Scholar 

  14. Vidya Kumari KR, Kavitha CR. Spam detection using machine learning in R. In: International Conference on Computer Networks and Communication Technologies: ICCNCT 2018. Singapore: Springer; 2019. p. 55–64.

    Chapter  Google Scholar 

  15. Thomas M, Meshram BB. ChSO-DNFNet: Spam detection in Twitter using feature fusion and optimized Deep Neuro Fuzzy Network. Adv Eng Softw. 2023;175: 103333.

    Article  Google Scholar 

  16. Kawintiranon K, Singh L, Budak C. Traditional and context-specific spam detection in low resource settings. Mach Learn. 2022;111(7):2515–36.

    Article  MathSciNet  Google Scholar 

  17. Elakkiya E, Selvakumar S, LeelaVelusamy R. TextSpamDetector: textual content based deep learning framework for social spam detection using conjoint attention mechanism. J Ambient Intell Humaniz Comput. 2021;12:9287–302.

    Article  Google Scholar 

  18. Jain G, Sharma M, Agarwal B. Optimizing semantic LSTM for spam detection. Int J Inf Technol. 2019;11:239–50.

    Google Scholar 

  19. Mubarak H, Abdelali A, Hassan S, Darwish K. Spam detection on arabic twitter. In: Social Informatics: 12th International Conference, SocInfo 2020, Pisa, Italy, October 6–9, 2020, Proceedings 12. Springer International Publishing; 2020. p. 237–251.

  20. Ghanem R, Erbay H. Context-dependent model for spam detection on social networks. SN Applied Sciences. 2020;2:1–8.

    Article  Google Scholar 

Download references

Acknowledgements

The authors acknowledged the Lovely Professional University, Phagwara, Punjab, India and Neil Gogte Institute of Technology, Hyderabad, India for supporting the research work by providing the facilities.

Funding

No funding received for this research.

Author information

Authors and Affiliations

Authors

Contributions

This research endeavor was made possible by the collaboration and contributions of all authors.

Corresponding author

Correspondence to Pinnapureddy Manasa.

Ethics declarations

Conflict of interest

No conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This article is part of the topical collection “Advances in Computational Approaches for Image Processing, Wireless Networks, Cloud Applications and Network Security” guest edited by P. Raviraj, Maode Ma and Roopashree H R.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Manasa, P., Malik, A. & Batra, I. Detection of Twitter Spam Using GLoVe Vocabulary Features, Bidirectional LSTM and Convolution Neural Network. SN COMPUT. SCI. 5, 206 (2024). https://doi.org/10.1007/s42979-023-02518-1

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s42979-023-02518-1

Keywords