Skip to main content

Advertisement

Log in

Tweet and user validation with supervised feature ranking and rumor classification

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Filtering fake news from social network posts and detecting social network users who are responsible for generating and propagating these rumors have become two major issues with the increased popularity of social networking platforms. As any user can post anything on social media and that post can instantly propagate to all over the world, it is important to recognize if the post is rumor or not. Twitter is one of the most popular social networking platforms used for news broadcasting mostly as tweets and retweets. Hence, validating tweets and users based on their posts and behavior on Twitter has become a social, political and international issue. In this paper, we proposed a method to classify rumor and non-rumor tweets by applying a novel tweet and user feature ranking approach with Decision Tree and Logistic Regression that were applied on both tweet and user features extracted from a benchmark rumor dataset ‘PHEME’. The effect of the ranking model was then shown by classifying the dataset with the ranked features and comparing them with the basic classifications with various combination of features. Both supervised classification algorithms (namely, Support Vector Machine, Naïve Bayes, Random Forest and Logistic Regression) and deep learning algorithms (namely, Convolutional Neural Network and Long Short-Term Memory) were used for rumor detection. The classification accuracy showed that the feature ranking classification results were comparable to the original classification performances. The ranking models were also used to list the topmost tweets and users with different conditions and the results showed that even if the features were ranked differently by LR and RF, the topmost results for tweets and users for both rumors and non-rumors were the same.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

References

  1. Ahsan M, Kumari M, Sharma TP (2019) Detection of context-varying rumors on Twitter through deep learning. Int J Adv Sci Technol 128:45–58

    Article  Google Scholar 

  2. Allport GW, Postman LJ (1945) Section of psychology: The basic psychology of rumor. Transactions of the New York Academy of Sciences 8(2 Series II):61–81

    Article  Google Scholar 

  3. Arya A, Hine M, Khataei A (2018) User trust graph: A model to measure trustworthiness. In: Proceedings of the Personalization in Persuasive Technology Workshop, Persuasive Technology 2018, Waterloo, Canada (ir.library.carleton.ca)

  4. Castillo C, Mendoza M, Poblete B (2011) Information credibility on twitter. In: Proceedings of the 20th international conference on World wide web. ACM, pp 675–684

  5. Cresci S, Lillo F, Regoli D, Tardelli S, Tesconi M (2019) Cashtag piggybacking: Uncovering spam and bot activity in stock microblogs on twitter. ACM Trans Web (TWEB) 13(2):1–27

    Article  Google Scholar 

  6. Dayani R, Chhabra N, Kadian T, Kaushal R (2015) Rumor detection in twitter: An analysis in retrospect. In: 2015 IEEE international conference on advanced networks and telecommuncations systems (ANTS). IEEE, pp 1–3

  7. Gupta A, Kumaraguru P (2012) Credibility ranking of tweets during high impact events. In: Proceedings of the 1st workshop on privacy and security in online social media. ACM, p 2

  8. Hamidian S, Diab M (2016) Rumor identification and belief investigation on twitter. In: Proceedings of the 7th workshop on computational approaches to subjectivity, sentiment and social media analysis, pp 3–8

  9. Hamidian S, Diab MT (2019) Rumor detection and classification for twitter data. arXiv preprint arXiv:1912.08926

  10. Hassan NY, Gomaa WH, Khoriba GA, Haggag MH (2018) Supervised learning approach for twitter credibility detection. In: 2018 13th international conference on computer engineering and systems (ICCES). IEEE, pp 196–201

  11. He W, Xu G, Kim Y, Dwivedi R, Zhang J, Jeong SR (2016) Competitive intelligence in social media Twitter: iPhone 6 vs. Galaxy S5. Online Information Review

  12. Hernandez E, Wunsch DC (2018) Graphical trust models for agent-based systems. IEEE Potentials 37(5):25–33

    Article  Google Scholar 

  13. Knapp RH (1994) A psychology of rumor. Public Opinion Quarterly 8(1):22–37

    Article  Google Scholar 

  14. Kochkina E, Liakata M, Zubiaga A PHEME dataset for Rumour Detection and Veracity Classification. figshare, 10-Jun-2018 [Online]. [Online]. Available: https://figshare.com/articles/PHEME-dataset-for-Rumour-Detection-and-Veracity-Classification/6392078/1

  15. Kochkina E, Liakata M, Zubiaga A (2018) All-in-one: Multi-task learning for rumour verification. arXiv preprint arXiv:1806.03713

  16. Kostka J, Oswald YA, Wattenhofer R (2008) Word of mouth: Rumor dissemination in social networks. In: International colloquium on structural information and communication complexity. Springer, Berlin, Heidelberg, pp 185–196

  17. Li P, Zhao W, Yang J (2018) CoRank: A coupled dual networks approach to trust evaluation on Twitter. In: International conference on Web information systems engineering. Springer, Cham, pp 145–160

  18. Li P, Zhao W, Yang J, Wu J (2019) Cotrrank: trust evaluation of users and tweets. In: Proceedings of the 28th international joint conference on artificial intelligence. AAAI Press, pp 6542–6544

  19. Mondal T, Roy T, Bhattacharya I, Bhattacharya S, Das I (2019) A study on rumor propagation trends and features in a post disaster situation. In: Proceedings of the 20th international conference on distributed computing and networking, pp 381–384

  20. Oberlo. 10 Twitter Statistics Every Marketer Should Know in 2020 [Infographic]. [Online]. Available: https://www.oberlo.ca/blog/twitter-statistics/

  21. Page L, Brin S, Motwani R, Winograd T (1999) The PageRank citation ranking: Bringing order to the Web. Stanford InfoLab

  22. Park S, Chihyung B, Ok M, Chae BK (2016) Using Twitter data for cruise tourism marketing and research. Journal of Travel & Tourism Marketing, pp 885–898

  23. Pathak AR, Mahajan A, Singh K, Patil A, Nair A (2020) Analysis of techniques for rumor detection in social media. Procedia Comput Sci 167:2286–2296

    Article  Google Scholar 

  24. Peterson WA, Gist NP (1951) Rumor and public opinion. American Journal of Sociology 57(2):159–167

    Article  Google Scholar 

  25. Pitsilis GK, El-Hadedy M (2016) Posting with credibility in Micro-blogging systems using Digital Signatures and Watermarks: A case study on Twitter. arXiv preprint arXiv:1612.09480

  26. Rath B, Gao W, Ma J, Srivastava J (2017) From retweet to believability: Utilizing trust to identify rumor spreaders on Twitter. In: Proceedings of the 2017 IEEE/ACM international conference on advances in social networks analysis and mining. ACM, pp 179–186

  27. Ravikumar S, Balakrishnan R, Kambhampati S (2012) Ranking tweets considering trust and relevance. In: Proceedings of the Ninth international workshop on information integration on the Web. ACM, p 4

  28. Ravikumar S, Talamadupula K, Balakrishnan R, Kambhampati S (2013) RAProp: ranking tweets by exploiting the tweet/user/web ecosystem and inter-tweet agreement. In: Proceedings of the 22nd ACM international conference on information & knowledge management. ACM, pp 2345–2350

  29. Ross J, Thirunarayan K (2016) Features for ranking tweets based on credibility and newsworthiness. In: 2016 international conference on collaboration technologies and systems (CTS). IEEE, pp 18–25

  30. Sailunaz K, Alhajj R (2019) Emotion and sentiment analysis from Twitter text. J Comput Sci 36:101003

    Article  Google Scholar 

  31. Sicilia R, Giudice SL, Pei Y, Pechenizkiy M, Soda P (2018) Twitter rumour detection in the health domain. Expert Systems with Applications 110:33–40

    Article  Google Scholar 

  32. Singh JP, Kumar A, Rana NP, Dwivedi YK (2020) Attention-based LSTM network for rumor veracity estimation of tweets. Information Systems Frontiers, pp 1–16

  33. Statista. Number of social network users worldwide from 2010 to 2023(in billions). [Online]. Available: https://www.statista.com/statistics/278414/number-of-worldwide-social-network-users/

  34. Takahashi T, Igata N (2012) Rumor detection on twitter. In: The 6th international conference on soft computing and intelligent systems, and The 13th international symposium on advanced intelligence systems. IEEE, pp 452–457

  35. Thakur HK, Gupta A, Bhardwaj A, Verma D (2018) Rumor detection on twitter using a supervised machine learning framework. International Journal of Information Retrieval Research (IJIRR) 8(3):1–13

    Article  Google Scholar 

  36. Twitter, Inc. Twitter. [Online].Available: https://www.twitter.com/

  37. Veyseh APB, Thai MT, Nguyen TH, Dou D (2019) Rumor detection in social networks via deep contextual modeling. In: Proceedings of the 2019 IEEE/ACM international conference on advances in social networks analysis and mining, pp 113–120

  38. Waskale PM, Jain P (2019) A review rumors detection on twitter using machine learning techniques. International Journal of Scientific Research & Engineering Trends 5(3)

  39. Weng J, Lim E, Jiang J, He Q (2010) Twitterrank: finding topic-sensitive influential twitterers. In: Proceedings of the third ACM international conference on Web search and data mining, pp 261–270

  40. Zamani S, Asadpour M, Moazzami D (2017) Rumor detection for Persian Tweets. In: 2017 Iranian conference on electrical engineering(ICEE). IEEE, pp 1532–1536

  41. Zhang Q, Zhang S, Dong J, Xiong J, Cheng X (2015) Automatic detection of rumor on social network. In: Natural language processing and chinese computing. Springer, Cham, pp 113–122

  42. Zubiaga A, Hoi GWS, Liakata M, Procter R PHEME dataset of rumours and non-rumours. figshare, 24-Oct-2016 [Online]. [online]. Available: https://figshare.com/articles/PHEME-dataset-of-rumours-and-non-rumours/4010619/1

  43. Zubiaga A, Liakata M, Procter R (2016) Learning reporting dynamics during breaking news for rumour detection in social media. arXiv preprint arXiv:1610.07363

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Reda Alhajj.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Sailunaz, K., Kawash, J. & Alhajj, R. Tweet and user validation with supervised feature ranking and rumor classification. Multimed Tools Appl 81, 31907–31927 (2022). https://doi.org/10.1007/s11042-022-12616-6

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-022-12616-6

Keywords

Navigation