Skip to main content

Uncovering Discussion Groups on Claims of Election Fraud from Twitter

  • Conference paper
  • First Online:
Social Informatics (SocInfo 2022)

Abstract

Twitter was widely used during the 2020 U.S. election to disseminate claims of election fraud. As a result, a number of works have examined this phenomenon from a variety of perspectives. However, none of them focus on analyzing topics behind the general fraud claims and associating them with user communities. To fill this gap, we propose to uncover and characterize groups of Twitter users engaging in discussions about election fraud claims during the 2020 U.S. election using a large dataset that spans seven weeks during this period. To accomplish this, we model a sequence of co-retweet networks and employ a backbone extraction method that controls for inherent traits of social media applications, particularly, user activity levels and the popularity of tweets (which together generate many spurious edges in the network), thus allowing us to reveal topics of tweets that lead users to retweet them. After extracting the backbones, we identify user groups representative of the communities present in the network backbones and finally analyze the topics behind the retweeted tweets to understand how they contributed to the spread of fraud claims at that time. Our main results show that (i) our approach uncovers better-structured communities than the original network in terms of users spreading discussions about fraud; and (ii) these users discuss 25 topics with specific psycholinguistic and temporal characteristics.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://www.nytimes.com/2021/01/11/technology/twitter-removes-70000-qanon-accounts.html.

  2. 2.

    https://www.bbc.com/news/technology-55638558.

  3. 3.

    https://voterfraud2020.io/.

  4. 4.

    The election week began on November \(3^{rd}\) 2020.

  5. 5.

    We consider a week from Sunday to Saturday.

  6. 6.

    https://huggingface.co/sentence-transformers/all-mpnet-base-v2.

  7. 7.

    We have disregarded communities with less than 100 users.

  8. 8.

    https://maartengr.github.io/BERTopic/getting_started/parameter%20tuning/parametertuning.html.

  9. 9.

    We have noted the presence of some fake news according to the Fact Check Agencies. However, we did not analyze this aspect because it is beyond the scope of this paper.

  10. 10.

    https://www.cbc.ca/news/world/trump-supporters-washington-march-1.5802409.

  11. 11.

    https://www.theguardian.com/us-news/2021/dec/28/donald-trump-georgia -2020-election-dead-people.

  12. 12.

    https://www.bbc.com/news/election-us-2020-54874120.

  13. 13.

    https://www.oann.com/.

  14. 14.

    The z-score normalization is defined by \(z = (x - \mu )/\sigma \).

References

  1. Abilov, A., Hua, Y., Matatov, H., Amir, O., Naaman, M.: Voterfraud 2020: a multi-modal dataset of election fraud claims on twitter. In: Proceedings of the International AAAI Conference on Web and Social Media, vol. 15, pp. 901–912 (2021)

    Google Scholar 

  2. Altamirano, M., et al.: Unsupervised characterization of lessons according to temporal patterns of teacher talk via topic modeling. Neurocomputing 484, 211–222 (2022)

    Article  Google Scholar 

  3. Blondel, V.D., Guillaume, J.L., Lambiotte, R., Lefebvre, E.: Fast unfolding of communities in large networks. J. Stat. Mech: Theory Exp. 2008(10), P10008 (2008)

    Article  Google Scholar 

  4. Cann, T.J., Weaver, I.S., Williams, H.T.: Ideological biases in social sharing of online information about climate change. PLoS ONE 16(4), e0250656 (2021)

    Article  Google Scholar 

  5. Chaudhry, H.N., Javed, Y., Kulsoom, F., Mehmood, Z., Khan, Z.I., Shoaib, U., Janjua, S.H.: Sentiment analysis of before and after elections: Twitter data of us election 2020. Electronics 10(17), 2082 (2021)

    Article  Google Scholar 

  6. Childs, M.C., Buntain, C., Trujillo, M.Z., Horne, B.D.: Characterizing Youtube and Bitchute content and mobilizers during us election fraud discussions on twitter. In: ACM Conference on Web Science (2022)

    Google Scholar 

  7. Chowdhury, F.A., Saha, D., Hasan, M.R., Saha, K., Mueen, A.: Examining factors associated with twitter account suspension following the 2020 us presidential election. arXiv preprint arXiv:2101.09575 (2021)

  8. Coscia, M., Neffke, F.M.: Network backboning with noisy data. In: 2017 IEEE 33rd International Conference on Data Engineering (ICDE), pp. 425–436. IEEE (2017)

    Google Scholar 

  9. Dai, Y.: Using 2020 U.S. Presidential election to study patterns of user influence, community formation and behaviors on twitter. Ph.D. thesis, The Pennsylvania State University (2021)

    Google Scholar 

  10. Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). Association for Computational Linguistics, Minneapolis, Minnesota, June 2019

    Google Scholar 

  11. Ebeling, R., Sáenz, C.A.C., Nobre, J., Becker, K.: The effect of political polarization on social distance stances in the brazilian covid-19 scenario. J. Inf. Data Manage. 12(1) (2021)

    Google Scholar 

  12. Ebeling, R., Sáenz, C.A.C., Nobre, J.C., Becker, K.: Analysis of the influence of political polarization in the vaccination stance: the Brazilian Covid-19 scenario. In: Proceedings of the International AAAI Conference on Web and Social Media, vol. 16, pp. 159–170 (2022)

    Google Scholar 

  13. Egger, R., Yu, J.: A topic modeling comparison between LDA, NMF, Top2Vec, and BERTopic to demystify twitter posts. Front. Soc. 7 (2022)

    Google Scholar 

  14. Ferrara, E., Chang, H., Chen, E., Muric, G., Patel, J.: Characterizing social media manipulation in the 2020 us presidential election. First Monday (2020)

    Google Scholar 

  15. Ferreira, C.H.G., Ferreira, F.M., de Sousa Matos, B., de Almeida, J.M.: Modeling dynamic ideological behavior in political networks. J. Web Sci. 7 (2019)

    Google Scholar 

  16. Ferreira, C.H., et al.: On the dynamics of political discussions on Instagram: a network perspective. Online So. Netw. Media 25, 100155 (2021)

    Article  Google Scholar 

  17. Finn, S., Mustafaraj, E., Metaxas, P.T.: The co-retweeted network and its applications for measuring the perceived political polarization (2014)

    Google Scholar 

  18. Freelon, D., Lokot, T.: Russian twitter disinformation campaigns reach across the American political spectrum. Misinformation Rev. (2020)

    Google Scholar 

  19. Fudolig, M.I., Alshaabi, T., Arnold, M.V., Danforth, C.M., Dodds, P.S.: Sentiment and structure in word co-occurrence networks on twitter. Appl. Netw. Sci. 7(1), 1–27 (2022)

    Article  Google Scholar 

  20. Gallagher, R.J., Reagan, A.J., Danforth, C.M., Dodds, P.S.: Divergent discourse between protests and counter-protests:# blacklivesmatter and# alllivesmatter. PLoS ONE 13(4), e0195644 (2018)

    Article  Google Scholar 

  21. Gomes Ferreira, C.H., et al.: Unveiling community dynamics on Instagram political network. In: 12th ACM Conference on Web Science, pp. 231–240 (2020)

    Google Scholar 

  22. Gomes Ferreira, C.H., de Sousa Matos, B., Almeira, J.M.: Analyzing dynamic ideological communities in congressional voting networks. In: Staab, S., Koltsova, O., Ignatov, D.I. (eds.) SocInfo 2018. LNCS, vol. 11185, pp. 257–273. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01129-1_16

    Chapter  Google Scholar 

  23. Grimminger, L., Klinger, R.: Hate towards the political opponent: a twitter corpus study of the 2020 US elections on the basis of offensive speech and stance detection. In: Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis (2021)

    Google Scholar 

  24. Grootendorst, M.: Bertopic: Leveraging Bert and C-TF-IDF to create easily interpretable topics. (2020). https://doi.org/10.5281/zenodo.4381785

  25. Grootendorst, M.: Bertopic: Neural topic modeling with a class-based TF-IDF procedure. arXiv preprint arXiv:2203.05794 (2022)

  26. Guerrero-Solé, F.: Community detection in political discussions on twitter: an application of the retweet overlap network method to the Catalan process toward independence. Soc. Sci. Comput. Rev. 35(2), 244–261 (2017)

    Article  Google Scholar 

  27. Malagoli, L.G., Stancioli, J., Ferreira, C.H., Vasconcelos, M., Couto da Silva, A.P., Almeida, J.M.: A look into Covid-19 vaccination debate on twitter. In: 13th ACM Web Science Conference 2021, pp. 225–233 (2021)

    Google Scholar 

  28. McInnes, L., Healy, J., Saul, N., Großberger, L.: Umap: uniform manifold approximation and projection. J. Open Source Softw. 3(29), 861 (2018)

    Article  Google Scholar 

  29. Metaxas, P., Mustafaraj, E., Wong, K., Zeng, L., O’Keefe, M., Finn, S.: What do retweets indicate? results from user survey and meta-review of research. In: Proceedings of the International AAAI Conference on Web and Social Media (2015)

    Google Scholar 

  30. Newman, M.E., Girvan, M.: Finding and evaluating community structure in networks. Phys. Rev. E 69, 026113 (2004)

    Article  Google Scholar 

  31. Nobre, G.P., Ferreira, C.H.G., Almeida, J.M.: Beyond groups: uncovering dynamic communities on the WhatsApp network of information dissemination. In: Aref, S., et al. (eds.) SocInfo 2020. LNCS, vol. 12467, pp. 252–266. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-60975-7_19

    Chapter  Google Scholar 

  32. Nobre, G.P., Ferreira, C.H., Almeida, J.M.: A hierarchical network-oriented analysis of user participation in misinformation spread on Whatsapp. Inf. Process. Manage. 59(1), 102757 (2022)

    Article  Google Scholar 

  33. Pacheco, D., Hui, P.M., Torres-Lugo, C., Truong, B.T., Flammini, A., Menczer, F.: Uncovering coordinated networks on social media: methods and case studies. Int. AAAI Conf. Web Soc. Media 21, 455–466 (2021)

    Google Scholar 

  34. Pennebaker, J.W., Boyd, R.L., Jordan, K., Blackburn, K.: The development and psychometric properties of liwc2015. Technical Report (2015)

    Google Scholar 

  35. Rahimi, A., Cohn, T., Baldwin, T.: Twitter user geolocation using a unified text and network prediction model. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics (2015)

    Google Scholar 

  36. Reimers, N., Gurevych, I.: Sentence-bert: Sentence embeddings using siamese bert-networks. In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pp. 3982–3992 (2019)

    Google Scholar 

  37. Resende, G., Melo, P., CS Reis, J., Vasconcelos, M., Almeida, J.M., Benevenuto, F.: Analyzing textual (mis) information shared in Whatsapp groups. In: Proceedings of the 10th ACM Conference on Web Science, pp. 225–234 (2019)

    Google Scholar 

  38. Sharma, K., Ferrara, E., Liu, Y.: Characterizing online engagement with disinformation and conspiracies in the 2020 U.S. presidential election. In: International Conference on Web and Social Media (2022)

    Google Scholar 

  39. Tausczik, Y.R., Pennebaker, J.W.: The psychological meaning of words: LIWC and computerized text analysis methods. J. Lang. Soc. Psychol. 29(1), 24–54 (2010)

    Article  Google Scholar 

  40. Tran, H.D.: Studying the community of trump supporters on twitter during the 2020 us presidential election via hashtags# maga and# trump2020. Journalism Media 2(4), 709–731 (2021)

    Article  Google Scholar 

  41. Xia, E., Yue, H., Liu, H.: Tweet sentiment analysis of the 2020 us presidential election. In: Proceedings of the Web Conference 2021, pp. 367–371 (2021)

    Google Scholar 

  42. Yitzhaki, S.: Relative deprivation and the Gini coefficient. Q. J. Econ. 321–324 (1979)

    Google Scholar 

  43. Zannettou, S.: I won the election!: An empirical analysis of soft moderation interventions on twitter. In: International AAAI Conference on Web and Social Media, pp. 865–876 (2021)

    Google Scholar 

Download references

Acknowledgments

We thank the Voter Fraud Project and Abilov et al. [1] for providing the data to support our work.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jose Martins da Rosa Jr .

Editor information

Editors and Affiliations

Appendix I

Appendix I

Table 5. Keywords, description and external links related to the extracted topics.

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

da Rosa, J.M., Linhares, R.S., Ferreira, C.H.G., Nobre, G.P., Murai, F., Almeida, J.M. (2022). Uncovering Discussion Groups on Claims of Election Fraud from Twitter. In: Hopfgartner, F., Jaidka, K., Mayr, P., Jose, J., Breitsohl, J. (eds) Social Informatics. SocInfo 2022. Lecture Notes in Computer Science, vol 13618. Springer, Cham. https://doi.org/10.1007/978-3-031-19097-1_20

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-19097-1_20

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-19096-4

  • Online ISBN: 978-3-031-19097-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics