Analyzing Twitter networks using graph embeddings: an application to the British case

Won, Miguel; Fernandes, Jorge M.

doi:10.1007/s42001-021-00128-6

Research Article
Published: 06 June 2021

Volume 5, pages 253–263, (2022)

Journal of Computational Social Science

Abstract

Embeddings have gained traction in the social sciences in recent years. Existing work focuses on text-as-data to estimate word embeddings. In this paper, we turn to graph embeddings, a tool whose use has been overlooked in the analysis of social networks. Graph embeddings have two primary uses. First, they encode users and their interactions into a single vector. Second, they can serve as inputs for machine-learning classifiers. We use British political Twitter to illustrate both uses. We encode users' partisanship. Furthermore, we use a support vector machine (SVM) and a neural network (NN) to estimate the partisan proximity of Twitter users. Results suggest that graph embeddings yield high-precision predictions.

Notes

There are several legislators whose Twitter activity puts them below this threshold. This decreases our number of legislators to 561.
https://github.com/VHRanger/graph2vec.
Results are robust to changes in the number of dimensions (20, 30, 40, 50, 100, and 150). For computational parsimony, we use ten dimensions. Results using different dimensions are available from the authors.
We acknowledge that legislators tweet about other topics as well. However, their utterances tend to be dominated by political matters.
We define outliers as users whose Twitter bio identifies them as Labour or Liberal Democrat but whose embedding, based on their Twitter activity, places them in the vicinity of the Conservatives. To be sure, there are outliers in other areas of the graph. However, inspection of the graph suggests that the highlighted area has a particularly high number of outliers.
Results and technical notes are available from the authors.
N outliers: 103; N Conservative: 246; N tweets: 3003.
Tables S2–S5 in the online appendix show results per party.
We use the ‘predict_proba’ method of the SVC class in scikit-learn.
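
The last note mentions predict_proba on scikit-learn's SVC, applied to ten-dimensional embeddings. A minimal sketch of that step might look as follows. It is not the authors' code: the embeddings are synthetic Gaussian clusters standing in for real user vectors, and the party labels, cluster centres, kernel, and sample sizes are assumptions chosen only for illustration.

```python
import numpy as np
from sklearn.svm import SVC

# Synthetic stand-in for 10-dimensional user embeddings (NOT the paper's data).
rng = np.random.default_rng(0)
n_per_party, n_dims = 60, 10
parties = ["Conservative", "Labour", "Liberal Democrat"]  # illustrative labels

# One Gaussian cluster per party, each centred at a different point.
centres = rng.normal(scale=3.0, size=(len(parties), n_dims))
X = np.vstack([c + rng.normal(size=(n_per_party, n_dims)) for c in centres])
y = np.repeat(parties, n_per_party)

# probability=True is required for predict_proba; scikit-learn then fits
# a probability calibration on top of the SVM decision values.
clf = SVC(kernel="rbf", probability=True, random_state=0)
clf.fit(X, y)

# Hypothetical unseen users, one drawn near each party's centre.
new_users = centres + rng.normal(size=(len(parties), n_dims))
proba = clf.predict_proba(new_users)

# Columns follow clf.classes_ (sorted labels); each row sums to 1.
for row in proba:
    print(dict(zip(clf.classes_, row.round(3))))
```

Each row can be read as an estimated partisan proximity of one user to each party. Because the probabilities come from a calibration step, the most probable class may occasionally differ from what clf.predict returns.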

Acknowledgements

The authors would like to thank Pablo Barberá, Michael Laver, Christopher Cochrane, and Carsten Schwemmer for helpful comments on early drafts. The usual disclaimer applies.

Author information

Authors and Affiliations

INESC-RD, Av. Alves Redol, 9, Lisboa, Portugal
Miguel Won
Institute of Social Sciences, University of Lisbon, Av. Prof. Aníbal Bettencourt, 9, Lisboa, Portugal
Jorge M. Fernandes

Corresponding author

Correspondence to Jorge M. Fernandes.

Ethics declarations

Conflict of interest

On behalf of all authors, the corresponding author states that there is no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary material 1 (PDF 44 KB)

About this article

Cite this article

Won, M., Fernandes, J.M. Analyzing Twitter networks using graph embeddings: an application to the British case. J Comput Soc Sc 5, 253–263 (2022). https://doi.org/10.1007/s42001-021-00128-6

Received: 16 January 2021
Accepted: 21 May 2021
Published: 06 June 2021
Issue Date: May 2022
DOI: https://doi.org/10.1007/s42001-021-00128-6
