Abstract
What does this or that population think about a given issue? Which topics ‘go viral’ and why? How does disinformation spread? How do populations view issues in light of national ‘master narratives’? These are all questions which automated approaches to analyzing social media promise to help answer.
We have adapted a technique for multilingual topic modeling to look at differences between what is discussed in Russian versus English. This kills several birds with one stone. We turn the data’s multilinguality from an impediment into a leverageable advantage. But most importantly, we play to unsupervised machine learning’s strengths: its ability to detect large-scale trends, anomalies, similarities and differences, in a highly general way.
Applying this approach to different Twitter datasets, we were able to draw out several interesting and non-obvious insights about Russian cyberspace and how it differs from its English counterpart. We show how these insights reveal aspects of how master narratives are instantiated, and how sentiment plays out on a large scale, in Russian discourse relating to NATO.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Duda, R.O., Hart, P.E., Stork, D.G.: Unsupervised learning and clustering. In: Pattern Classification, 2nd edn. Wiley, New York (2001). ISBN: 0-471-05669-3
Kim, S.-M., Hovy, E.: Determining the sentiment of opinions. In: Proceedings of the 20th International Conference on Computational Linguistics (COLING 2004), pp. 1367–1373 (2004)
Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up? Sentiment classification using machine learning techniques. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), Philadelphia, July 2002, pp. 79–86 (2002)
Bader, B.W., Berry, M.W., Browne, M.: Discussion tracking in Enron email using PARAFAC. In: Berry, M.W., Castellanos, M. (eds.) Survey of Text Mining II, pp. 147–163. Springer, London (2008)
Chew, P.A.: ‘Linguistics-Lite’ topic extraction from multilingual social media data. In: Agarwal, N., Xu, K., Osgood, N. (eds.) SBP 2015. LNCS, vol. 9021, pp. 276–282. Springer, Cham (2015). doi:10.1007/978-3-319-16268-3_30
Tsikerdekis, M., Zeadally, S.: Online deception in social media. Commun. ACM 57(9), 72–80 (2014)
Center for Computational Analysis of Social and Organizational Systems: Multilingual Twitter sentiment analysis (2016). http://www.casos.cs.cmu.edu/projects/projects/mltsa.php. Accessed 27 July 2016
Halverson, J., Corman, S., Goodall, H.: Master Narratives of Islamist Extremism. Macmillan, New York (2011)
Chew, P.: Multilingual retrieval and topic modeling using vector-space word alignment. Galisteo Consulting Group, Inc. Technical report GCG002, February 2016. doi:10.13140/RG.2.2.21482.11205
Bouveng, K.: The role of messianism in contemporary Russian identity and statecraft. Durham Theses, Durham University (2010). http://etheses.dur.ac.uk/438
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Chew, P.A., Turnley, J.G. (2017). Understanding Russian Information Operations Using Unsupervised Multilingual Topic Modeling. In: Lee, D., Lin, YR., Osgood, N., Thomson, R. (eds) Social, Cultural, and Behavioral Modeling. SBP-BRiMS 2017. Lecture Notes in Computer Science(), vol 10354. Springer, Cham. https://doi.org/10.1007/978-3-319-60240-0_12
Download citation
DOI: https://doi.org/10.1007/978-3-319-60240-0_12
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-60239-4
Online ISBN: 978-3-319-60240-0
eBook Packages: Computer ScienceComputer Science (R0)