Understanding Russian Information Operations Using Unsupervised Multilingual Topic Modeling

Chew, Peter A.; Turnley, Jessica G.

doi:10.1007/978-3-319-60240-0_12

Peter A. Chew¹⁷ &
Jessica G. Turnley¹⁷

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 10354))

Included in the following conference series:

International Conference on Social Computing, Behavioral-Cultural Modeling and Prediction and Behavior Representation in Modeling and Simulation

2045 Accesses
1 Citations

Abstract

What does this or that population think about a given issue? Which topics ‘go viral’ and why? How does disinformation spread? How do populations view issues in light of national ‘master narratives’? These are all questions which automated approaches to analyzing social media promise to help answer.

We have adapted a technique for multilingual topic modeling to look at differences between what is discussed in Russian versus English. This kills several birds with one stone. We turn the data’s multilinguality from an impediment into a leverageable advantage. But most importantly, we play to unsupervised machine learning’s strengths: its ability to detect large-scale trends, anomalies, similarities and differences, in a highly general way.

Applying this approach to different Twitter datasets, we were able to draw out several interesting and non-obvious insights about Russian cyberspace and how it differs from its English counterpart. We show how these insights reveal aspects of how master narratives are instantiated, and how sentiment plays out on a large scale, in Russian discourse relating to NATO.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Duda, R.O., Hart, P.E., Stork, D.G.: Unsupervised learning and clustering. In: Pattern Classification, 2nd edn. Wiley, New York (2001). ISBN: 0-471-05669-3
Google Scholar
Kim, S.-M., Hovy, E.: Determining the sentiment of opinions. In: Proceedings of the 20th International Conference on Computational Linguistics (COLING 2004), pp. 1367–1373 (2004)
Google Scholar
Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up? Sentiment classification using machine learning techniques. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), Philadelphia, July 2002, pp. 79–86 (2002)
Google Scholar
Bader, B.W., Berry, M.W., Browne, M.: Discussion tracking in Enron email using PARAFAC. In: Berry, M.W., Castellanos, M. (eds.) Survey of Text Mining II, pp. 147–163. Springer, London (2008)
Chapter Google Scholar
Chew, P.A.: ‘Linguistics-Lite’ topic extraction from multilingual social media data. In: Agarwal, N., Xu, K., Osgood, N. (eds.) SBP 2015. LNCS, vol. 9021, pp. 276–282. Springer, Cham (2015). doi:10.1007/978-3-319-16268-3_30
Google Scholar
Tsikerdekis, M., Zeadally, S.: Online deception in social media. Commun. ACM 57(9), 72–80 (2014)
Article Google Scholar
Center for Computational Analysis of Social and Organizational Systems: Multilingual Twitter sentiment analysis (2016). http://www.casos.cs.cmu.edu/projects/projects/mltsa.php. Accessed 27 July 2016
Halverson, J., Corman, S., Goodall, H.: Master Narratives of Islamist Extremism. Macmillan, New York (2011)
Book Google Scholar
Chew, P.: Multilingual retrieval and topic modeling using vector-space word alignment. Galisteo Consulting Group, Inc. Technical report GCG002, February 2016. doi:10.13140/RG.2.2.21482.11205
Bouveng, K.: The role of messianism in contemporary Russian identity and statecraft. Durham Theses, Durham University (2010). http://etheses.dur.ac.uk/438

Download references

Author information

Authors and Affiliations

Galisteo Consulting Group, Inc., 4004 Carlisle Blvd NE Suite H, Albuquerque, NM, 87107, USA
Peter A. Chew & Jessica G. Turnley

Authors

Peter A. Chew
View author publications
You can also search for this author in PubMed Google Scholar
Jessica G. Turnley
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Peter A. Chew .

Editor information

Editors and Affiliations

Penn State University, State College, Pennsylvania, USA
Dongwon Lee
University of Saskatchewan, Saskatoon, Saskatchewan, Canada
Yu-Ru Lin
University of Pittsburgh, Pittsburgh, Pennsylvania, USA
Nathaniel Osgood
United States Military Academy, West Point, New York, USA
Robert Thomson

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Chew, P.A., Turnley, J.G. (2017). Understanding Russian Information Operations Using Unsupervised Multilingual Topic Modeling. In: Lee, D., Lin, YR., Osgood, N., Thomson, R. (eds) Social, Cultural, and Behavioral Modeling. SBP-BRiMS 2017. Lecture Notes in Computer Science(), vol 10354. Springer, Cham. https://doi.org/10.1007/978-3-319-60240-0_12

Download citation

DOI: https://doi.org/10.1007/978-3-319-60240-0_12
Published: 15 June 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-60239-4
Online ISBN: 978-3-319-60240-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics