Abstract
Agent-based simulation models are an important methodology for explaining social behavior and forecasting social change. However, a major drawback to using such models is that they are difficult to instantiate for specific cases and so are rarely reused. We describe a text-mining network analytic approach for rapidly instantiating a model for predicting the tendency toward revolution and violence based on social and cultural characteristics of a large collection of actors. We illustrate our approach using an agent-based dynamic network framework, Construct, and newspaper data for the 16 countries associated with the Arab Spring. We assess the overall accuracy of the base model across independent runs for 20 different months during the Arab Spring, observing that although predictions led to several false positives, the model is able to predict revolution before it occurs in three of the four nations in which the government was successfully overthrown.
Similar content being viewed by others
Notes
Code used for this article, as well as a sample of the data, can be found at https://github.com/kennyjoseph/arabspring.
We will hereafter refer to a set of articles as the collection of all articles within a single month.
E.g. the FuzzyWuzzy python module, https://github.com/seatgeek/fuzzywuzzy, used by companies like StubHub.
Trivially, see http://brenocon.com/blog/2012/04/f-scores-dice-and-jaccard-set-similarity/ for a derivation.
So, for example, the intra-country range for Egypt’s change in violence belief in January of 2011 would be computed as the IQR of Egypt’s change in violence belief from all runs covering January of 2011 or November or December of 2010.
All replications for a single month were run in parallel using 8 cores of a 60 core machine with a 250 GB SSD drive and 120 GB of RAM.
References
Buskens V, Yamaguchi K (2002) A new model for information diffusion in heterogeneous social networks. Soc Methodol 29(1):281–325
Campante FR, Chor D (2012) Why was the Arab world poised for revolution? schooling, economic opportunities, and the Arab spring. J Econ Perspect 26(2):167–187
Carley K (1990) Coordinating for Success: Trading Information Redundancy for Task Simplicity. Technical Report, Institute for Software Research, Carnegie Mellon University
Carley K (1991) A theory of group stability. Am Soc Rev 56(3):331–354
Carley KM (2002) Smart agents and organizations of the future. In: The handbook of new media, vol 12, pp 206–220
Carley KM, Martin MK, Hirshman BR (2009) The etiology of social change. Topics Cogn Sci 1(4):621–650
Carley KM, Filonuk DT, Joseph K, Kowalchuck M, Lanham MJ, Morgan GP (2012) Construct User Guide. Technical Report, Institute for Software Research, Carnegie Mellon University
Carley KM, Pfeffer J, Reminga J, Storrick J, Columbus D (2012) ORA User’s Guide 2012. DTIC Document
Carley KM, Morgan G, Lanham M, Pfeffer J (2012c) Multi-modeling and socio-cultural complexity: reuse and validation. Adv Design Cross Cult Act 2:128
Carpineto C, Romano G (2012) A survey of automatic query expansion in information retrieval. ACM Comput Surv 44(1):1:1–1:50
Centola D, Michael M (2007) Complex contagions and the weakness of long ties. Am J Soc 113(3):702–734
Diesner J, Carley KM (2008) Conditional random fields for entity extraction and ontological text coding. Comput Math Organ Theory 14(3):248–262
Diesner J, Frantz TL, Carley KM (2005) Communication networks from the Enron email corpus ‘it’s always about the people. Enron is no different. Comput Math Organ Theory 11(3):201–228
Eisenstein J, O’Connor B, Smith NA, Xing EP (2010) A latent variable model for geographic lexical variation. In: Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, EMNLP’10. Association for Computational Linguistics. Stroudsburg, PA, USA, pp 1277–1287
Epstein JM, Axtell R (1996) Growing artificial societies: social science from the bottom up. Brookings Institution Press, Washington, D.C.
Finkel JR, Grenager T, Manning C (2005) Incorporating non-local information into information extraction systems by Gibbs sampling. In: Proceedings of the 43rd annual meeting on association for computational linguistics, ACL’05. Association for Computational Linguistics, Stroudsburg, pp 363–370
Fraley C, Raftery AE, Murphy TB, Scrucca L (2012) Mclust Version 4 for R: Normal Mixture Modeling for Model-Based Clustering, Classification, and Density Estimation. Technical Report
Gilbert N (2007) Agent-based models. Sage Publications Inc, Thousand Oaks
Greenwald AG, Banaji MR (1995) Implicit social cognition: attitudes, self-esteem, and stereotypes. Psychol Rev 102(1):4–27
Hamdy N, Gomaa EH (2012) Framing the Egyptian uprising in Arabic language newspapers and social media. J Commun 62 (2)
Hilton JL, von Hippel W (1996) Stereotypes. Annu Rev Psychol 47(1):237–271
Hyndman RJ, Fan Y (1996) Sample quantiles in statistical packages. Am Stat 50(4):361–365
Joseph K, Morgan GP, Carley KM (2014) On the coevolution of stereotype, culture and social relationships. Soc Sci Comput Rev
Kossinets G, Watts DJ (2009) Origins of homophily in an evolving social network 1. Am J Soc 115(2):405–450
Krivitsky PN, Handcock MS, Raftery AE, Hoff PD (2009) Representing degree distributions, clustering, and homophily in social networks with latent cluster random effects models. Soc Netw 31(3):204–213
Lanham MJ, Morgan GP, Carley KM (2014) Social network modeling and agent-based simulation in support of crisis De-escalation. IEEE Trans Human Mach Syst
Lazarsfeld PF, Merton RK (1954) Friendship as a social process: a substantive and methodological analysis. In: Freedom and Control in Modern Society. Van Nostrand, pp 18–66
Li Y, Duan H, Zhai CX (2012) A generalized hidden Markov model with discriminative training for query spelling correction. In: Proceedings of the 35th international ACM SIGIR conference on research and development in information retrieval, pp 611–620
Licoppe C (2004) Connected presence: the emergence of a new repertoire for managing social relationships in a changing communication technoscape. Environ Plan D Soc Space 22:135–156
Martin MK, Pfeffer J, Carley KM (2013) Network text analysis of conceptual overlap in interviews, newspaper articles and keywords. Soc Netw Anal Min, pp 1–13
McPherson JM, Ranger-Moore JR (1991) Evolution on a dancing landscape: organizations and networks in dynamic Blau space. Soc Forces 70(1):19–42
McPherson M, Lovin L, Cook J (2001) Birds of a feather: homophily in social networks. Annu Rev Soc 1:415–444
Mead GH (1925) The genesis of the self and social control. Int J Ethics 35(3):251–277
Pang B, Lee L (2008) Opinion mining and sentiment analysis. Found Trends Inf Retr 2(1–2):1–135
Papacharissi Z, de Fatima Oliveira M (2012) Affective news and networked publics: the rhythms of news storytelling on #Egypt. J Commun 62(2):266–282
Pfeffer J, Carley KM (2012) Rapid modeling and analyzing networks extracted from pre-structured news articles. Comput Math Organ Theory 18(3):280–299
Pfeffer J, Carley KM (2013) The importance of local clusters for the diffusion of opinions and beliefs in interpersonal communication networks. Int J Innov Technol Manag 10(5)
Raina R, Ng AY, Koller D (2006) Constructing informative priors using transfer learning. In: Proceedings of the 23rd international conference on machine learning, pp 713–720
Rogers EM (2003) Diffusion of Innovations, 5th edn. Simon and Schuster, New York
R Core Team (2012) R: a language and environment for statistical computing. Vienna, Austria. http://www.R-project.org/
Schreiber C, Carley KM (2012) Validating agent interactions in construct against empirical communication networks using the calibrated grounding technique. IEEE Trans Syst Man Cybern Part A Syst Humans 99:1–9
Tajfel H, Turner JC (1979) An integrative theory of intergroup conflict. In: Austin W Worche S (eds) The social psychology of intergroup relations. Brooks/Cole, Monterey, pp 33–47
Wasserman L (2003) All of statistics: a concise course in statistical inference. Springer, Berlin
Wegner DM (1995) A computer network model of human transactive memory. Soc Cognit 13(3):319–339
Acknowledgments
This work was supported in part by the Air Force Office of Sponsored Research, FA9550-11-1-0179 and the Office of Naval Research through a Minerva N000141310835. Additional support was provided by the center for Computational Analysis of Social and Organizational Systems and the Institute for Software Research at Carnegie Mellon University. The views and conclusions contained in this document are those of the authors and should not be interpreted as representing the official policies, either expressed or implied, of the National Science Foundation, or the U.S. government.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Joseph, K., Carley, K.M., Filonuk, D. et al. Arab Spring: from newspaper. Soc. Netw. Anal. Min. 4, 177 (2014). https://doi.org/10.1007/s13278-014-0177-5
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s13278-014-0177-5