ABSTRACT
We describe the EMBERS AutoGSR system that conducts automated coding of civil unrest events from news articles published in multiple languages. The nuts and bolts of the AutoGSR system constitute an ecosystem of filtering, ranking, and recommendation models to determine if an article reports a civil unrest event and, if so, proceed to identify and encode specific characteristics of the civil unrest event such as the when, where, who, and why of the protest. AutoGSR is a deployed system for the past 6 months continually processing data 24x7 in languages such as Spanish, Portuguese, English and encoding civil unrest events in 10 countries of Latin America: Argentina, Brazil, Chile, Colombia, Ecuador, El Salvador, Mexico, Paraguay, Uruguay, and Venezuela. We demonstrate the superiority of AutoGSR over both manual approaches and other state-of-the-art encoding systems for civil unrest.
- E. Boschee, P. Natarajan, and R. Weischedel. Automatic extraction of events from open source text for predictive forecasting. In Handbook of Computational Approaches to Counterterrorism, pages 51--67. Springer, 2013.Google ScholarCross Ref
- C. Cortes and V. Vapnik. Support-vector networks. Mach. Learn., 20(3):273--297, Sept. 1995. Google ScholarDigital Library
- F. Hogenboom, F. Frasincar, U. Kaymak, and F. De Jong. An overview of event extraction from text. In DeRiVE Workshop at ISWC 2011, volume 779, pages 48--57. Citeseer, 2011.Google Scholar
- T. Joachims. Text categorization with suport vector machines: Learning with many relevant features. In ECML '98, ECML '98, pages 137--142, London, UK, UK, 1998. Springer-Verlag. Google ScholarDigital Library
- Q. V. Le and T. Mikolov. Distributed representations of sentences and documents. arXiv preprint arXiv:1405.4053, 2014.Google Scholar
- K. Leetaru and P. A. Schrodt. Gdelt: Global data on events, location, and tone, 1979--2012. In ISA Annual Convention, volume 2. Citeseer, 2013.Google Scholar
- T. Mikolov, K. Chen, G. Corrado, and J. Dean. Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781, 2013.Google Scholar
- T. Mikolov, I. Sutskever, K. Chen, G. S. Corrado, and J. Dean. Distributed representations of words and phrases and their compositionality. In Advances in neural information processing systems, pages 3111--3119, 2013.Google ScholarDigital Library
- K. Nigam, A. McCallum, S. Thrun, and T. Mitchell. Learning to classify text from labeled and unlabeled documents. In AAAI '98, AAAI '98/IAAI '98, pages 792--799, Menlo Park, CA, USA, 1998. American Association for Artificial Intelligence. Google ScholarDigital Library
- K. Nigam, A. K. McCallum, S. Thrun, and T. Mitchell. Text classification from labeled and unlabeled documents using em. Mach. Learn., 39(2--3):103--134, May 2000. Google ScholarDigital Library
- S. P. O'brien. Crisis early warning and decision support: Contemporary approaches and thoughts on future research. International Studies Review, 12(1):87--104, 2010.Google ScholarCross Ref
- S. Osinski and D. Weiss. A concept-driven algorithm for clustering search results. IEEE Intelligent Systems, 20(3):48--54, May 2005. Google ScholarDigital Library
- N. Ramakrishnan and P. Butler et. al. 'beating the news' with embers: Forecasting civil unrest using open source indicators. In KDD '14, KDD '14, pages 1799--1808, New York, NY, USA, 2014. ACM. Google ScholarDigital Library
- L. Ramshaw, E. Boschee, M. Freedman, J. MacBride, R. Weischedel, and A. Zamanian. Serif language processing effective trainable language understanding. Handbook of Natural Language Processing and Machine Translation: DARPA Global Autonomous Language Exploitation, pages 636--644, 2011.Google Scholar
- P. A. Schrodt. Tabari: Textual analysis by augmented replacement instructions. Dept. of Political Science, University of Kansas, Blake Hall, Version 0.7. 3B3, pages 1--137, 2009.Google Scholar
- P. A. Schrodt. Cameo: Conflict and mediation event observations event and actor codebook. Pennsylvania State University, 2012.Google Scholar
Index Terms
- EMBERS AutoGSR: Automated Coding of Civil Unrest Events
Recommendations
Digital transformation model: analytic approach on participatory governance & community engagement in India
dg.o '18: Proceedings of the 19th Annual International Conference on Digital Government Research: Governance in the Data AgeGovernments around the globe are more and more aiming at digital and participatory governance to become more integrative and responsive for citizen-centric superior service delivery. Reconstruction of the technical and structural framework is also going ...
News Feature Extraction for Events on Social Network Platforms
WWW '17 Companion: Proceedings of the 26th International Conference on World Wide Web CompanionMicroblog-based social network platforms like Twitter and Sina Weibo have been important sources for news event extraction. However, existing works on microblog event extraction, which usually use keywords, entities, or selected microblogs to represent ...
Extraction and Compilation of Events and Sub-events from Twitter
WI-IAT '12: Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 01Twitter has emerged as a great source to provide insights about upcoming planned and unplanned events of social, economic and political relevance. Big events are publicized and known in advance, but smaller, unplanned sub-events around them are not ...
Comments