research-article

EMBERS AutoGSR: Automated Coding of Civil Unrest Events

Authors:
Parang Saraf

Virginia Tech, Arlington, VA, USA

Virginia Tech, Arlington, VA, USA
View Profile

,
Naren Ramakrishnan

Virginia Tech, Arlington, VA, USA

Virginia Tech, Arlington, VA, USA
View Profile

KDD '16: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data MiningAugust 2016Pages 599–608https://doi.org/10.1145/2939672.2939737

Published:13 August 2016Publication History

KDD '16: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

Pages 599–608

ABSTRACT

We describe the EMBERS AutoGSR system that conducts automated coding of civil unrest events from news articles published in multiple languages. The nuts and bolts of the AutoGSR system constitute an ecosystem of filtering, ranking, and recommendation models to determine if an article reports a civil unrest event and, if so, proceed to identify and encode specific characteristics of the civil unrest event such as the when, where, who, and why of the protest. AutoGSR is a deployed system for the past 6 months continually processing data 24x7 in languages such as Spanish, Portuguese, English and encoding civil unrest events in 10 countries of Latin America: Argentina, Brazil, Chile, Colombia, Ecuador, El Salvador, Mexico, Paraguay, Uruguay, and Venezuela. We demonstrate the superiority of AutoGSR over both manual approaches and other state-of-the-art encoding systems for civil unrest.

References

E. Boschee, P. Natarajan, and R. Weischedel. Automatic extraction of events from open source text for predictive forecasting. In Handbook of Computational Approaches to Counterterrorism, pages 51--67. Springer, 2013.Google ScholarCross Ref
C. Cortes and V. Vapnik. Support-vector networks. Mach. Learn., 20(3):273--297, Sept. 1995. Google ScholarDigital Library
F. Hogenboom, F. Frasincar, U. Kaymak, and F. De Jong. An overview of event extraction from text. In DeRiVE Workshop at ISWC 2011, volume 779, pages 48--57. Citeseer, 2011.Google Scholar
T. Joachims. Text categorization with suport vector machines: Learning with many relevant features. In ECML '98, ECML '98, pages 137--142, London, UK, UK, 1998. Springer-Verlag. Google ScholarDigital Library
Q. V. Le and T. Mikolov. Distributed representations of sentences and documents. arXiv preprint arXiv:1405.4053, 2014.Google Scholar
K. Leetaru and P. A. Schrodt. Gdelt: Global data on events, location, and tone, 1979--2012. In ISA Annual Convention, volume 2. Citeseer, 2013.Google Scholar
T. Mikolov, K. Chen, G. Corrado, and J. Dean. Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781, 2013.Google Scholar
T. Mikolov, I. Sutskever, K. Chen, G. S. Corrado, and J. Dean. Distributed representations of words and phrases and their compositionality. In Advances in neural information processing systems, pages 3111--3119, 2013.Google ScholarDigital Library
K. Nigam, A. McCallum, S. Thrun, and T. Mitchell. Learning to classify text from labeled and unlabeled documents. In AAAI '98, AAAI '98/IAAI '98, pages 792--799, Menlo Park, CA, USA, 1998. American Association for Artificial Intelligence. Google ScholarDigital Library
K. Nigam, A. K. McCallum, S. Thrun, and T. Mitchell. Text classification from labeled and unlabeled documents using em. Mach. Learn., 39(2--3):103--134, May 2000. Google ScholarDigital Library
S. P. O'brien. Crisis early warning and decision support: Contemporary approaches and thoughts on future research. International Studies Review, 12(1):87--104, 2010.Google ScholarCross Ref
S. Osinski and D. Weiss. A concept-driven algorithm for clustering search results. IEEE Intelligent Systems, 20(3):48--54, May 2005. Google ScholarDigital Library
N. Ramakrishnan and P. Butler et. al. 'beating the news' with embers: Forecasting civil unrest using open source indicators. In KDD '14, KDD '14, pages 1799--1808, New York, NY, USA, 2014. ACM. Google ScholarDigital Library
L. Ramshaw, E. Boschee, M. Freedman, J. MacBride, R. Weischedel, and A. Zamanian. Serif language processing effective trainable language understanding. Handbook of Natural Language Processing and Machine Translation: DARPA Global Autonomous Language Exploitation, pages 636--644, 2011.Google Scholar
P. A. Schrodt. Tabari: Textual analysis by augmented replacement instructions. Dept. of Political Science, University of Kansas, Blake Hall, Version 0.7. 3B3, pages 1--137, 2009.Google Scholar
P. A. Schrodt. Cameo: Conflict and mediation event observations event and actor codebook. Pennsylvania State University, 2012.Google Scholar

Index Terms

EMBERS AutoGSR: Automated Coding of Civil Unrest Events
1. Information systems
  1. Information systems applications
    1. Data mining

Recommendations

Digital transformation model: analytic approach on participatory governance & community engagement in India
dg.o '18: Proceedings of the 19th Annual International Conference on Digital Government Research: Governance in the Data Age

Governments around the globe are more and more aiming at digital and participatory governance to become more integrative and responsive for citizen-centric superior service delivery. Reconstruction of the technical and structural framework is also going ...
Read More
News Feature Extraction for Events on Social Network Platforms
WWW '17 Companion: Proceedings of the 26th International Conference on World Wide Web Companion

Microblog-based social network platforms like Twitter and Sina Weibo have been important sources for news event extraction. However, existing works on microblog event extraction, which usually use keywords, entities, or selected microblogs to represent ...
Read More
Extraction and Compilation of Events and Sub-events from Twitter
WI-IAT '12: Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 01

Twitter has emerged as a great source to provide insights about upcoming planned and unplanned events of social, economic and political relevance. Big events are publicized and known in advance, but smaller, unplanned sub-events around them are not ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
KDD '16: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
August 2016
2176 pages
ISBN:9781450342322
DOI:10.1145/2939672
General Chairs:
Balaji Krishnapuram
IBM
,
Mohak Shah
Bosch
,
Program Chairs:
Alex Smola
Amazon
,
Charu Aggarwal
IBM
,
Dou Shen
Baidu
,
Rajeev Rastogi
Amazon
Copyright © 2016 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 13 August 2016
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
event encoding
event extraction
text mining
Qualifiers
- research-article
Conference

Acceptance Rates
KDD '16 Paper Acceptance Rate66of1,115submissions,6%Overall Acceptance Rate1,133of8,635submissions,13%
More
Upcoming Conference
KDD '24

Sponsor:

sigkdd

sigkdd

The 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

August 25 - 29, 2024

Barcelona , Spain
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 11
  Total Citations
  View Citations
- 185
  Total Downloads
- Downloads (Last 12 months)4
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

EMBERS AutoGSR: Automated Coding of Civil Unrest Events

KDD '16: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

ABSTRACT

References

Cited By

Index Terms

Recommendations

Digital transformation model: analytic approach on participatory governance & community engagement in India

News Feature Extraction for Events on Social Network Platforms

Extraction and Compilation of Events and Sub-events from Twitter