Abstract
A major problem in the field of peace and conflict studies is to extract events from a variety of news sources. The events need to be coded with an event type and annotated with entities from a domain specific ontology for future retrieval and analysis. The problem is dynamic in nature, characterised by new or changing groups and targets, and the emergence of new types of events. A number of automated event extraction systems exist that detect thousands of events on a daily basis. The resulting datasets, however, lack sufficient coverage of specific domains and suffer from too many duplicated and irrelevant events. Therefore expert event coding and validation is required to ensure sufficient quality and coverage of a conflict. We propose a new framework for semi-automatic rule-based event extraction and coding based on the use of deep syntactic-semantic patterns created from normal user input to an event annotation system. The method is implemented in a prototype Event Coding Assistant that processes news articles to suggest relevant events to a user who can correct or accept the suggestions. Over time as a knowledge base of patterns is built, event extraction accuracy improves and, as shown by analysis of system logs, the workload of the user is decreased.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
References
Best, R.H., Carpino, C., Crescenzi, M.J.: An analysis of the TABARI coding system. Confl. Manag. Peace Sci. 30, 335–348 (2013)
Bond, D., Bond, J., Oh, C., Jenkins, J.C., Taylor, C.L.: Integrated data for events analysis (IDEA): an event typology for automated events data development. J. Peace Res. 40, 733–745 (2003)
Bui, Q.C., Sloot, P.: Extracting biological events from text using simple syntactic patterns. In: Proceedings of the BioNLP Shared Task 2011 Workshop, pp. 143–146 (2011)
Fung, G.P.C., Yu, J.X., Yu, P.S., Lu, H.: Parameter free bursty events detection in text streams. In: Proceedings of the 31st International Conference on Very Large Data Bases, pp. 181–192 (2005)
Gerner, D.J., Schrodt, P.A., Yilmaz, O., Abu-Jabr, R.: Conflict and mediation event observations (CAMEO): a new event data framework for the analysis of foreign policy interactions. In: The Annual Meetings of the International Studies Association, New Orleans, LA (2002)
Ji, H., Grishman, R.: Refining event extraction through cross-document inference. In: Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics, pp. 254–262 (2008)
Kennedy, R.: Making useful conflict predictions. J. Peace Res. 52, 649–664 (2015)
Kuzey, E., Vreeken, J., Weikum, G.: A fresh look on knowledge bases: distilling named events from news. In: Proceedings of the 23rd ACM International Conference on Information and Knowledge Management, pp. 1689–1698 (2014)
LaFree, G., Dugan, L.: Introducing the global terrorism database. Terrorism Polit. Violence 19, 181–204 (2007)
Leetaru, K., Schrodt, P.A.: GDELT: global data on events, location, and tone, 1979–2012. In: The Annual Meetings of the International Studies Association, San Francisco, CA (2013)
Nardulli, P.F., Althaus, S.L., Hayes, M.: A progressive supervised-learning approach to generating rich civil strife data. Sociol. Methodol. 45, 148–183 (2015)
O’Brien, S.P.: Crisis early warning and decision support: contemporary approaches and thoughts on future research. Int. Stud. Rev. 12, 87–104 (2010)
Pham, S.B., Hoffmann, A.: Incremental knowledge acquisition for extracting temporal relations. In: Proceedings of the 2005 12th IEEE International Conference on Natural Language Processing and Knowledge Engineering, pp. 354–359 (2005)
Raleigh, C., Linke, A., Hegre, H., Karlsen, J.: Introducing ACLED: an armed conflict location and event dataset. J. Peace Res. 47, 651–660 (2010)
Ruiz-Sánchez, J.M., Valencia-GarcÃa, R., Fernández-Breis, J.T., MartÃnez-Béjar, R., Compton, P.: An approach for incremental knowledge acquisition from text. Expert Syst. Appl. 25, 77–86 (2003)
Rusu, D., Dali, L., Fortuna, B., Grobelnik, M., Mladenić, D.: Triplet extraction from sentences. In: Proceedings of the 10th International Multiconference Information Society - IS 2008, pp. 8–12 (2007)
Rusu, D., Hodson, J., Kimball, A.: Unsupervised techniques for extracting and clustering complex events in news. In: Proceedings of the Second Workshop on EVENTS: Definition, Detection, Coreference and Representation, pp. 26–34 (2014)
Schrodt, P.A.: Automated production of high-volume, real-time political event data. In: APSA 2010 Annual Meeting Papers (2010)
Schrodt, P.A., Beieler, J., Idris, M.: Three’s a charm?: open event data coding with EL:DIABLO, PETRARCH, and the open event data alliance. In: The Annual Meetings of the International Studies Association, Toronto, ON (2014)
Schrodt, P.A., Yonamine, J.E.: A guide to event data: past, present, and future. All Azimuth 2(2), 5–22 (2013)
Sha, L., Liu, J., Lin, C.Y., Li, S., Chang, B., Sui, Z.: RBPB: regularization-based pattern balancing method for event extraction. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, pp. 1224–1234 (2016)
Shellman, S.M.: Coding disaggregated intrastate conflict: machine processing the behavior of substate actors over time and space. Polit. Anal. 16, 464–477 (2008)
Shieber, S.M.: An Introduction to Unification-Based Approaches to Grammar. CSLI Publications, Stanford, CA (1986)
Ward, M.D., Beger, A., Cutler, J., Dickenson, M., Dorff, C., Radford, B.: Comparing GDELT and ICEWS event data. Analysis 21, 267–297 (2013)
Acknowledgements
This work was supported by Data to Decisions Cooperative Research Centre. We are grateful to Michael Burnside and Kaitlyn Hedditch for coding the AfPak event data.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Switzerland AG
About this paper
Cite this paper
Krzywicki, A., Wobcke, W., Bain, M., Schmeidl, S., Heap, B. (2018). A Knowledge Acquisition Method for Event Extraction and Coding Based on Deep Patterns. In: Yoshida, K., Lee, M. (eds) Knowledge Management and Acquisition for Intelligent Systems. PKAW 2018. Lecture Notes in Computer Science(), vol 11016. Springer, Cham. https://doi.org/10.1007/978-3-319-97289-3_2
Download citation
DOI: https://doi.org/10.1007/978-3-319-97289-3_2
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-97288-6
Online ISBN: 978-3-319-97289-3
eBook Packages: Computer ScienceComputer Science (R0)