A Knowledge Acquisition Method for Event Extraction and Coding Based on Deep Patterns

  • Conference paper
Knowledge Management and Acquisition for Intelligent Systems (PKAW 2018)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 11016)

Abstract

A major problem in the field of peace and conflict studies is extracting events from a variety of news sources. The events need to be coded with an event type and annotated with entities from a domain-specific ontology for future retrieval and analysis. The problem is dynamic in nature, characterised by new or changing groups and targets and by the emergence of new types of events. A number of automated event extraction systems exist that detect thousands of events on a daily basis. The resulting datasets, however, lack sufficient coverage of specific domains and contain too many duplicated and irrelevant events. Expert event coding and validation are therefore required to ensure sufficient quality and coverage of a conflict. We propose a new framework for semi-automatic rule-based event extraction and coding, based on deep syntactic-semantic patterns created from normal user input to an event annotation system. The method is implemented in a prototype Event Coding Assistant that processes news articles and suggests relevant events to a user, who can correct or accept the suggestions. Over time, as a knowledge base of patterns is built, event extraction accuracy improves and, as shown by analysis of system logs, the user's workload decreases.

Notes

  1. https://opennlp.apache.org/

Acknowledgements

This work was supported by the Data to Decisions Cooperative Research Centre. We are grateful to Michael Burnside and Kaitlyn Hedditch for coding the AfPak event data.

Author information

Corresponding author

Correspondence to Alfred Krzywicki.

Copyright information

© 2018 Springer Nature Switzerland AG

About this paper

Cite this paper

Krzywicki, A., Wobcke, W., Bain, M., Schmeidl, S., Heap, B. (2018). A Knowledge Acquisition Method for Event Extraction and Coding Based on Deep Patterns. In: Yoshida, K., Lee, M. (eds) Knowledge Management and Acquisition for Intelligent Systems. PKAW 2018. Lecture Notes in Computer Science, vol 11016. Springer, Cham. https://doi.org/10.1007/978-3-319-97289-3_2

  • DOI: https://doi.org/10.1007/978-3-319-97289-3_2

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-97288-6

  • Online ISBN: 978-3-319-97289-3

  • eBook Packages: Computer Science, Computer Science (R0)
