
Techniques for Multilingual Security-Related Event Extraction from Online News

  • Chapter
Computational Linguistics

Part of the book series: Studies in Computational Intelligence ((SCI,volume 458))

Abstract

This chapter presents a number of techniques for multilingual event extraction. The main task is to accurately and efficiently detect key information about security-related events in electronic news media and to summarize it in database-like structures. Gathering such information over time is important for developing global news surveillance systems, particularly in the context of security threats and mass emergencies. The chapter describes novel techniques for specific extraction tasks, including an event type classification method based on domain-specific inference rules, an approach to event geo-tagging based on lexico-semantic patterns, a simple method for cross-lingual event information fusion, and techniques for scoring the relevance of automatically extracted facts.



Author information

Corresponding author

Correspondence to Martin Atkinson.

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Atkinson, M., Du, M., Piskorski, J., Tanev, H., Yangarber, R., Zavarella, V. (2013). Techniques for Multilingual Security-Related Event Extraction from Online News. In: Przepiórkowski, A., Piasecki, M., Jassem, K., Fuglewicz, P. (eds) Computational Linguistics. Studies in Computational Intelligence, vol 458. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-34399-5_9

  • DOI: https://doi.org/10.1007/978-3-642-34399-5_9

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-34398-8

  • Online ISBN: 978-3-642-34399-5

  • eBook Packages: Engineering, Engineering (R0)
