Skip to main content

Socio-Political Event Extraction Using a Rule-Based Approach

  • Conference paper
Book cover On the Move to Meaningful Internet Systems: OTM 2014 Workshops (OTM 2014)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8842))

Abstract

News reports are currently one of the most studied data sources in the field of information extraction. Event descriptions that come from these sources are controversial, complementary and reflect relationships between the participating entities. The aim of the present work is to test a group of predefined patterns and rules to obtain sets of automatically filled scenario templates for socio-political events (case study: protests) and to apply clustering algorithms. At this stage the information is extracted from Russian news titles. The results of the pattern quality assessment and clustering are presented.

This work is partially financially supported by the Government of Russian Federation, Grant 074-U01 and by research work 414648 ”Development of monitoring system for socio-cultural processes in cyberspac”.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Dementieva, I.N.: Theory and methodology of social protest study. Journal of Public Opinion Monitoring 4(116), 3–12 (2013)

    Google Scholar 

  2. Braha, D.: A Universal Model of Global Civil Unrest. PLoS ONE 7(10) e48596 (2012)

    Google Scholar 

  3. Hayes, M., Nardulli, P.F.: SPEEDs Societal Stability Protocol and the Study of Civil Unrest: an Overview and Comparison with Other Event Data Projects (white paper). Cline Center for Democracy. University of Illinois at Urbana-Champaign (2011)

    Google Scholar 

  4. Lejeune, G.: Structure patterns in Information Extraction: a multilingual solution? In: Advances in Method of Information and Communication Technology, AMICT 2009, Petrozavodsk, Russia, vol. 11, pp. 105–111 (2009)

    Google Scholar 

  5. Wunderwald, M.: Event Extraction from News Articles (Diploma Thesis), Dresden University of Technology. Dept. of Computer Science (2011)

    Google Scholar 

  6. Piskorski, J., Tanev, H., Atkinson, M., van der Goot, E., Zavarella, V.: Online news event extraction for global crisis surveillance. In: Nguyen, N.T. (ed.) Transactions on CCI V. LNCS, vol. 6910, pp. 182–212. Springer, Heidelberg (2011)

    Google Scholar 

  7. Pinto, D., Rosso, P., Jiménez, H.: A Self-Enriching Methodology for Clustering Narrow Domain Short Texts. Comput. J. 54(7), 1148–1165 (2011)

    Article  Google Scholar 

  8. Pinto, D.: Analysis of narrow-domain short texts clustering. In: Research report for Diploma de Estudios Avanzados (DEA), Department of Information Systems and Computation, UPV (2007)

    Google Scholar 

  9. Hogenboom, F., Frasincar, F., Kaymak, U., de Jong, F.: An Overview of Event Extraction from Text. In: Workshop on Detection, Representation, and Exploitation of Events in the Semantic Web (DeRiVE 2011). CEUR Workshop Proceedings, vol. 779, pp. 48–57 (2011)

    Google Scholar 

  10. Grishman, R.: Information Extraction: Capabilities and Challenges. In: Notes for the 2012 International Winter School in Language and Speech Technologies. Rovira i Virgili University. Tarragona, Spain (2012)

    Google Scholar 

  11. Piskorski, J., Yangarber, R.: Information Extraction: Past, Present and Future. Survey. In: Poibeau, T., et al. (eds.) Multi-source, Multilingual Information Extraction and Summarization?, Theory and Applications of Natural Language Processing, Springer, Heidelberg (2012)

    Google Scholar 

  12. Atkinson, M., Piskorski, J., Van der Goot, E., Yangarber, Y.: Multilingual Real- time Event Extraction for Border Security Intelligence Gathering. In: Wiil, U.K. (ed.) Counterterrorism and Open Source Intelligence. LNS, vol. 355, Springer, Wien (2011)

    Google Scholar 

  13. Hogenboom, A., Hogenboom, F., Frasincar, F., Schouten, K., van der Meer, O.: Semantics-based information extraction for detecting economic events. Multimed Tools Appl 64, 27–52 (2013)

    Article  Google Scholar 

  14. IJntema, W., Sangers, J., Hogenboom, F., Frasincar, F.: A lexico-semantic pattern language for learning ontology instances from text. Web Semantics: Science, Services and Agents on the World Wide Web 15, 37–50 (2012)

    Article  Google Scholar 

  15. Huang, R., Riloff, E.: Multi-faceted Event Recognition with Bootstrapped Dictio- naries. In: Proceedings of NAACL-HLT, pp. 41–51 (2013)

    Google Scholar 

  16. Solovyev, V., Ivanov, V., Gareev, R., Serebryakov, S., Vassilieva, N.: Methodology for Building Extraction Templates for Russian Language in Knowledge-Based IE Systems. In: HP Laboratories Technical report. HPL-2012-211 (2012)

    Google Scholar 

  17. Du, M., von Etter, P., Kopotev, M., Novikov, M., Tarbeeva, N., Yangarber, R.: Building support tools for Russian-language information extraction. In: Habernal, I., Matoušek, V. (eds.) TSD 2011. LNCS (LNAI), vol. 6836, pp. 380–387. Springer, Heidelberg (2011)

    Chapter  Google Scholar 

  18. Pivovarova, L., Du, M., Yangarber, R.: Adapting the PULS event extraction frame- work to analyze Russian text. In: At ACL: 4th Biennial Workshop on Balto-Slavic Natural Language Processing, Sofia, Bulgaria (2013)

    Google Scholar 

  19. Errecalde, M., Ingaramo, D., Rosso, P.: ITSA ⋆ : An effective iterative method for short-text clustering tasks. In: García-Pedrajas, N., Herrera, F., Fyfe, C., Benítez, J.M., Ali, M. (eds.) IEA/AIE 2010, Part I. LNCS, vol. 6096, pp. 550–559. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

  20. MacQueen, J.B.: Some Methods for classification and Analysis of Multivariate Observations. In: Proceedings of 5-th Berkeley Symposium on Mathematical Statistics and Probability, vol. 1, pp. 281–297. University of California Press, Berkeley (1967)

    Google Scholar 

  21. Manning, C., Raghavan, P., Schutze, H.: Introduction to Information Retrieval. Cambridge University Press (2009)

    Google Scholar 

  22. Zu Eissen, S.M., Stein, B.: Analysis of clustering algorithms for web-based search. In: Karagiannis, D., Reimer, U. (eds.) PAKM 2002. LNCS (LNAI), vol. 2569, pp. 168–178. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  23. Stein, B., zu Eissen, S.M., Wißbrock, F.: On Cluster Validity and the Information Need of Users. In: Hanza, M.H. (ed.) 3rd IASTED Int. Conference on Artificial Intelligence and Applications (AIA 2003), Benalmádena, Spain, pp. 216–221. ACTA Press, IASTED (2003)

    Google Scholar 

  24. Hogenboom, F., Frasincar, F., Kaymak, U., de Jong, F.: An Overview of Event Extraction from Text. In: Workshop on Detection, Representation, and Exploitation of Events in the Semantic Web (DeRiVE 2011), CEUR Workshop Proceedings, vol. 779, pp. 48–57 (2011)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Danilova, V., Popova, S. (2014). Socio-Political Event Extraction Using a Rule-Based Approach. In: Meersman, R., et al. On the Move to Meaningful Internet Systems: OTM 2014 Workshops. OTM 2014. Lecture Notes in Computer Science, vol 8842. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-45550-0_55

Download citation

  • DOI: https://doi.org/10.1007/978-3-662-45550-0_55

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-662-45549-4

  • Online ISBN: 978-3-662-45550-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics