Abstract
News reports are currently one of the most studied data sources in the field of information extraction. Event descriptions that come from these sources are controversial, complementary and reflect relationships between the participating entities. The aim of the present work is to test a group of predefined patterns and rules to obtain sets of automatically filled scenario templates for socio-political events (case study: protests) and to apply clustering algorithms. At this stage the information is extracted from Russian news titles. The results of the pattern quality assessment and clustering are presented.
This work is partially financially supported by the Government of Russian Federation, Grant 074-U01 and by research work 414648 ”Development of monitoring system for socio-cultural processes in cyberspac”.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Dementieva, I.N.: Theory and methodology of social protest study. Journal of Public Opinion Monitoring 4(116), 3–12 (2013)
Braha, D.: A Universal Model of Global Civil Unrest. PLoS ONE 7(10) e48596 (2012)
Hayes, M., Nardulli, P.F.: SPEEDs Societal Stability Protocol and the Study of Civil Unrest: an Overview and Comparison with Other Event Data Projects (white paper). Cline Center for Democracy. University of Illinois at Urbana-Champaign (2011)
Lejeune, G.: Structure patterns in Information Extraction: a multilingual solution? In: Advances in Method of Information and Communication Technology, AMICT 2009, Petrozavodsk, Russia, vol. 11, pp. 105–111 (2009)
Wunderwald, M.: Event Extraction from News Articles (Diploma Thesis), Dresden University of Technology. Dept. of Computer Science (2011)
Piskorski, J., Tanev, H., Atkinson, M., van der Goot, E., Zavarella, V.: Online news event extraction for global crisis surveillance. In: Nguyen, N.T. (ed.) Transactions on CCI V. LNCS, vol. 6910, pp. 182–212. Springer, Heidelberg (2011)
Pinto, D., Rosso, P., Jiménez, H.: A Self-Enriching Methodology for Clustering Narrow Domain Short Texts. Comput. J. 54(7), 1148–1165 (2011)
Pinto, D.: Analysis of narrow-domain short texts clustering. In: Research report for Diploma de Estudios Avanzados (DEA), Department of Information Systems and Computation, UPV (2007)
Hogenboom, F., Frasincar, F., Kaymak, U., de Jong, F.: An Overview of Event Extraction from Text. In: Workshop on Detection, Representation, and Exploitation of Events in the Semantic Web (DeRiVE 2011). CEUR Workshop Proceedings, vol. 779, pp. 48–57 (2011)
Grishman, R.: Information Extraction: Capabilities and Challenges. In: Notes for the 2012 International Winter School in Language and Speech Technologies. Rovira i Virgili University. Tarragona, Spain (2012)
Piskorski, J., Yangarber, R.: Information Extraction: Past, Present and Future. Survey. In: Poibeau, T., et al. (eds.) Multi-source, Multilingual Information Extraction and Summarization?, Theory and Applications of Natural Language Processing, Springer, Heidelberg (2012)
Atkinson, M., Piskorski, J., Van der Goot, E., Yangarber, Y.: Multilingual Real- time Event Extraction for Border Security Intelligence Gathering. In: Wiil, U.K. (ed.) Counterterrorism and Open Source Intelligence. LNS, vol. 355, Springer, Wien (2011)
Hogenboom, A., Hogenboom, F., Frasincar, F., Schouten, K., van der Meer, O.: Semantics-based information extraction for detecting economic events. Multimed Tools Appl 64, 27–52 (2013)
IJntema, W., Sangers, J., Hogenboom, F., Frasincar, F.: A lexico-semantic pattern language for learning ontology instances from text. Web Semantics: Science, Services and Agents on the World Wide Web 15, 37–50 (2012)
Huang, R., Riloff, E.: Multi-faceted Event Recognition with Bootstrapped Dictio- naries. In: Proceedings of NAACL-HLT, pp. 41–51 (2013)
Solovyev, V., Ivanov, V., Gareev, R., Serebryakov, S., Vassilieva, N.: Methodology for Building Extraction Templates for Russian Language in Knowledge-Based IE Systems. In: HP Laboratories Technical report. HPL-2012-211 (2012)
Du, M., von Etter, P., Kopotev, M., Novikov, M., Tarbeeva, N., Yangarber, R.: Building support tools for Russian-language information extraction. In: Habernal, I., Matoušek, V. (eds.) TSD 2011. LNCS (LNAI), vol. 6836, pp. 380–387. Springer, Heidelberg (2011)
Pivovarova, L., Du, M., Yangarber, R.: Adapting the PULS event extraction frame- work to analyze Russian text. In: At ACL: 4th Biennial Workshop on Balto-Slavic Natural Language Processing, Sofia, Bulgaria (2013)
Errecalde, M., Ingaramo, D., Rosso, P.: ITSA ⋆ : An effective iterative method for short-text clustering tasks. In: García-Pedrajas, N., Herrera, F., Fyfe, C., Benítez, J.M., Ali, M. (eds.) IEA/AIE 2010, Part I. LNCS, vol. 6096, pp. 550–559. Springer, Heidelberg (2010)
MacQueen, J.B.: Some Methods for classification and Analysis of Multivariate Observations. In: Proceedings of 5-th Berkeley Symposium on Mathematical Statistics and Probability, vol. 1, pp. 281–297. University of California Press, Berkeley (1967)
Manning, C., Raghavan, P., Schutze, H.: Introduction to Information Retrieval. Cambridge University Press (2009)
Zu Eissen, S.M., Stein, B.: Analysis of clustering algorithms for web-based search. In: Karagiannis, D., Reimer, U. (eds.) PAKM 2002. LNCS (LNAI), vol. 2569, pp. 168–178. Springer, Heidelberg (2002)
Stein, B., zu Eissen, S.M., Wißbrock, F.: On Cluster Validity and the Information Need of Users. In: Hanza, M.H. (ed.) 3rd IASTED Int. Conference on Artificial Intelligence and Applications (AIA 2003), Benalmádena, Spain, pp. 216–221. ACTA Press, IASTED (2003)
Hogenboom, F., Frasincar, F., Kaymak, U., de Jong, F.: An Overview of Event Extraction from Text. In: Workshop on Detection, Representation, and Exploitation of Events in the Semantic Web (DeRiVE 2011), CEUR Workshop Proceedings, vol. 779, pp. 48–57 (2011)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Danilova, V., Popova, S. (2014). Socio-Political Event Extraction Using a Rule-Based Approach. In: Meersman, R., et al. On the Move to Meaningful Internet Systems: OTM 2014 Workshops. OTM 2014. Lecture Notes in Computer Science, vol 8842. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-45550-0_55
Download citation
DOI: https://doi.org/10.1007/978-3-662-45550-0_55
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-45549-4
Online ISBN: 978-3-662-45550-0
eBook Packages: Computer ScienceComputer Science (R0)