Abstract
In the paper, we deal with the problem of spatial expression recognition. The goal of this task is to recognize in text information structures that represent a relative spatial relationship between two objects (a trajector and a landmark) indicated by a preposition of location, for example, a book on the table. We used the Corpus of Polish Spatial Texts (PST) to evaluate the knowledge-based approach to spatial expression recognition. We focused on the evaluation of the recall of the method for filtering candidates of spatial expressions. Our goal was to identify the bottlenecks of the existing preprocessing pipeline and the knowledge-based approach. We have shown that it is necessary to focus on three main areas, i.e., coreference resolution (relations from implied subjects and pronouns to nouns and named entities), word sense disambiguation, and cognitive schemas.
Work financed as part of the investment in the CLARIN-PL research infrastructure funded by the Polish Ministry of Science and Higher Education.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
FrameNet: http://framenet.icsi.berkeley.edu/. Accessed 3 Jan 2020
Acedański, S.: A morphosyntactic brill tagger for inflectional languages. In: Loftsson, H., Rögnvaldsson, E., Helgadóttir, S. (eds.) NLP 2010. LNCS (LNAI), vol. 6233, pp. 3–14. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-14770-8_3
Dobnik, S., Kelleher, J.: Exploration of functional semantics of prepositions from corpora of descriptions of visual scenes. In: Proceedings of the Third Workshop on Vision and Language, pp. 33–37, Dublin City University and the Association for Computational Linguistics, Dublin, Ireland, August 2014. https://doi.org/10.3115/v1/W14-5405, https://www.aclweb.org/anthology/W14-5405
Fellbaum, C., Miller, G.: The Lexical Database. MITP (1998)
Garrod, S., Ferrier, G., Campbell, S.: In and on: investigating the functional geometry of spatial prepositions. Cognition 72(2), 167–189 (1999). https://doi.org/10.1016/S0010-0277(99)00038-4,http://www.sciencedirect.com/science/article/pii/S0010027799000384
Głowińska, K.: Anotacja składniowa NKJP. In: Przepiórkowski, A., Bańko, M., Górski, R.L., Lewandowska-Tomaszczyk, B. (eds.) Narodowy Korpus Języka Polskiego, pp. 107–127. Wydawnictwo Naukowe PWN, Warsaw (2012)
Jenge, C., Kawaletz, S., Schade, U.: Combining different NLP methods for HUMINT report analysis (2009)
Kaczmarek, A., Marcińczuk, M.: Heuristic algorithm for zero subject detection in polish. In: Král, P., Matoušek, V. (eds.) TSD 2015. LNCS (LNAI), vol. 9302, pp. 378–386. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-24033-6_43
Kolomiyets, O., Kordjamshidi, P., Bethard, S., Moens, M.: SemEval-2013 task 3: spatial role labeling. Second joint conference on lexical and computational semantics (SEM). In: Proceedings of the Seventh International Workshop on Semantic Evaluation (SemEval 2013), East Stroudsburg, PA, ACL, Atlanta, USA, June 2013
Kopeć, M., Ogrodniczuk, M.: Creating a coreference resolution system for polish. In: Proceedings of the Eighth International Conference on Language Resources and Evaluation, LREC 2012. ELRA, Istanbul, Turkey, pp. 192–195 (2012)
Kopeć, M.: Zero subject detection for polish. In: Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, Short Papers, vol. 2, pp. 221–225. Association for Computational Linguistics, Gothenburg (2014)
Kędzia, P., Piasecki, M., Orlińska, M.: WoSeDon (2016). http://hdl.handle.net/11321/290, CLARIN-PL digital repository
Mani, I., et al.: SpatialML: annotation scheme, resources, and evaluation. Lang. Resour. Eval. 44, 263–280 (2010)
Marcińczuk, M., Kocoń, J., Janicki, M.: Liner2 – a customizable framework for proper names recognition for Polish. In: Bembenik, R., Skonieczny, Ł., Rybiński, H., Kryszkiewicz, M., Niezgódka, M. (eds.) Intelligent Tools for Building a Scientific Information Platform, Studies in Computational Intelligence, vol. 467, pp. 231–253. Springer (2013). https://doi.org/10.1007/978-3-642-35647-6_17, http://dblp.uni-trier.de/db/series/sci/sci467.html#MarcinczukKJ13
Marcińczuk, M., Oleksy, M.: Inforex – a collaborative system for text corpora annotation and analysis goes open. In: Proceedings of the International Conference Recent Advances in Natural Language Processing, RANLP, pp. 711–719 (2019)
Marcińczuk, M.M., Oleksy, M., Wieczorek, J.: Towards recognition of spatial relations between entities for polish. Cognitive Studies|Études cognitives (16), 119–132 (2016)
Marcińczuk, M.: Fine-grained named entity recognition for polish using deep learning. In: Proceedings of PP-RAI 2019 Conference, Department of Systems and Computer Networks, Faculty of Electronics, Wroclaw University of Science and Technology, Wrocław, pp. 219–222 (2019)
Maziarz, M., Piasecki, M., Szpakowicz, S.: Approaching plWordNet 2.0. In: Proceedings of the 6th Global Wordnet Conference, Matsue, Japan, January 2012
Oleksy, M., Marcińczuk, M., Bernaś, T., Wieczorek, J., Kocoń, J.: KPWr annotation guidelines - spatial expressions (2.0) (2019). http://hdl.handle.net/11321/719, CLARIN-PL digital repository
Oleksy, M., Wieczorek, J., Bernaś, T., Marcińczuk, M.: Polish Spatial Texts (PST) 2.0 (2019). http://hdl.handle.net/11321/721, CLARIN-PL digital repository
Pease, A., Niles, I., Li, J.: The suggested upper merged ontology: a large ontology for the semantic web and its applications. In: In Working Notes of the AAAI-2002 Workshop on Ontologies and the Semantic Web (2002)
Przepiórkowski, A.: Powierzchniowe przetwarzanie języka polskiego. Problemy współczesnej nauki, teoria i zastosowania: Inżynieria lingwistyczna, Akademicka Oficyna Wydawnicza “Exit” (2008). https://books.google.pl/books?id=V076OgAACAAJ
Pustejovsky, J., Moszkowicz, J., Verhagen, M.: A linguistically grounded annotation language for spatial information. TAL 53(2), 87–113 (2012). http://atala.org/Extraction-de-dates-saillantes
Radziszewski, A.: Metody znakowania morfosyntaktycznego i automatycznej płytkiej analizy składniowej języka polski. Ph.D. thesis, Politechnika Wrocławska, Wrocław (2012)
Radziszewski, A.: A tiered CRF tagger for Polish. In: Bembenik, R., Skonieczny, L., Rybiński, H., Kryszkiewicz, M., Niezgódka, M. (eds.) Intelligent Tools for Building a Scientific Information Platform: Advanced Architectures and Solutions. Springer Verlag (2013). https://doi.org/10.1007/978-3-642-35647-6_16
Radziszewski, A., Pawlaczek, A.: Large-scale experiments with np chunking of Polish. In: Sojka, P., Horák, A., Kopeček, I., Pala, K. (eds.) TSD 2012. LNCS (LNAI), vol. 7499, pp. 143–149. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-32790-2_17
Roberts, K., Rodriguez, L., Shooshan, S.E., Demner-Fushman, D.: Automatic extraction and post-coordination of spatial relations in consumer language. In: AMIA ... Annual Symposium Proceedings. AMIA Symposium 2015, pp. 1083–1092 (2015)
Waszczuk, J.: Harnessing the CRF complexity with domain-specific constraints. The case of morphosyntactic tagging of a highly inflected language. In: Proceedings of COLING 2012, pp. 2789–2804, December 2012 . http://cse.iitk.ac.in/users/cs671/2013/hw3/waszczuk-12coling_CRF-w-domainspecific-constraints-for-morpho-tagging.pdf
Wieczorek, J., Oleksy, M.: NE\_SUMO\_PLWN\_mapping (2016). http://hdl.handle.net/11321/286, CLARIN-PL digital repository
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Marcińczuk, M., Oleksy, M., Wieczorek, J. (2020). Evaluation of Knowledge-Based Recognition of Spatial Expressions for Polish. In: Nguyen, N.T., Hoang, B.H., Huynh, C.P., Hwang, D., Trawiński, B., Vossen, G. (eds) Computational Collective Intelligence. ICCCI 2020. Lecture Notes in Computer Science(), vol 12496. Springer, Cham. https://doi.org/10.1007/978-3-030-63007-2_53
Download citation
DOI: https://doi.org/10.1007/978-3-030-63007-2_53
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-63006-5
Online ISBN: 978-3-030-63007-2
eBook Packages: Computer ScienceComputer Science (R0)