Abstract
Metaphorical collocations are a subset of collocations in which a semantic shift has occurred in one of the components. The main goal of this paper is to describe the process of identifying metaphorical collocations in different languages – English, German and Croatian. Approaches to annotating metaphorical collocations from a list of word sketches for the three languages are presented using one of the most common nouns for all three languages – “year” for English, “Jahr" (Engl. year) for German, and “godina" (Engl. year) for Croatian. The compilation of a list of relevant grammatical relations in the identification of metaphorical collocations for each language is also described. Finally, the procedures for automatic classification of metaphorical collocations for Croatian, German and English are performed and compared.
Supported by Croatian Science Foundation under the project Metaphorical collocations – Syntagmatic word combinations between semantics and pragmatics (IP-2020-02-6319).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Stojić, A., Košuta, N.: Izrada inventara metaforičkih kolokacija u hrvatskome jeziku - na primjeru imenice godina. Fluminensia 34(1), 9–29 (2022)
Shutova, E.: Models of Metaphor in NLP. In: Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pp. 688–697. Association for Computational Linguistics, Uppsala, Sweden (2010)
Tsvetkov, Y., Boytsov, L., Gershman, A., Nyberg, E., Dyer, C.: Metaphor detection with cross-lingual model transfer. In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, pp. 248–258. Association for Computational Linguistics, Baltimore (2014)
Choi, M., Lee, S., Choi, E., Park, H., Lee, J., Lee, D., Lee, J.: MelBERT: metaphor detection via contextualized late interaction using metaphorical identification theories. In: Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 1763–1773. Association for Computational Linguistics (2021)
Wan, H., Lin, J., Du, J., Shen, D., Zhang, M.: Enhancing metaphor detection by gloss-based interpretations. In: Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021, pp. 1971–1981. Association for Computational Linguistics (2021)
Church, K.W., Patrick, H.: Word association norms, mutual information, and lexicography. Comput. Linguist. 16(1), 22–29 (1990)
Dunning, T.: Accurate methods for the statistics of surprise and coincidence. Comput. Linguist. 19(1), 61–74 (1993)
Kita, K., et al.: A comparative study of automatic extraction of collocations from corpora: mutual information vs. cost criteria. J. Natural Lang. Process., 21–33 (1994)
Smadja, F.: Retrieving collocations from text: Xtract. Comput. Linguist. 19(1), 143–78 (1993)
Thanopoulos, A., Fakotakis, N., Kokkinakis, G: Comparative evaluation of collocation extraction metrics. In: Proceedings of the Third International Conference on Language Resources and Evaluation, pp. 620–25 (2002)
Seretan, V., Wehrli, E.: Multilingual collocation extraction with a syntactic parse. Lang. Resour. Eval. 43(1), 71–85 (2009)
Lin, D.: Extracting collocations from text corpora. In: Proceedings of the First Workshop on Computational Terminology, pp. 57–63 (1998)
Krenn, B.: Collocation Mining: Exploiting Corpora for Collocation Identification and Representation. Entropy, 1–6 (2000)
Pearce, D.: Synonymy in Collocation Extraction. In: Proceedings of the Workshop on WordNet and Other Lexical Resources, Second Meeting of the North American Chapter of the Association for Computational Linguistics, pp. 41–46. Association for Computational Linguistics, (2021)
Karan, M., Šnajder, J., Dalbelo Bašić, B.: Evaluation of Classification Algorithms and Features for Collocation Extraction in Croatian. In: Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC’12), pp. 657–62. European Language Resources Association (ELRA), Istanbul, Turkey (2012)
Ljubešić, N., Logar, N., Kosem, I.: Collocation ranking: frequency vs semantics. Slovenscina 2.0, 9(2), 41–70 (2021)
Brkic Bakaric, M., Nacinovic Prskao, L., Matetic, M.: Insights into automatic extraction of metaphorical collocations. Rasprave (Manuscript submitted for publication)
Kilgarriff, A. et al.: The Sketch Engine: Ten Years On. Lexicography, pp. 7–36 (2014)
Ljubešić, N., Erjavec, T.: hrWaC and slWac: compiling web corpora for croatian and slovene. In: Habernal, I., Matoušek, V. (eds.) TSD 2011. LNCS (LNAI), vol. 6836, pp. 395–402. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-23538-2_50
Jakubíček, M. et al.: The TenTen Corpus Family. In: Proceedings of the 7th International Corpus Linguistics Conference CL, pp. 125–127 (2013)
Ljubešić, N. et al.: New inflectional lexicons and training corpora for improved morphosyntactic annotation of Croatian and Serbian. In: Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016). European Language Resources Association (ELRA), Parise, France (2016)
Rychlý, P.: A Lexicographer-Friendly Association Score. In: Proceedings of the Recent Advances in Slavonic Natural Language Processing, pp. 6–9 (2008)
Stojić, A., Košuta, N.: METAPHORISCHE KOLLOKATIONEN - EINBLICKE IN EINE KORPUSBASIERTE STUDIE. Linguistica 61(1), 81–91 (2022)
Grave, E., Bojanowski, P., Gupta, P., Joulin, A., Mikolov, T.: Learning Word Vectors for 157 Languages. In: Proceedings of the International Conference on Language Resources and Evaluation (LREC 2018), pp. 3483–3487 (2018). https://fasttext.cc/
Acknowledgement
This work has been fully supported by Croatian Science Foundation under the project Metaphorical collocations - Syntagmatic word combinations between semantics and pragmatics (IP-2020-02-6319).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 Springer Nature Switzerland AG
About this paper
Cite this paper
Nacinovic Prskalo, L., Brkic Bakaric, M. (2022). Identification of Metaphorical Collocations in Different Languages – Similarities and Differences. In: Sojka, P., Horák, A., Kopeček, I., Pala, K. (eds) Text, Speech, and Dialogue. TSD 2022. Lecture Notes in Computer Science(), vol 13502. Springer, Cham. https://doi.org/10.1007/978-3-031-16270-1_9
Download citation
DOI: https://doi.org/10.1007/978-3-031-16270-1_9
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-16269-5
Online ISBN: 978-3-031-16270-1
eBook Packages: Computer ScienceComputer Science (R0)