Abstract
The paper addresses an experiment in detecting metaphorical usage of adjectives and nouns in Polish data. First, we describe the data developed for the experiment. The corpus consists of 1833 excerpts containing adjective-noun phrases which can have both metaphorical and literal senses. Annotators assign literal or metaphorical senses to all adjectives and nouns in the data. Then, we describe two methods for literal/metaphorical sense classification. The first method uses Bi-LSTM neural network architecture and word embeddings of both token- and character-level. We examine the influence of adversarial training and perform analysis by part-of-speech. The second method uses the BERT token-level classifier. On our relatively small data, the LSTM based approach gives significantly better results and achieves an F1 score equal to 0.81.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Al-Rfou, R., Perozzi, B., Skiena, S.: Polyglot: distributed word representations for multilingual NLP. In: Proceedings of the Seventeenth Conference on Computational Natural Language Learning, pp. 183–192. Association for Computational Linguistics, Sofia (2013). http://www.aclweb.org/anthology/W13-3520
Beigman Klebanov, B., Leong, C.W., Gutierrez, E.D., Shutova, E., Flor, M.: Semantic classifications for detection of verb metaphors. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, pp. 101–106. Association for Computational Linguistics (2016). https://doi.org/10.18653/v1/P16-2017,http://aclweb.org/anthology/P16-2017
Beigman Klebanov, B., Shutova, E., Lichtenstein, P., Muresan, S., Wee, C. (eds.): Proceedings of the Workshop on Figurative Language Processing. Association for Computational Linguistics (2018). http://aclweb.org/anthology/W18-0900
Devlin, J., Chang, M., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. In: Burstein, J., Doran, C., Solorio, T. (eds.) Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2019, pp. 4171–4186. Association for Computational Linguistics (2019). https://doi.org/10.18653/v1/n19-1423
Ezen-Can, A.: A comparison of LSTM and BERT for small corpus. arXiv preprint arXiv:2009.05451 (2020)
Gutiérrez, E.D., Shutova, E., Marghetis, T., Bergen, B.: Literal and metaphorical senses in compositional distributional semantic models. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (vol. 1: Long Papers), pp. 183–193. Association for Computational Linguistics, Berlin (2016). https://doi.org/10.18653/v1/P16-1018,https://aclanthology.org/P16-1018
Klebanov, B.B., et al. (eds.): Proceedings of the Second Workshop on Figurative Language Processing, Fig-Lang@ACL 2020, Online, 9 July 2020. Association for Computational Linguistics (2020). https://aclanthology.org/volumes/2020.figlang-1/
Kłeczek, D.: Polbert: attacking Polish NLP tasks with transformers. In: Ogrodniczuk, M., Łukasz, K. (eds.) Proceedings of the PolEval 2020 Workshop. Institute of Computer Science, Polish Academy of Sciences (2020)
Lakoff, G., Johnson, M.: Metaphors We Live by. University of Chicago Press (2008)
Lample, G., Ballesteros, M., Subramanian, S., Kawakami, K., Dyer, C.: Neural architectures for named entity recognition. In: Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 260–270. Association for Computational Linguistics (2016). https://doi.org/10.18653/v1/N16-1030,http://aclweb.org/anthology/N16-1030
Marhula, J., Rosiński, M.: Co oferuje MIPVU jako metoda identyfikacji metafory? Polonica XXXVII, 37 (2017)
Marhula, J., Rosiński, M.: Chapter 9: Linguistic metaphor identification in Polish. In: Metaphor Identification in Multiple Languages: MIPVU Around the World. https://osf.io/phf9q/ (2018)
Mroczkowski, R., Rybak, P., Wróblewska, A., Gawlik, I.: HerBERT: efficiently pretrained transformer-based language model for Polish. In: Proceedings of the 8th Workshop on Balto-Slavic Natural Language Processing, pp. 1–10. Association for Computational Linguistics, Kiyv (2021). https://www.aclweb.org/anthology/2021.bsnlp-1.1
Mykowiecka, A., Wawer, A., Marciniak, M.: Detecting figurative word occurrences using recurrent neural networks. In: Proceedings of the Workshop on Figurative Language Processing, pp. 124–127. Association for Computational Linguistics, New Orleans (2018). https://doi.org/10.18653/v1/W18-0916
Przepiórkowski, A., Bańko, M., Górski, R.L., Lewandowska-Tomaszczyk, B. (eds.): Narodowy Korpus Języka Polskiego. Wydawnictwo Naukowe PWN, Warszawa (2012)
Shutova, E.: Design and evaluation of metaphor processing systems. Comput. Linguist. 41(4), 579–623 (2015)
Steen, G.J., Dorst, A.G., Herrmann, J.B., Kaal, A., Krennmayr, T., Pasma, T.: A method for linguistic metaphor identification. From MIP to MIPVU. No. 14 in Converging Evidence in Language and Communication Research, John Benjamins (2010)
Stemle, E., Onysko, A.: Using language learner data for metaphor detection. In: Proceedings of the Workshop on Figurative Language Processing, pp. 133–138. Association for Computational Linguistics, New Orleans (2018). https://doi.org/10.18653/v1/W18-0918
Tsvetkov, Y., Boytsov, L., Gershman, A., Nyberg, E., Dyer, C.: Metaphor detection with cross-lingual model transfer. In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, pp. 248–258. Association of Computational Linguistics (2014)
Waszczuk, J.: Harnessing the CRF complexity with domain-specific constraints. The case of morphosyntactic tagging of a highly inflected language. In: Proceedings of the 24th International Conference on Computational Linguistics (COLING 2012), pp. 2789–2804 (2012)
Wawer, A., Mykowiecka, A.: Detecting metaphorical phrases in the Polish language. In: Proceedings of the International Conference Recent Advances in Natural Language Processing, RANLP 2017, pp. 772–777. INCOMA Ltd., Varna (2017)
Wolf, T., et al.: Huggingface’s Transformers: State-of-the-Art Natural Language Processing (2020)
Wu, C., Wu, F., Chen, Y., Wu, S., Yuan, Z., Huang, Y.: Neural metaphor detecting with CNN-LSTM model. In: Proceedings of the Workshop on Figurative Language Processing (2018)
Yasunaga, M., Kasai, J., Radev, D.R.: Robust multilingual part-of-speech tagging via adversarial training. In: Proceedings of NAACL. Association for Computational Linguistics (2018)
Acknowledgments
This work was supported by the Polish National Science Centre project Compositional distributional semantic models for identification, discrimination and disambiguation of senses in Polish texts (2014/15/B/ST6/05186).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 Springer Nature Switzerland AG
About this paper
Cite this paper
Wawer, A., Marciniak, M., Mykowiecka, A. (2022). Neural Nets in Detecting Word Level Metaphors in Polish. In: Vetulani, Z., Paroubek, P., Kubis, M. (eds) Human Language Technology. Challenges for Computer Science and Linguistics. LTC 2019. Lecture Notes in Computer Science(), vol 13212. Springer, Cham. https://doi.org/10.1007/978-3-031-05328-3_18
Download citation
DOI: https://doi.org/10.1007/978-3-031-05328-3_18
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-05327-6
Online ISBN: 978-3-031-05328-3
eBook Packages: Computer ScienceComputer Science (R0)