Neural Nets in Detecting Word Level Metaphors in Polish

Wawer, Aleksander; Marciniak, Małgorzata; Mykowiecka, Agnieszka

doi:10.1007/978-3-031-05328-3_18

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 13212))

Included in the following conference series:

Language and Technology Conference

333 Accesses

Abstract

The paper addresses an experiment in detecting metaphorical usage of adjectives and nouns in Polish data. First, we describe the data developed for the experiment. The corpus consists of 1833 excerpts containing adjective-noun phrases which can have both metaphorical and literal senses. Annotators assign literal or metaphorical senses to all adjectives and nouns in the data. Then, we describe two methods for literal/metaphorical sense classification. The first method uses Bi-LSTM neural network architecture and word embeddings of both token- and character-level. We examine the influence of adversarial training and perform analysis by part-of-speech. The second method uses the BERT token-level classifier. On our relatively small data, the LSTM based approach gives significantly better results and achieves an F1 score equal to 0.81.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 89.00; Price excludes VAT (USA)

Softcover Book: USD 119.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
https://github.com/google-research/bert.

References

Al-Rfou, R., Perozzi, B., Skiena, S.: Polyglot: distributed word representations for multilingual NLP. In: Proceedings of the Seventeenth Conference on Computational Natural Language Learning, pp. 183–192. Association for Computational Linguistics, Sofia (2013). http://www.aclweb.org/anthology/W13-3520
Beigman Klebanov, B., Leong, C.W., Gutierrez, E.D., Shutova, E., Flor, M.: Semantic classifications for detection of verb metaphors. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, pp. 101–106. Association for Computational Linguistics (2016). https://doi.org/10.18653/v1/P16-2017,http://aclweb.org/anthology/P16-2017
Beigman Klebanov, B., Shutova, E., Lichtenstein, P., Muresan, S., Wee, C. (eds.): Proceedings of the Workshop on Figurative Language Processing. Association for Computational Linguistics (2018). http://aclweb.org/anthology/W18-0900
Devlin, J., Chang, M., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. In: Burstein, J., Doran, C., Solorio, T. (eds.) Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2019, pp. 4171–4186. Association for Computational Linguistics (2019). https://doi.org/10.18653/v1/n19-1423
Ezen-Can, A.: A comparison of LSTM and BERT for small corpus. arXiv preprint arXiv:2009.05451 (2020)
Gutiérrez, E.D., Shutova, E., Marghetis, T., Bergen, B.: Literal and metaphorical senses in compositional distributional semantic models. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (vol. 1: Long Papers), pp. 183–193. Association for Computational Linguistics, Berlin (2016). https://doi.org/10.18653/v1/P16-1018,https://aclanthology.org/P16-1018
Klebanov, B.B., et al. (eds.): Proceedings of the Second Workshop on Figurative Language Processing, Fig-Lang@ACL 2020, Online, 9 July 2020. Association for Computational Linguistics (2020). https://aclanthology.org/volumes/2020.figlang-1/
Kłeczek, D.: Polbert: attacking Polish NLP tasks with transformers. In: Ogrodniczuk, M., Łukasz, K. (eds.) Proceedings of the PolEval 2020 Workshop. Institute of Computer Science, Polish Academy of Sciences (2020)
Google Scholar
Lakoff, G., Johnson, M.: Metaphors We Live by. University of Chicago Press (2008)
Google Scholar
Lample, G., Ballesteros, M., Subramanian, S., Kawakami, K., Dyer, C.: Neural architectures for named entity recognition. In: Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 260–270. Association for Computational Linguistics (2016). https://doi.org/10.18653/v1/N16-1030,http://aclweb.org/anthology/N16-1030
Marhula, J., Rosiński, M.: Co oferuje MIPVU jako metoda identyfikacji metafory? Polonica XXXVII, 37 (2017)
Google Scholar
Marhula, J., Rosiński, M.: Chapter 9: Linguistic metaphor identification in Polish. In: Metaphor Identification in Multiple Languages: MIPVU Around the World. https://osf.io/phf9q/ (2018)
Mroczkowski, R., Rybak, P., Wróblewska, A., Gawlik, I.: HerBERT: efficiently pretrained transformer-based language model for Polish. In: Proceedings of the 8th Workshop on Balto-Slavic Natural Language Processing, pp. 1–10. Association for Computational Linguistics, Kiyv (2021). https://www.aclweb.org/anthology/2021.bsnlp-1.1
Mykowiecka, A., Wawer, A., Marciniak, M.: Detecting figurative word occurrences using recurrent neural networks. In: Proceedings of the Workshop on Figurative Language Processing, pp. 124–127. Association for Computational Linguistics, New Orleans (2018). https://doi.org/10.18653/v1/W18-0916
Przepiórkowski, A., Bańko, M., Górski, R.L., Lewandowska-Tomaszczyk, B. (eds.): Narodowy Korpus Języka Polskiego. Wydawnictwo Naukowe PWN, Warszawa (2012)
Google Scholar
Shutova, E.: Design and evaluation of metaphor processing systems. Comput. Linguist. 41(4), 579–623 (2015)
Article MathSciNet Google Scholar
Steen, G.J., Dorst, A.G., Herrmann, J.B., Kaal, A., Krennmayr, T., Pasma, T.: A method for linguistic metaphor identification. From MIP to MIPVU. No. 14 in Converging Evidence in Language and Communication Research, John Benjamins (2010)
Google Scholar
Stemle, E., Onysko, A.: Using language learner data for metaphor detection. In: Proceedings of the Workshop on Figurative Language Processing, pp. 133–138. Association for Computational Linguistics, New Orleans (2018). https://doi.org/10.18653/v1/W18-0918
Tsvetkov, Y., Boytsov, L., Gershman, A., Nyberg, E., Dyer, C.: Metaphor detection with cross-lingual model transfer. In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, pp. 248–258. Association of Computational Linguistics (2014)
Google Scholar
Waszczuk, J.: Harnessing the CRF complexity with domain-specific constraints. The case of morphosyntactic tagging of a highly inflected language. In: Proceedings of the 24th International Conference on Computational Linguistics (COLING 2012), pp. 2789–2804 (2012)
Google Scholar
Wawer, A., Mykowiecka, A.: Detecting metaphorical phrases in the Polish language. In: Proceedings of the International Conference Recent Advances in Natural Language Processing, RANLP 2017, pp. 772–777. INCOMA Ltd., Varna (2017)
Google Scholar
Wolf, T., et al.: Huggingface’s Transformers: State-of-the-Art Natural Language Processing (2020)
Google Scholar
Wu, C., Wu, F., Chen, Y., Wu, S., Yuan, Z., Huang, Y.: Neural metaphor detecting with CNN-LSTM model. In: Proceedings of the Workshop on Figurative Language Processing (2018)
Google Scholar
Yasunaga, M., Kasai, J., Radev, D.R.: Robust multilingual part-of-speech tagging via adversarial training. In: Proceedings of NAACL. Association for Computational Linguistics (2018)
Google Scholar

Download references

Acknowledgments

This work was supported by the Polish National Science Centre project Compositional distributional semantic models for identification, discrimination and disambiguation of senses in Polish texts (2014/15/B/ST6/05186).

Author information

Authors and Affiliations

Institute of Computer Science PAS, Jana Kazimierza 5, 01-248, Warszawa, Poland
Aleksander Wawer, Małgorzata Marciniak & Agnieszka Mykowiecka

Authors

Aleksander Wawer
View author publications
You can also search for this author in PubMed Google Scholar
Małgorzata Marciniak
View author publications
You can also search for this author in PubMed Google Scholar
Agnieszka Mykowiecka
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Małgorzata Marciniak .

Editor information

Editors and Affiliations

Adam Mickiewicz University, Poznań, Poland
Zygmunt Vetulani
LIMSI-CNRS, Orsay, France
Patrick Paroubek
Adam Mickiewicz University, Poznań, Poland
Marek Kubis

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Wawer, A., Marciniak, M., Mykowiecka, A. (2022). Neural Nets in Detecting Word Level Metaphors in Polish. In: Vetulani, Z., Paroubek, P., Kubis, M. (eds) Human Language Technology. Challenges for Computer Science and Linguistics. LTC 2019. Lecture Notes in Computer Science(), vol 13212. Springer, Cham. https://doi.org/10.1007/978-3-031-05328-3_18

Download citation

DOI: https://doi.org/10.1007/978-3-031-05328-3_18
Published: 05 June 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-05327-6
Online ISBN: 978-3-031-05328-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Neural Nets in Detecting Word Level Metaphors in Polish