Skip to main content

Neural Nets in Detecting Word Level Metaphors in Polish

  • Conference paper
  • First Online:
Human Language Technology. Challenges for Computer Science and Linguistics (LTC 2019)

Abstract

The paper addresses an experiment in detecting metaphorical usage of adjectives and nouns in Polish data. First, we describe the data developed for the experiment. The corpus consists of 1833 excerpts containing adjective-noun phrases which can have both metaphorical and literal senses. Annotators assign literal or metaphorical senses to all adjectives and nouns in the data. Then, we describe two methods for literal/metaphorical sense classification. The first method uses Bi-LSTM neural network architecture and word embeddings of both token- and character-level. We examine the influence of adversarial training and perform analysis by part-of-speech. The second method uses the BERT token-level classifier. On our relatively small data, the LSTM based approach gives significantly better results and achieves an F1 score equal to 0.81.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 89.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 119.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://github.com/google-research/bert.

References

  1. Al-Rfou, R., Perozzi, B., Skiena, S.: Polyglot: distributed word representations for multilingual NLP. In: Proceedings of the Seventeenth Conference on Computational Natural Language Learning, pp. 183–192. Association for Computational Linguistics, Sofia (2013). http://www.aclweb.org/anthology/W13-3520

  2. Beigman Klebanov, B., Leong, C.W., Gutierrez, E.D., Shutova, E., Flor, M.: Semantic classifications for detection of verb metaphors. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, pp. 101–106. Association for Computational Linguistics (2016). https://doi.org/10.18653/v1/P16-2017,http://aclweb.org/anthology/P16-2017

  3. Beigman Klebanov, B., Shutova, E., Lichtenstein, P., Muresan, S., Wee, C. (eds.): Proceedings of the Workshop on Figurative Language Processing. Association for Computational Linguistics (2018). http://aclweb.org/anthology/W18-0900

  4. Devlin, J., Chang, M., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. In: Burstein, J., Doran, C., Solorio, T. (eds.) Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2019, pp. 4171–4186. Association for Computational Linguistics (2019). https://doi.org/10.18653/v1/n19-1423

  5. Ezen-Can, A.: A comparison of LSTM and BERT for small corpus. arXiv preprint arXiv:2009.05451 (2020)

  6. Gutiérrez, E.D., Shutova, E., Marghetis, T., Bergen, B.: Literal and metaphorical senses in compositional distributional semantic models. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (vol. 1: Long Papers), pp. 183–193. Association for Computational Linguistics, Berlin (2016). https://doi.org/10.18653/v1/P16-1018,https://aclanthology.org/P16-1018

  7. Klebanov, B.B., et al. (eds.): Proceedings of the Second Workshop on Figurative Language Processing, Fig-Lang@ACL 2020, Online, 9 July 2020. Association for Computational Linguistics (2020). https://aclanthology.org/volumes/2020.figlang-1/

  8. Kłeczek, D.: Polbert: attacking Polish NLP tasks with transformers. In: Ogrodniczuk, M., Łukasz, K. (eds.) Proceedings of the PolEval 2020 Workshop. Institute of Computer Science, Polish Academy of Sciences (2020)

    Google Scholar 

  9. Lakoff, G., Johnson, M.: Metaphors We Live by. University of Chicago Press (2008)

    Google Scholar 

  10. Lample, G., Ballesteros, M., Subramanian, S., Kawakami, K., Dyer, C.: Neural architectures for named entity recognition. In: Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 260–270. Association for Computational Linguistics (2016). https://doi.org/10.18653/v1/N16-1030,http://aclweb.org/anthology/N16-1030

  11. Marhula, J., Rosiński, M.: Co oferuje MIPVU jako metoda identyfikacji metafory? Polonica XXXVII, 37 (2017)

    Google Scholar 

  12. Marhula, J., Rosiński, M.: Chapter 9: Linguistic metaphor identification in Polish. In: Metaphor Identification in Multiple Languages: MIPVU Around the World. https://osf.io/phf9q/ (2018)

  13. Mroczkowski, R., Rybak, P., Wróblewska, A., Gawlik, I.: HerBERT: efficiently pretrained transformer-based language model for Polish. In: Proceedings of the 8th Workshop on Balto-Slavic Natural Language Processing, pp. 1–10. Association for Computational Linguistics, Kiyv (2021). https://www.aclweb.org/anthology/2021.bsnlp-1.1

  14. Mykowiecka, A., Wawer, A., Marciniak, M.: Detecting figurative word occurrences using recurrent neural networks. In: Proceedings of the Workshop on Figurative Language Processing, pp. 124–127. Association for Computational Linguistics, New Orleans (2018). https://doi.org/10.18653/v1/W18-0916

  15. Przepiórkowski, A., Bańko, M., Górski, R.L., Lewandowska-Tomaszczyk, B. (eds.): Narodowy Korpus Języka Polskiego. Wydawnictwo Naukowe PWN, Warszawa (2012)

    Google Scholar 

  16. Shutova, E.: Design and evaluation of metaphor processing systems. Comput. Linguist. 41(4), 579–623 (2015)

    Article  MathSciNet  Google Scholar 

  17. Steen, G.J., Dorst, A.G., Herrmann, J.B., Kaal, A., Krennmayr, T., Pasma, T.: A method for linguistic metaphor identification. From MIP to MIPVU. No. 14 in Converging Evidence in Language and Communication Research, John Benjamins (2010)

    Google Scholar 

  18. Stemle, E., Onysko, A.: Using language learner data for metaphor detection. In: Proceedings of the Workshop on Figurative Language Processing, pp. 133–138. Association for Computational Linguistics, New Orleans (2018). https://doi.org/10.18653/v1/W18-0918

  19. Tsvetkov, Y., Boytsov, L., Gershman, A., Nyberg, E., Dyer, C.: Metaphor detection with cross-lingual model transfer. In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, pp. 248–258. Association of Computational Linguistics (2014)

    Google Scholar 

  20. Waszczuk, J.: Harnessing the CRF complexity with domain-specific constraints. The case of morphosyntactic tagging of a highly inflected language. In: Proceedings of the 24th International Conference on Computational Linguistics (COLING 2012), pp. 2789–2804 (2012)

    Google Scholar 

  21. Wawer, A., Mykowiecka, A.: Detecting metaphorical phrases in the Polish language. In: Proceedings of the International Conference Recent Advances in Natural Language Processing, RANLP 2017, pp. 772–777. INCOMA Ltd., Varna (2017)

    Google Scholar 

  22. Wolf, T., et al.: Huggingface’s Transformers: State-of-the-Art Natural Language Processing (2020)

    Google Scholar 

  23. Wu, C., Wu, F., Chen, Y., Wu, S., Yuan, Z., Huang, Y.: Neural metaphor detecting with CNN-LSTM model. In: Proceedings of the Workshop on Figurative Language Processing (2018)

    Google Scholar 

  24. Yasunaga, M., Kasai, J., Radev, D.R.: Robust multilingual part-of-speech tagging via adversarial training. In: Proceedings of NAACL. Association for Computational Linguistics (2018)

    Google Scholar 

Download references

Acknowledgments

This work was supported by the Polish National Science Centre project Compositional distributional semantic models for identification, discrimination and disambiguation of senses in Polish texts (2014/15/B/ST6/05186).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Małgorzata Marciniak .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Wawer, A., Marciniak, M., Mykowiecka, A. (2022). Neural Nets in Detecting Word Level Metaphors in Polish. In: Vetulani, Z., Paroubek, P., Kubis, M. (eds) Human Language Technology. Challenges for Computer Science and Linguistics. LTC 2019. Lecture Notes in Computer Science(), vol 13212. Springer, Cham. https://doi.org/10.1007/978-3-031-05328-3_18

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-05328-3_18

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-05327-6

  • Online ISBN: 978-3-031-05328-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics