Skip to main content

Wordnet – a Basic Resource for Natural Language Processing: The Case of plWordNet

  • Conference paper
  • First Online:
Advances in Computational Collective Intelligence (ICCCI 2020)

Abstract

This paper presents a wide scope of wordnet applications on the example of applications of plWordNet – a wordnet of Polish. Wordnets are large lexical-semantic databases functioning as primary resources for language technology. They are machine-readable dictionaries. Thus, they are indispensible for tasks such as basic flow of text processing, text mining, word sense disambiguation, information extraction and retrieval. On a larger scale, wordnets are used in research, education and business. In this paper a few examples of specific plWordNet applications are described in detail.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    As early as 2004, as Morato et al. mentioned, ontologies and semantic web were one of the most dynamically developing areas of wordnet applications.

References

  1. CiteSeerX. citeseerx.ist.psu.edu. Accessed 14 Jan 2020

  2. CLARIN homepage. www.clarin.eu. Accessed 14 Jan 2020

  3. Engineering Research Database. www.elsevier.com/solutions/engineering-village/content/inspec. Accessed 14 Jan 2020

  4. Google Scholar. https://scholar.google.com/. Accessed 14 Jan 2020

  5. Institute of Electrical and Electronics Engineers. www.ieee.org. Accessed 14 Jan 2020

  6. Library and Information Science Abstracts. www.proquest.com/products-services/lisa-set-c.html. Accessed 14 Jan 2020

  7. Library of the university of carlos III of madrid. www.uc3m.es/Home. Accessed 14 Jan 2020

  8. WordNet. https://wordnet.princeton.edu/. Accessed 14 Jan 2020

  9. Bond, F., Foster, R.: Linking and extending an open multilingual wordnet. In: Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics. 1, pp. 1352–1362 (2013)

    Google Scholar 

  10. Bond, F., Janz, A., Piasecki, M.: A comparison of sense-level sentiment scores. In: Proceedings of the 10th Global Wordnet Conference, pp. 363–372 (2019)

    Google Scholar 

  11. Czerski, D., Ciesielski, K., Dramiński, M., Kłopotek, M., Łoziński, P., Wierzchoń, S.: What NEKST?—semantic search engine for polish internet. In: De Tré, G., Grzegorzewski, P., Kacprzyk, J., Owsiński, J.W., Penczek, W., Zadrożny, S. (eds.) Challenging Problems and Solutions in Intelligent Systems. SCI, vol. 634, pp. 335–347. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-30165-5_16

    Chapter  Google Scholar 

  12. Dębowski, Ł., Broda, B., Nitoń, B., Charzyńska, E.: Jasnopis-a program to compute readability of texts in polish based on psycholinguistic research. Nat. Lang. Process. Cogn. Sci. pp. 51–61 (2015)

    Google Scholar 

  13. Dziob, A., Piasecki, M.: Dynamic verbs in the wordnet of polish. Cogn. Stud. (18) (2018)

    Google Scholar 

  14. Dziob, A., Piasecki, M., Rudnicka, E.: plWordNet 4.1-a linguistically motivated, corpus-based bilingual resource. In: Proceedings of the 10th Global Wordnet Conference, pp. 353–362 (2019)

    Google Scholar 

  15. Eder, M., Piasecki, M., Walkowiak, T.: An open stylometric system based on multilevel text analysis. Cognitive Studies| Études cognitives (17) (2017)

    Google Scholar 

  16. Graliński, F., Jassem, K., Marcińczuk, M., Wawrzyniak, P.: Named entity recognition in machine anonymization. Recent Advances in Intelligent Information Systems, pp. 247–260 (2009)

    Google Scholar 

  17. Griesel, M., Bosch, S., Mojapelo, M.L.: Thinking globally, acting locally-progress in the african wordnet project. In: Proceedings of the 10th Global Wordnet Conference, pp. 191–196 (2019)

    Google Scholar 

  18. Hajnicz, E., Bartosiak, T.: Connections between the semantic layer of Walenty valency dictionary and plWordNet. In: Proceedings of the 10th Global Wordnet Conference, pp. 99–107 (2019)

    Google Scholar 

  19. Kędzia, P., Piasecki, M., Orlińska, M.: Word sense disambiguation based on large scale Polish CLARIN heterogeneous lexical resources. Cognitive Studies (15) (2015)

    Google Scholar 

  20. Kocoń, J., Janz, A., Piasecki, M.: Context-sensitive sentiment propagation in WordNet. In: Proceedings of the 9th Global Wordnet Conference, pp. 333–338 (2018)

    Google Scholar 

  21. Kocoń, J., Marcińczuk, M.: Generating of events dictionaries from polish wordNet for the recognition of events in polish documents. In: Sojka, P., Horák, A., Kopeček, I., Pala, K. (eds.) TSD 2016. LNCS (LNAI), vol. 9924, pp. 12–19. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-45510-5_2

    Chapter  Google Scholar 

  22. Kocoń, J., Marcińczuk, M.: Supervised approach to recognise Polish temporal expressions and rule-based interpretation of timexes. Nat. Lang. Eng. 23(3), 385–418 (2017)

    Article  Google Scholar 

  23. Maciołek, P., Dobrowolski, G.: Cluo: web-scale text mining system for open source intelligence purposes. Comput. Sci. 14(1), 45–62 (2013)

    Article  MathSciNet  Google Scholar 

  24. Marchewka, A., et al.: Recognition of emotions, valence and arousal in large-scale multi-domain text reviews pp. 274–280 (2019)

    Google Scholar 

  25. Marcińczuk, M., Oleksy, M., Wieczorek, J.: Preliminary study on automatic recognition of spatial expressions in Polish texts. In: Sojka, P., Horák, A., Kopeček, I., Pala, K. (eds.) TSD 2016. LNCS (LNAI), vol. 9924, pp. 154–162. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-45510-5_18

    Chapter  Google Scholar 

  26. Maryl, M., Piasecki, M., Walkowiak, T.: Literary exploration machine: a Web-based application for textual scholars. In: Selected papers from the CLARIN Annual Conference, (147) pp. 128–144 (2018)

    Google Scholar 

  27. Maziarz, M., Piasecki, M.: Towards mapping thesauri onto plWordNet. In: Proceedings of the 9th Global WordNet Conference (GWC 2018), pp. 45–53 (2018)

    Google Scholar 

  28. Maziarz, M., Piasecki, M., Rudnicka, E.: Słowosieć-polski wordnet. Proces tworzenia tezaurusa. Polonica 34, 79–98 (2014)

    Google Scholar 

  29. Maziarz, M., Szpakowicz, S., Piasecki, M.: A procedural definition of multi-word lexical units. In: Proceedings of the International Conference Recent Advances in Natural Language Processing, pp. 427–435 (2015)

    Google Scholar 

  30. McCrae, J.P., Rademaker, A., Bond, F., Rudnicka, E., Fellbaum, C.: English wordnet 2019–an open-source wordnet for english. In: Proceedings of the 10th Global Wordnet Conference, pp. 245–252 (2019)

    Google Scholar 

  31. Miller, G.: WordNet: An Electronic Lexical Database. MIT Press (1998)

    Google Scholar 

  32. Morato, J., Marzal, M.A., Lloréns, J., Moreiro, J.: WordNet applications. In: Proceedings of 2nd Global Wordnet Conference, pp. 270–278 (2004)

    Google Scholar 

  33. Mykowiecka, A., Marciniak, M.: Combining wordnet and morphosyntactic information in terminology clustering. In: Proceedings of COLING 2012, pp. 1951–1962 (2012)

    Google Scholar 

  34. Naskręt, T.: A collaborative system for building and maintaining wordnets. In: Proceedings of the 10th Global Wordnet Conference, pp. 323–328 (2019)

    Google Scholar 

  35. Naskręt, T., Dziob, A., Piasecki, M., Saedi, C., Branco, A.: WordnetLoom-a multilingual wordnet editing system focused on graph-based presentation. In: Proceedings of the 9th Global Wordnet Conference, pp. 191–200 (2018)

    Google Scholar 

  36. Nowaczyk, A., Jackowska-Strumiłło, L.: Rozpoznawanie emocji w tekstach polskojęzycznych z wykorzystaniem metody słów kluczowych. Informatyka, Automatyka, Pomiary w Gospodarce i Ochronie Środowiska 7(2), 102–105 (2017)

    Article  Google Scholar 

  37. Ogrodniczuk, M., Bronk, Z., Kieras, W.: Multisłownik: linking plWordNet-based lexical data for lexicography and educational purposes. In: Proceedings of the 9th Global Wordnet Conference, pp. 368–375 (2018)

    Google Scholar 

  38. Pedersen, B.S., Nimb, S., Olsen, I.R., Olsen, S.: Merging danNet with princeton wordnet. In: Proceedings of the 10th Global Wordnet Conference, pp. 125–134 (2019)

    Google Scholar 

  39. Piasecki, M., Broda, B., Szpakowicz, S.: A Wordnet from the ground up. Oficyna Wydawnicza Politechniki Wrocławskiej Wrocław (2009)

    Google Scholar 

  40. Piasecki, M., Burdka, Ł., Maziarz, M., Kaliński, M.: Diagnostic tools in plWordNet development process. In: Vetulani, Z., Uszkoreit, H., Kubis, M. (eds.) LTC 2013. LNCS (LNAI), vol. 9561, pp. 255–273. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-43808-5_20

    Chapter  Google Scholar 

  41. Piasecki, M., Kaliński, M., Indyka-Piasecka, A.: Disambiguating wikipedia articles on the basis of plWordNet lexico-semantic relations. In: Castro, F., Gelbukh, A., González, M. (eds.) MICAI 2013. LNCS (LNAI), vol. 8265, pp. 228–239. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-45114-0_18

    Chapter  Google Scholar 

  42. Piasecki, M., Walkowiak, T., Rudnicka, E., Bond, F.: Lexical Platform-the first step towards user-centred integration of lexical resources. Cognitive Studies| Études cognitives (18) (2018)

    Google Scholar 

  43. Piasecki, M., Wendelberger, M., Maziarz, M.: Extraction of the multi-word lexical units in the perspective of the wordnet expansion. In: Proceedings of the International Conference RANLP, pp. 512–520 (2015)

    Google Scholar 

  44. Przybyła, P.: Boosting question answering by deep entity recognition. arXiv preprint arXiv:1605.08675 (2016)

  45. Rudnicka, E., Maziarz, M., Piasecki, M., Szpakowicz, S.: A strategy of mapping Polish Wordnet onto Princeton WordNet. In: Proceedings of COLING 2012: Posters, pp. 1039–1048 (2012)

    Google Scholar 

  46. Rudnicka, E., Piasecki, M., Bond, F., Grabowski, Ł., Piotrowski, T.: Sense equivalence in plWordNet to Princeton WordNet mapping. Int. J. Lexicography 1, 1–30 (2019)

    Google Scholar 

  47. Rudnicka, E.K., Witkowski, W., Kaliński, M.: Towards the methodology for extending princeton wordnet. Cogn. Stud.| Études cognitives (15), pp. 335–351 (2015)

    Google Scholar 

  48. Rutkowski, S., Rychlik, P., Mykowiecka, A.: Estimating senses with sets of lexically related words for Polish word sense disambiguation. In: Proceedings of the 10th Global Wordnet Conference, pp. 118–124 (2019)

    Google Scholar 

  49. Rybiński, K.: Political sentiment analysis of Polish politicians. e-politicon, 24, 162–195 (2017)

    Google Scholar 

  50. Twardowski, B., Gawrysiak, P.: Domain dependent product feature and opinion extraction based on e-commerce websites. In: Zgrzywa, A., Choroś, K., Siemiński, A. (eds.) Multimedia and Internet Systems: Theory and Practice, pp. 261–270. Springer, Berlin Heidelberg, Berlin (2013). https://doi.org/10.1007/978-3-642-32335-5_25

    Chapter  Google Scholar 

  51. Wróblewska, A.: Polish corpus of annotated descriptions of images. In: Proceedings of the 11th Int. Conference on Language Resources and Evaluation (2018)

    Google Scholar 

  52. Wróblewska, A., Protaziuk, G., Bembenik, R., Podsiadły-Marczykowska, T.: Associations between texts and ontology. In: Bembenik R., Skonieczny L., Rybinski H., Kryszkiewicz M., Niezgodka M. (eds.) Intelligent Tools for Building a Scientific Information Platform. Studies in Computational Intelligence, vol 467. Springer, Berlin, Heidelberg (2013). https://doi.org/10.1007/978-3-642-35647-6_20

  53. Zaśko-Zielińska, M., Piasecki, M., Szpakowicz, S.: A large wordnet-based sentiment lexicon for Polish. In: Proceedings of the International Conference RANLP, pp. 721–730 (2015)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Agnieszka Dziob .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Dziob, A., Naskręt, T. (2020). Wordnet – a Basic Resource for Natural Language Processing: The Case of plWordNet. In: Hernes, M., Wojtkiewicz, K., Szczerbicki, E. (eds) Advances in Computational Collective Intelligence. ICCCI 2020. Communications in Computer and Information Science, vol 1287. Springer, Cham. https://doi.org/10.1007/978-3-030-63119-2_56

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-63119-2_56

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-63118-5

  • Online ISBN: 978-3-030-63119-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics