Inflection of Polish Multi-Word Proper Names with Morfeusz and Multiflex

  • Chapter
Aspects of Natural Language Processing

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 5070)

Abstract

We discuss morphological properties of Polish multi-word proper names. We present a cooperating framework of two morphological tools: Morfeusz, a morphological analyser and generator for Polish simple words, and Multiflex, a cross-language morpho-syntactic generator of multi-word units. We discuss interface constraints required for the interoperability of these tools, and we show how the resulting platform allows one to describe the morpho-syntactic behaviour of some interesting examples of Warsaw multi-word toponyms.

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Savary, A., Rabiega-Wiśniewska, J., Woliński, M. (2009). Inflection of Polish Multi-Word Proper Names with Morfeusz and Multiflex. In: Marciniak, M., Mykowiecka, A. (eds) Aspects of Natural Language Processing. Lecture Notes in Computer Science, vol 5070. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04735-0_5

  • DOI: https://doi.org/10.1007/978-3-642-04735-0_5

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-04734-3

  • Online ISBN: 978-3-642-04735-0

  • eBook Packages: Computer Science, Computer Science (R0)
