Abstract
The paper presents Toposław, a tool for creating an electronic dictionary of multi-word proper names. Toposław uses graphs to represent inflectional and pragmatic variants of names. It cooperates with Morfeusz, a morphological analyser and generator for Polish words, and Multiflex, a cross-language morpho-syntactic generator of multi-word units. Our goal was to create a user-friendly tool that makes lexicographic work easy and efficient. In the paper we describe facilities for graph creation, management and debugging. The presented tool was applied to create a dictionary of Warsaw urban proper names.
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Marciniak, M., Savary, A., Sikora, P., Woliński, M. (2011). Toposław – A Lexicographic Framework for Multi-word Units. In: Vetulani, Z. (eds) Human Language Technology. Challenges for Computer Science and Linguistics. LTC 2009. Lecture Notes in Computer Science, vol 6562. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-20095-3_13
DOI: https://doi.org/10.1007/978-3-642-20095-3_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-20094-6
Online ISBN: 978-3-642-20095-3
eBook Packages: Computer Science, Computer Science (R0)