Abstract
This paper deals with the automatic discrimination of contexts of Czech ambiguous words. The Schütze’s methodology was used, modified and transformed for the Czech language. This algorithm is based on vector space and clustering. The semantic discrimination could be understood as a subtask of word sense disambiguation. In this approach, the sense of word is defined as the cluster of contexts of ambiguous word. We show that Schütze’s method is transportable into Czech. Our results are not as good as his because we have experimented with a highly ambiguous word.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Brown, P. F. et al.:Word Sense Disambiguation using statistical methods, In Proc. of the 29th Annual Meeting, Berkeley, pp. 264–270, 1991.
Čermák F.: Czech National Corpus: Its Character, Goal and Background, In: Proc. of Workshop on TSD 1999, Springer, Pilsen, 1999.
Ide, N., Véronis, J.: Introduction to the Special Issue on Word Sense Disambiguation: The State of the Art, Computational Linguistics, Vol. 24, Num. 1, 1998.
Manning Ch.D., Schütze H.: Foundations of Statistical Natural Language Processing, The MIT Press, Cambridge, Massachusetts, 1999.
Sedláček R., Smrž P.: A New Czech Morphological Analyser ajka, In: Proceedings of the 4th Workshop on Text, Speech and Dialogue — TSD 2001, Berlin, 2001.
Schütze H.: Automatic Word Sense Discrimination, [3], p. 97–123.
Sussna M.:Word Sense Disambiguation for Free-text Indexing Using a Massive Semantic Network, Proc. of the 2nd International Conference on Information and Knowledge Management, Arlington, 1993.
Vossen, P., et al.: Set of Common Base Concepts in EuroWordNet-2, Final Report, 2D001, Amsterdam, October 1988.
Wilks Y., Stevenson M.: Sense Tagging: Semantic Tagging with a Lexicon, Proceedings of the SIGLEX Workshop on Tagging Text with Lexical Semantics: Why, What and How?, Washington, D.C., 1997.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Král, R. (2002). Word Sense Discrimination for Czech. In: Sojka, P., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2002. Lecture Notes in Computer Science(), vol 2448. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-46154-X_21
Download citation
DOI: https://doi.org/10.1007/3-540-46154-X_21
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44129-8
Online ISBN: 978-3-540-46154-8
eBook Packages: Springer Book Archive