Abstract
In the paper, we present a software tool Affisix for automatic recognition of prefixes. On the basis of an extensive list of words in a language, it determines the segments – candidates for prefixes. There are two methods implemented for the recognition – the entropy method and the squares method. We briefly describe the methods, propose their improvements and present the results of experiments with Czech.
This paper is a result of the projects supported by the grants of the Czech Academy of Sciences 1ET101120503 and 1ET101120413.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Hlaváčová, J.: Morphological Guesser of Czech Words. In: Matoušek, V. (ed.) Proc. TSD 2001, pp. 70–75. Springer, Berlin (2001)
Ústav Českého národního korpusu FF UK: Český národní korpus – Syn2000 (2000), http://ucnk.ff.cuni.cz
Urrea, A.M.: Automatic discovery of affixes by means of a corpus: A catalog of spanish affixes. Journal of Quantitative Linguistics 7, 97–114 (2000)
Urrea, A.M., Hlaváčová, J.: Automatic Recognition of Czech Derivational Prefixes. In: Gelbukh, A. (ed.) CICLing 2005. LNCS, vol. 3406, pp. 189–197. Springer, Heidelberg (2005)
Hrušecký, M.: Affisix, http://affisix.sf.net
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Hlaváčová, J., Hrušecký, M. (2008). Affisix: Tool for Prefix Recognition. In: Sojka, P., Horák, A., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2008. Lecture Notes in Computer Science(), vol 5246. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-87391-4_13
Download citation
DOI: https://doi.org/10.1007/978-3-540-87391-4_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-87390-7
Online ISBN: 978-3-540-87391-4
eBook Packages: Computer ScienceComputer Science (R0)