Abstract
The paper deals with automatic methods for prefix extraction and their comparison. We present experiments with Czech and English and compare the results with regard to the size and type (wordforms vs. lemmas) of input data.
This paper is a result of the projects supported by the grants number P406/2010/0875 and P202/10/1333 of the Grant Agency of the Czech republic (GAČR), and the grant MSM 0021620838 of the Czech Ministry of Education.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Urrea, A.M.: Automatic discovery of affixes by means of a corpus: A catalog of spanish affixes. Journal of Quantitative Linguistics 7, 97–114 (2000)
Hrušecký, M.: Affisix, http://affisix.sf.net
Hrušecký, M., Hlaváčová, J.: Automatické rozpoznávání předpon a přípon s pomocí nástroje affisix. In: Pardubská, D. (ed.) Informačné technológie Aplikácie a Teória, Zborník príspevkov prezentovaných na konferencii ITAT, Seňa, Slovakia, PONT s. r. o, pp. 63–67 (2010)
Bojar, O., Straňák, P., Zeman, D., Jain, G., Hrušecký, M., Richter, M., Hajič, J.: English-hindi translation obtaining mediocre results with bad data and fancy models. In: Sharma, D., Varma, V., Sangal, R. (eds.) Proceedings of ICON 2009: 7th International Conference on Natural Language Processing, Hyderabad, India, NLP Association of India, pp. 316–321. Macmillan Publishers, India (2009)
Hlaváčová, J., Hrušecký, M.: “affisix” tool for prefix recognition. In: Sojka, P., Horák, A., Kopeček, I., Pala, K. (eds.) TSD 2008. LNCS (LNAI), vol. 5246, pp. 85–92. Springer, Heidelberg (2008)
Ústav Českého národního korpusu FF UK: Český národní korpus - syn2000, syn2005, syn2010 (2000), http://ucnk.ff.cuni.cz
Oxford University Computing Services on behalf of the BNC Consortium: The british national corpus (2007), http://www.natcorp.ox.ac.uk
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Hlaváčová, J., Hrušecký, M. (2011). Prefix Recognition Experiments. In: Habernal, I., Matoušek, V. (eds) Text, Speech and Dialogue. TSD 2011. Lecture Notes in Computer Science(), vol 6836. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23538-2_30
Download citation
DOI: https://doi.org/10.1007/978-3-642-23538-2_30
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-23537-5
Online ISBN: 978-3-642-23538-2
eBook Packages: Computer ScienceComputer Science (R0)