Abstract
State-of-the-art rule-based tools for morphological disambiguation use either manually crafted rules or rules learnt from manually annotated data. This paper presents a new method of learning rules for morphological disambiguation using only unannotated data. The inductive logic programming and active learning are employed. The induced rules display very promising acurracy. Also the probable limitations of the proposed method are discussed.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Hajič, J., Krbec, P., Květoň, P., Oliva, K., Petkevič, V.: Serial Combination of Rules and Statistics: A Case Study in Czech Tagging. In: Proceedings of ACL/EACL, New Brunswick, Association for Computational Linguistics (2001)
Kohn, D., Atlas, L., Ladner, R.: Improving Generalization with Active Learning. Machine Learning 15, 201–221 (1994)
Stephen, H.: Muggleton and Luc De Raedt. Inductive Logic Programming: Theory and Methods. Journal of Logic Programming 19,20, 629–679 (1994)
Nepil, M., Popelínský, L.: Part-of-speech Tagging by Means of ILP and Active Learning. In: Proceedings of the Workshop on Instance Selection at ECML/PKDD 2001, Freiburg, Department of Computer Science, Albert-Ludwigs University (2001)
Nepil, M., Popelínský, L., Žáčková, E.: Part-of-Speech Tagging by Means of Shallow Parsing, ILP and Active Learning. In: Proceedings of the Third Workshop on Learning Language in Logic, Strasbourg (2001)
Nepil, M.: Relational Rule Induction for Natural Language Disambiguation. Ph.D. thesis, Faculty of Informatics, Masaryk University, Brno (2003)
Pala, K., Rychlý, P., Smrž, P.: DESAM – annotated corpus for Czech. In: Jeffery, K. (ed.) SOFSEM 1997. LNCS, vol. 1338, Springer, Heidelberg (1997)
Popelínský, L., Pavelek, T., Ptáčník, T.: Towards Disambiguation in Czech Corpora. In: Proceedings of the 1st Workshop on Learning Language in Logic, Bled, Slovenia (1999)
Sedláček, R., Smrž, P.: A New Czech Morphological Analyser ajka. In: Matoušek, V., Mautner, P., Mouček, R., Tauser, K. (eds.) TSD 2001. LNCS (LNAI), vol. 2166, Springer, Heidelberg (2001)
Žáčková, E.: Partial syntactic analysis (of Czech). Ph.D. thesis (in Czech), Faculty of Informatics, Masaryk University, Brno (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Šmerk, P. (2004). Unsupervised Learning of Rules for Morphological Disambiguation. In: Sojka, P., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2004. Lecture Notes in Computer Science(), vol 3206. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30120-2_27
Download citation
DOI: https://doi.org/10.1007/978-3-540-30120-2_27
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23049-6
Online ISBN: 978-3-540-30120-2
eBook Packages: Springer Book Archive