Abstract
Morphological analysis of highly inflected languages like Amharic is a non-trivial task because of the complexity of their morphology. In this paper, we propose a supervised, data-driven experimental approach to developing an Amharic morphological analyzer. We use a memory-based supervised machine learning method, which extrapolates the classes of new, unseen instances from previous examples stored in memory. We treat morphological analysis as a classification task that retrieves the grammatical functions and properties of morphologically inflected words. Because the task is geared towards analyzing vowelled, inflected Amharic words together with the grammatical functions of their morphemes, the morphological structure of words and the way they are represented in memory-based learning are exhaustively investigated. The performance of the model is evaluated using 10-fold cross-validation with the IB1 and IGTree algorithms, resulting in overall accuracies of 93.6% and 82.3%, respectively.
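As a rough illustration of the memory-based classification idea described in the abstract, the sketch below stores character-window instances in memory and labels each position of a new word with the class of its nearest stored neighbours, using a simple overlap distance in the spirit of IB1. Everything concrete here is an assumption made for illustration only: the window size, the label set (`STEM`, `PAST`, `3SG`, `0`), the English toy words, and all function names are hypothetical and are not the authors' feature encoding or data.

```python
from collections import Counter

PAD = "_"  # hypothetical padding symbol for positions outside the word


def windows(word, left=3, right=3):
    """Yield one fixed-width character window per position in the word."""
    padded = PAD * left + word + PAD * right
    for i in range(len(word)):
        yield tuple(padded[i:i + left + 1 + right])


def overlap_distance(a, b):
    """Count the feature positions at which two instances differ."""
    return sum(x != y for x, y in zip(a, b))


def ib1_classify(memory, instance):
    """Return the majority class among all stored instances at the smallest distance."""
    best = min(overlap_distance(feats, instance) for feats, _ in memory)
    votes = Counter(label for feats, label in memory
                    if overlap_distance(feats, instance) == best)
    return votes.most_common(1)[0][0]


def build_memory(examples):
    """Store every (window, label) pair; memory-based learning keeps all examples."""
    memory = []
    for word, labels in examples:
        for feats, label in zip(windows(word), labels):
            memory.append((feats, label))
    return memory


def analyze(memory, word):
    """Classify each character position of a word independently."""
    return [ib1_classify(memory, w) for w in windows(word)]


# Hypothetical toy data: one label per character, "0" means no morpheme starts here.
TRAIN = [
    ("walked", ["STEM", "0", "0", "0", "PAST", "0"]),
    ("talked", ["STEM", "0", "0", "0", "PAST", "0"]),
    ("walks", ["STEM", "0", "0", "0", "3SG"]),
]

memory = build_memory(TRAIN)
print(analyze(memory, "balked"))
```

The unseen word is never matched as a whole; each of its windows is compared with stored windows, so the suffix is recognised from the overlapping context around it. A real system would also need feature weighting and a proper evaluation loop such as the 10-fold cross-validation mentioned above.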
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Abate, M., Assabie, Y. (2014). Development of Amharic Morphological Analyzer Using Memory-Based Learning. In: Przepiórkowski, A., Ogrodniczuk, M. (eds) Advances in Natural Language Processing. NLP 2014. Lecture Notes in Computer Science, vol 8686. Springer, Cham. https://doi.org/10.1007/978-3-319-10888-9_1
DOI: https://doi.org/10.1007/978-3-319-10888-9_1
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-10887-2
Online ISBN: 978-3-319-10888-9
eBook Packages: Computer Science, Computer Science (R0)