Abstract
We adapt the cognitively-oriented morphology acquisition model proposed in (Chan 2008) to perform morphological analysis, extending its concept of base-derived relationships to allow multi-step derivations and adding features required for robustness on noisy corpora. This results in a rule-based morphological analyzer which attains an F-score of 58.48% in English and 33.61% in German in the Morpho Challenge 2009 Competition 1 evaluation. The learner’s performance shows that acquisition models can effectively be used in text-processing tasks traditionally dominated by statistical approaches.
Thanks to Jana Beck for her assistance in analyzing the German results and for her insightful comments throughout the development process. Portions of this paper were adapted from the material presented in the CLEF 2009 Morpho Challenge Workshop (Lignos et al. 2009).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Brent, M.R., Murthy, S.K., Lundberg, A.: Discovering morphemic suffixes: A case study in minimum description length induction. In: Proceedings of the Fifth International Workshop on AI and Statistics (1995)
Can, B., Manandhar, S.: Unsupervised Learning of Morphology by Using Syntactic Categories. In: Working Notes of the 10th Workshop of the Cross-Language Evaluation Forum, CLEF 2009, Corfu, Greece, September 30–October 2 (2009)
Chan, E.: Structures and distributions in morphology learning. PhD Thesis, University of Pennsylvania (2008)
Creutz, M., Lagus, K.: Unsupervised Morpheme Segmentation and Morphology Induction from Text Corpora Using Morfessor 1.0. Publications in Computer and Information Science, Report A81, Helsinki University of Technology (March 2005)
Goldsmith, J.: Unsupervised learning of the morphology of a natural language. Computational Linguistics 27(2), 153–198 (2001)
Halle, M., Marantz, A.: Distributed morphology and the pieces of inflection. The view from Building 20, 111–176 (1993)
Harris, Z.S.: From phoneme to morpheme. Language, 190–222 (1955)
Keshava, S., Pitler, E.: A simpler, intuitive approach to morpheme induction. In: Proceedings of 2nd Pascal Challenges Workshop, pp. 31–35 (2006)
Lignos, C., Chan, E., Marcus, M.P., Yang, C.: A Rule-Based Unsupervised Morphology Learning Framework. In: Working Notes of the 10th Workshop of the Cross-Language Evaluation Forum, CLEF 2009, Corfu, Greece, September 30–October 2 (2009)
Monson, C.: ParaMor: from Paradigm Structure to Natural Language Morphology Induction. PhD Thesis, Carnegie Mellon University
Parkes, C.H., Malek, A.M., Marcus, M.P.: Towards Unsupervised Extraction of Verb Paradigms from Large Corpora. In: Proceedings of the Sixth Workshop on Very Large Corpora, Montreal, Quebec, Canda, August 15-16 (1998)
Pinker, S.: Words and rules: The ingredients of language. Basic Books, New York (1999)
Rumelhart, D.E., McClelland, J.L.: Parallel distributed processing: Explorations in the microstructure of cognition. Psychological and biological models, vol. 2. MIT Press, Cambridge (1986)
Spiegler, S., Golnia, B., Flach, P.: PROMODES: A probabilistic generative model for word decomposition. In: Working Notes of the 10th Workshop of the Cross-Language Evaluation Forum, CLEF 2009, Corfu, Greece, September 30–October 2 (2009)
Wicentowski, R.: Modeling and Learning Multilingual Inflectional Morphology in a Minimally Supervised Framework. Ph. D. thesis, Johns Hopkins University (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lignos, C., Chan, E., Marcus, M.P., Yang, C. (2010). A Rule-Based Acquisition Model Adapted for Morphological Analysis. In: Peters, C., et al. Multilingual Information Access Evaluation I. Text Retrieval Experiments. CLEF 2009. Lecture Notes in Computer Science, vol 6241. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15754-7_79
Download citation
DOI: https://doi.org/10.1007/978-3-642-15754-7_79
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15753-0
Online ISBN: 978-3-642-15754-7
eBook Packages: Computer ScienceComputer Science (R0)