Abstract
One of the most important problems of the morphological analysis is processing of unknown words. This paper proposes to use morpheme and character features to relieve the problem of the unknown words without decreasing of the precision for the known words. We used the maximum entropy method which is flexible to the information of the morphemes and the characters. The experiments revealed that both the morpheme and character features are effective for Chinese morphological analysis and the character features are useful for the processing of the unknown words.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Berger, A.L., Pietra, V.J.D., Pietra, S.A.D.: A maximum entropy approach to natural language processing. Computational Linguistics 22(1), 39–71 (1996)
Nakagawa, T.: Chinese and japanese word segmentation using word-level and character-level information. In: Proceedings of the 20th International Conference on Computational Linguistics (COLING 2004), pp. 466–472 (2004)
Oda, H., Mori, S., Kita, K.: A japanese word segmenter by a character class model. Journal of Natural Language Processing 6(7), 93–108 (1999)
Takeuchi, K., Matsumoto, Y.: Hmm parameter learning for japanese morphological analyzer. In: Proceedings of the 10th Pacific Asia Conference on Language Information and Computation (PACLIC 10), pp. 163–172 (1995)
Uchimoto, K., Sekine, S., Isahara, H.: The unknown word problem: a morphological analysis of japanese using maximum entropy aided by a dictionary. In: Proceedings of the 2001 Conference on Empirical Methods in Natural Language Processing, pp. 91–99 (2001)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Komiya, K., Hou, H., Shibahara, K., Fujimoto, K., Kotani, Y. (2012). Chinese Morphological Analysis Using Morpheme and Character Features. In: Anthony, P., Ishizuka, M., Lukose, D. (eds) PRICAI 2012: Trends in Artificial Intelligence. PRICAI 2012. Lecture Notes in Computer Science(), vol 7458. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32695-0_87
Download citation
DOI: https://doi.org/10.1007/978-3-642-32695-0_87
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-32694-3
Online ISBN: 978-3-642-32695-0
eBook Packages: Computer ScienceComputer Science (R0)