Abstract
Morphological analysis is a fundamental technique in Japanese natural language processing, and many methods have been proposed for it. The task of Japanese morphological analysis is essentially word segmentation. In this study, we propose a new method for Japanese word segmentation. Our method treats word segmentation as a classification problem and solves it with the decision list method. One advantage of our method is that it avoids the unknown word problem, because it is character based. Another advantage is that it is deterministic, and the time taken for the analysis is proportional to the length of the sentence. Moreover, our approach can use a variety of features to solve the classification problem, and it can be combined with a variety of machine learning methods.
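The idea of segmentation as classification can be illustrated with a minimal sketch. It is not the paper's implementation: the feature set (left character, right character, and their bigram), the smoothing constant `alpha`, the ranking of rules by absolute log-likelihood ratio (a common choice for decision lists), and the toy corpus are all assumptions made for illustration. Each gap between two adjacent characters is classified as "boundary" or "no boundary" by the strongest matching rule, so a sentence is segmented in a single left-to-right pass.

```python
import math
from collections import defaultdict


def boundary_labels(words):
    """Return (sentence, labels); labels[i] is True if a word ends after character i."""
    sentence = "".join(words)
    labels = []
    for w in words:
        labels.extend([False] * (len(w) - 1) + [True])
    return sentence, labels[:-1]  # no decision is needed after the last character


def features(sentence, i):
    """Hypothetical features for the gap between sentence[i] and sentence[i + 1]."""
    left, right = sentence[i], sentence[i + 1]
    return [("L", left), ("R", right), ("LR", left + right)]


def train(corpus, alpha=0.5):
    """Build a decision list: rules sorted by |log-likelihood ratio|, plus a default label."""
    counts = defaultdict(lambda: [0, 0])  # feature -> [no-boundary count, boundary count]
    total = [0, 0]
    for words in corpus:
        sentence, labels = boundary_labels(words)
        for i, is_boundary in enumerate(labels):
            total[is_boundary] += 1
            for f in features(sentence, i):
                counts[f][is_boundary] += 1
    rules = []
    for f, (neg, pos) in counts.items():
        score = math.log((pos + alpha) / (neg + alpha))  # smoothed log-likelihood ratio
        rules.append((abs(score), f, score > 0))
    rules.sort(key=lambda rule: rule[0], reverse=True)  # strongest evidence first
    return rules, total[1] >= total[0]


def segment(sentence, rules, default):
    """Classify every character gap with the highest-ranked matching rule."""
    if not sentence:
        return []
    # Map each feature to (rank, label) so that each gap costs a constant number of lookups.
    table = {f: (rank, label) for rank, (_, f, label) in enumerate(rules)}
    words, start = [], 0
    for i in range(len(sentence) - 1):
        matches = [table[f] for f in features(sentence, i) if f in table]
        is_boundary = min(matches)[1] if matches else default
        if is_boundary:
            words.append(sentence[start:i + 1])
            start = i + 1
    words.append(sentence[start:])
    return words


# Toy training corpus (assumed for illustration): pre-segmented sentences.
corpus = [["私", "は", "学生", "です"], ["彼", "は", "先生", "です"]]
rules, default = train(corpus)
print(segment("私は先生です", rules, default))
print(segment("猫は学生です", rules, default))  # "猫" never appears in the training data
```

Because every decision depends only on the characters around a gap, the sketch needs no word dictionary, and an unseen character such as "猫" is still handled by rules that match its neighbours. Each gap is resolved with a fixed number of table lookups, so the running time grows linearly with sentence length.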
Copyright information
© 2000 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Shinnou, H. (2000). Deterministic Japanese Word Segmentation by Decision List Method. In: Mizoguchi, R., Slaney, J. (eds) PRICAI 2000 Topics in Artificial Intelligence. PRICAI 2000. Lecture Notes in Computer Science, vol 1886. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44533-1_111
DOI: https://doi.org/10.1007/3-540-44533-1_111
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-67925-7
Online ISBN: 978-3-540-44533-3
eBook Packages: Springer Book Archive