Abstract
Analyzing sentences of Korean language, it is found that situation, meaning, and context perform an important role rather than syntactical characteristics. Thus it is difficult to disambiguate word sense by rule-based method, such as context-free grammar, only. In this study, sense-tagged corpora was semi-automatically constructed with the use of predicate-based sub-categorization dictionary. In this process, the information on the frequency of predicate-based sub-categorization patterns, the information on the collocation of predicates and nouns, and the information on the statistic cooccurence of declinable words could be obtained. Based on this information, the method of automatic extension of sub-categorization dictionary is suggested.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Woo, Y.: Constructing a Korean Sub-categorization Dictionary with Semantic Roles using Thesaurus and Predicate Patterns, Paper of Information and Science Association (June 2000)
Fujii, A.: Corpus-Based Word Sense Disambiguation, Tokyo Institute of Technology (July 1998)
Seo, Y.: Development of Korean analyzer based on the token-establishment of Korean semantic analysis dictionary and sub-categorization dictionary, Report of Korea Electronic Telecommunication Research Institute (1998)
Choo, K.: Korean Lexical Sense Analysis for the Concept-Based Information Retrieval, Paper of master’s degree in Incheon university (December 1998)
Wilks, Y., Stevenson, M.: Sense Tagging: Semantic Tagging with a Lexicon. Computational Linguistics (May 1997)
Poesio, M.: Semantic Ambiguity and Perceived Ambiguity. Computational Linguistics (May 1995)
Briscoe, T., Carroll, J.: Automatic Extraction of Sub-categorization from Copus. Computational Linguistics (February 1997)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Choo, K., Kang, S., Min, H., Woo, Y. (2004). Automatic Extension of Korean Predicate-Based Sub-categorization Dictionary from Sense Tagged Corpora. In: Laganá, A., Gavrilova, M.L., Kumar, V., Mun, Y., Tan, C.J.K., Gervasi, O. (eds) Computational Science and Its Applications – ICCSA 2004. ICCSA 2004. Lecture Notes in Computer Science, vol 3045. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24767-8_61
Download citation
DOI: https://doi.org/10.1007/978-3-540-24767-8_61
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22057-2
Online ISBN: 978-3-540-24767-8
eBook Packages: Springer Book Archive