Abstract
The main aim of this work is to make improvements to Mirine 2.2, a Korean grammar-checker that takes Korean language properties into account and to develop a system that satisfies user expectations. Particular attention is given to processing Korean texts that may contain users' grammatical and practical errors. In order to treat these errors efficiently, the system includes methods based on asymmetric relations from which partial parsing and the potential governing relationship are derived, implying the starting point for checking, the direction for parsing, and the limits of the scope for parsing.To organize partial parsing efficiently, the system requires an appropriate knowledge base. As its essential prerequisites, this study (a) considers the factors from which the various error types encountered while parsing various Korean texts arise; (b) extracts general patterns from the linguistic or extra-linguistic factors obtained in this manner; and (c) demonstrates how the system, based on a linguistic analysis, procures an adequate knowledge base for partial parsing to satisfy end-users. Mirine 2.2 achieves an F-measure of about 0.80 in detecting unknown erroneous words; an F-measure of about 0.98 when not considering unknown words; and 98.94% precision in correcting erroneous words.
- Allen, J. 1995. Natural Language Understanding. 2nd ed. Benjamin Cummings, Redwood City, CA. Google Scholar
- Chae, Y.S. and KwonH.C. 1991. A dictionary-based morphological analysis, In Proceedings of the NLPRS, 141-147.Google Scholar
- Comrie, B. 1989. Language Universals and Linguistic Typology. Blackwell, Cambridge, UK.Google Scholar
- Cressels, D. 1995. Elements de Syntaxe Generale. PUF, Paris, France.Google Scholar
- Francis, N. W. and Kucera, H. 1982. Frequency Analysis of English Usage: Lexicon and Grammar. Houghton Mifflin, Boston, MA.Google Scholar
- Heidorn, G. E. 2000. Intelligent writing assistance. In Handbook of Natural Language Processing, R. Dale, H. Moisl, and H. Somers, eds. Marcel Dekker, New York, 181-207.Google Scholar
- Johanna, n. 1986. head-marking and dependent-marking grammar. language 62 (1986), 56-119.Google Scholar
- Jurafsky, D. and Martin, J. H. 2000. Speech and Language Processing. Prentice Hall, Englewood Cliffs, NJ. Google Scholar
- Kang, M.Y., Park, S.H., Yoon, A.S. and Kwon, H.C. 2002. Potential governing relationship and a Korean grammar checker using partial parsing. Lecture Note in Computer Science, IEA/AIE, 692-702. Google Scholar
- Kang, S.S. 2002. Korean Morphological Analysis and Information Retrieval (Hangugeo Hyeongtaeso Bunseokgwa Jeongbo Geomsaek). Top Publisher, Seoul, Korea.Google Scholar
- Kim, H.J., Park, D.I. and Kwon, H.C. 1997. Implementation of a Korean spelling and grammar checker using collocation of words and partial analysis of a sentence, In Proceedings of the IASTED, 208-211.Google Scholar
- Kim, M.J., Kwon, H.C., and Yoon, A.S. 1996. Rule-based approach to Korean morphological disambiguation supported by statistical method. In Proceedings of the Eleventh PACLIC, 237-246.Google Scholar
- Kwon, H.C. 1997 Current status and trend of NLP technologies in Korea. In Proceedings of the Korea-France Joint Workshop on Language Industries, 7-11.Google Scholar
- Lee, B.G. 1993. hwagsilhi-wa hagsil-i. Hanjulgi Publisher, Seoul, Korea. LEE, H. B. 1988. Revised Korean spelling system. Korea Journal 28, 4 (1988),. 34-41.Google Scholar
- Lee, H. B. 1988. Revised Korean spelling system. Korea Journal 28, 4 (1988),. 34-41.Google Scholar
- Lee, H.Y. 1996. Korean Phonetics (Gugeo Eumseonghak). Taehaksa Publisher, Seoul, Korea.Google Scholar
- Lim, H.S., Kim, J.D., and Lim, H.C. 1997. Part-of-speech tagging using complemental characteristics of linguistic knowledge and stochastic information. In Proceedings of the Ninth Conference on Hangul and Korean Information Processing, 102-108.Google Scholar
- Melcuk, I. A. 1988. Dependency Syntax: Theory and Practice. State University of New York Press, Albany, NY.Google Scholar
- Ministry of Culture and Tourism 2000. Revised Romanization of Korean http://www.mct.go.kr/english/K_about/Language04.html.Google Scholar
- Nam, K.S. 1998. Standard Korean Grammar (Pyojun Gugeo Munbeomnon). 6th ed., revised and corrected. Top Publisher, Seoul, Korea.Google Scholar
- Seo, K.J. and Choi, K.S. 1993. Korean parser using fuzzy dependency relations between word-phrases, In Proceedings of the Korea Information Science Society Fall Conference, 135-141Google Scholar
- Son, K.J., Hong, Y.K., Lee, J.H. Lee, G.B. 1994. Korean parser for dependency analysis among Eojeols. In Proceedings of the Korea Information Science Society Spring Conference, 135-141.Google Scholar
- Spencer, A. 1991. Morphological Theory. Blackwell, Cambridge, UK.Google Scholar
- Tesniere, L. 1959. Elements de Syntaxe Structurale. 2nd ed., revised and corrected. Klincksieck, Paris. France.Google Scholar
Index Terms
- Improving partial parsing based on error-pattern analysis for a Korean grammar-checker
Recommendations
A parsing algorithm for Korean implemented in Prolog
ACM-SE 44: Proceedings of the 44th annual Southeast regional conferenceThis paper presents a parsing algorithm for a partial free-word-order language, Korean. It is based on dependency grammar, unification-based grammar, and implemented in Prolog. The dependency grammar accepts free order of the words in a sentence. ...
Parsing Arabic using induced probabilistic context free grammar
The importance of the parsing task for NLP applications is well understood. However developing parsers remains difficult because of the complexity of the Arabic language. Most parsers are based on syntactic grammars that describe the syntactic ...
Exploring Self-Training and Co-training for Hindi Dependency Parsing Using Partial Parses
IALP '12: Proceedings of the 2012 International Conference on Asian Language ProcessingIn this paper, we explore the effect of self-training and co-training on Hindi dependency parsing using partial parses. We use Partial parser and apply self-training using a large unannotated corpus. For co-training, we use Malt and MST parser along ...
Comments