Korean Combinatory Categorial Grammar and Statistical Parsing

Computers and the Humanities

Abstract

Korean Combinatory Categorial Grammar (KCCG) is an extended combinatory categorial grammar formalism that captures the syntax and interpretation of relatively free word order, long-distance scrambling, and other characteristics specific to Korean. The KCCG formalism can uniformly handle word order variations among arguments and adjuncts within a clause, as well as in complex clauses and across clause boundaries, i.e., long-distance scrambling. The approach we develop takes advantage of CCG's type raising and composition, along with variable categories and unordered argument modeling, to treat relatively free word order (Lee et al., 1994; Lee et al., 1997). We apply a probability model and heuristics based on Korean characteristics to our KCCG parser. Experiments on various text genres show that the KCCG parser achieves 87.67%/87.03% constituent precision/recall.
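
The idea of unordered argument modeling mentioned in the abstract can be illustrated with a minimal Python sketch. This is not the KCCG implementation: the class `Func`, the helpers `apply_arg` and `combine_verb_final`, and the category labels `NP[nom]` and `NP[acc]` are all hypothetical names chosen for illustration. The sketch assumes a verb-final clause and shows only how a verb whose arguments form a set, rather than an ordered list, accepts its arguments in either order.

```python
from dataclasses import dataclass
from typing import FrozenSet, List, Optional, Union


@dataclass(frozen=True)
class Func:
    """Hypothetical functor category, e.g. S|{NP[nom], NP[acc]}:
    a result category plus an unordered set of argument categories."""
    result: str
    args: FrozenSet[str]


Category = Union[str, Func]


def apply_arg(functor: Category, arg: Category) -> Optional[Category]:
    """Consume one argument from the functor's set, regardless of order."""
    if not isinstance(functor, Func) or not isinstance(arg, str):
        return None
    if arg not in functor.args:
        return None
    remaining = functor.args - {arg}
    # When no arguments are left, the functor reduces to its result category.
    return Func(functor.result, remaining) if remaining else functor.result


def combine_verb_final(args: List[str], verb: Category) -> Optional[Category]:
    """Combine a verb-final clause right to left: the verb absorbs
    each preceding argument in turn."""
    current: Optional[Category] = verb
    for cat in reversed(args):
        current = apply_arg(current, cat)
        if current is None:
            return None
    return current


# Assumed lexical category for a transitive verb (illustrative only).
TRANSITIVE = Func("S", frozenset({"NP[nom]", "NP[acc]"}))

sov = combine_verb_final(["NP[nom]", "NP[acc]"], TRANSITIVE)  # subject-object-verb
osv = combine_verb_final(["NP[acc]", "NP[nom]"], TRANSITIVE)  # scrambled order
bad = combine_verb_final(["NP[nom]", "NP[nom]"], TRANSITIVE)  # wrong arguments
```

Both argument orders reduce to the same result category, while a clause with the wrong arguments is rejected. The sketch uses a plain set, so it cannot represent a verb that takes two arguments of the same category, and it leaves out directionality, type raising, composition, variable categories, and the probability model.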

REFERENCES

  • Ades, A. and M. Steedman. “On the order of words”. Linguistics and Philosophy, 4 (1982), pp. 517–558.

  • Black, E., S. Abney, D. Flickenger, C. Gdaniec, R. Grishman, P. Harrison, D. Hindle, R. Ingria, F. Jelinek, J. Klavans, M. Liberman, M. Marcus, S. Roukos, B. Santorini and T. Strzalkowski. “A Procedure for Quantitatively Comparing the Syntactic Coverage of English Grammars”. Proc. of Fourth DARPA Speech and Natural Language Workshop, 1991.

  • Bozsahin, C. “Deriving the Predicate-Argument Structure for a Free Word Order Language”. Proceedings of COLING-ACL' 98, 1998.

  • Cha, J., G. Lee and J.-H. Lee. “Generalized unknown morpheme guessing for hybrid POS tagging of Korean”. Proceedings of Sixth Workshop on Very Large Corpora in Coling-ACL 98, Montreal, Canada, 1998.

  • Charniak, E. “Parsing with Context-Free Grammars and Word Statistics”. Technical Report CS-95-28, Brown University, 1995.

  • Cho, H. and J.C. Park. “Combinatory Categorial Grammar and Parsing”. Proceedings of the 11th Korean Conference on Language and Information Processing, 1999.

  • Collins, M. “A New Statistical Parser Based on Bigram Lexical Dependencies”. Proceedings of the 34th Annual Meeting of the ACL, Santa Cruz, 1996.

  • Collins, M. “Three Generative, Lexicalised Models for Statistical Parsing”. Proceedings of the 35th Annual Meeting of the ACL, 1997.

  • Han, S.-K. and C.-G. Park. “Combinatory Categorial Grammar for Korean”. Proceedings of the 2nd conference of Korean and Korean Information Processing, 1990.

  • Hindle, D. and M. Rooth. “Structural Ambiguity and Lexical Relations”. Computational Linguistics, 19(1) (1993), pp. 103–120.

  • Hoffman, B. “A CCG Approach to Free Word Order Languages”. Proceedings of the 30th Annual Meeting of ACL, Student Session, 1992.

  • Hoffman, B. “The Formal Consequences of using Variables in CCG Categories”. Proceedings of the 31st Annual Meeting of ACL, 1993.

  • Hoffman, B. “Integrating ‘Free’ Word Order Syntax and Information Structure”. Proceedings of the European Chapter of the Association for Computational Linguistics, Dublin, 1995a.

  • Hoffman, B. The Computational Analysis of the Syntax and Interpretation of “Free” Word Order in Turkish, Ph.D. thesis, University of Pennsylvania, IRCS Report 95-17, 1995b.

  • Kim, C., J.H. Kim, J. Seo and G.C. Kim. “A Right-to-Left Chart Parsing with Headable Paths for Korean Dependency Grammar”. Computer Processing of Chinese and Oriental Languages, 8, supplement, 1994.

  • Lee, K.J. “Probabilistic Parsing using Structural Preference and Head-Head Co-occurrence”. Ph.D. thesis, KAIST, Korean, 1997.

  • Lee, K.J. and K.C. Kim. “Tree Transformation Rules for Korean Lexicalized Multi-Component TAG Parser”. Joint Conference on Intelligence Technology, Korean, 1995.

  • Lee, N.-S. Case and Case marker, Weol-In, 1998.

  • Lee, W., G. Lee and J. Lee. “Table-driven neural syntactic analysis of Korean”. Proceedings of the Coling–94, Kyoto, Japan, 1994, pp. 911–915.

  • Lee, W., G. Lee and J. Lee. “Chart-driven connectionist categorial parsing of spoken Korean”. Computer processing of Oriental languages, 10(2) (1996), pp. 147–159.

  • Lee, W., G. Lee and J. Lee. “Morpho-syntactic modeling of Korean with a categorial grammar”. Proceedings of the natural language processing pacific-rim symposium, Phuket, Thailand, 1997, pp. 545–548.

  • Li, C. and S. Thompson. Subject and Topic: A New Typology of Language, Academic Press, NY, 1996.

  • Magerman, D.M. and M.P. Marcus. “Parsing the Voyager Domain Using Pearl”. Proc. of the DARPA Speech and Natural Language Workshop, 1991, pp. 231–236.

  • Magerman, D.M. and C. Weir. “Efficiency, Robustness and Accuracy in Picky Chart Parsing”. Proc. of the 30th Annual Meeting of the Assoc. for Computational Linguistics (ACL–92), 1992, pp. 40–47.

  • Seo, J. Korean Grammar, Hanyang university press, Korean, 1996.

  • Son, S. Research of Korean auxiliary verb, Korean Culture Press, Seoul, Korean, 1996.

  • Steedman, M. “Dependency and Coordination in the Grammar of Dutch and English”. Language, 61 (1985), pp. 523–568.

  • Steedman, M. “Combinatory grammars and parasitic gaps”. Natural Language and Linguistic Theory, 5 (1987), pp. 403–439.

Cite this article

Cha, J., Lee, G. & Lee, J. Korean Combinatory Categorial Grammar and Statistical Parsing. Computers and the Humanities 36, 431–453 (2002). https://doi.org/10.1023/A:1020260012525

  • DOI: https://doi.org/10.1023/A:1020260012525
