Abstract
This article describes an attempt to implement a constraint-based dependency grammar for Czech, a language with rich morphology and free word order, in the formalism Extensible Dependency Grammar (XDG). The grammar rules are automatically inferred from the Prague Dependency Treebank (PDT) and constrain dependency relations, modification frames and word order, including non-projectivity. Although these simple constraints are adequate from the linguistic point of view, their combination is still too weak and allows an exponential number of solutions for a sentence of n words.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Sgall, P., Hajičová, E., Panevová, J.: The Meaning of the Sentence and Its Semantic and Pragmatic Aspects. Academia/Reidel Publishing Company, Prague, Czech Republic/Dordrecht (1986)
Hajič, J., Panevová, J., Buráňová, E., Urešová, Z., Bémová, A.: A Manual for Analytic Layer Tagging of the Prague Dependency Treebank. Technical Report TR-2001-, ÚFAL MFF UK, Prague, Czech Republic (2001) English translation of the original Czech version
Hajičová, E., Panevová, J., Sgall, P.: A Manual for Tectogrammatic Tagging of the Prague Dependency Treebank. Technical Report TR-2000-09, ÚFAL MFF UK, Prague, Czech Republic (2000) (In Czech)
Žabokrtský, Z., Benešová, V., Lopatková, M., Skwarská, K.: Tektogramaticky anotovaný valenční slovník českých sloves. Technical Report TR-2002-15, ÚFAL/CKL, Prague, Czech Republic (2002)
Collins, M., Hajič, J., Brill, E., Ramshaw, L., Tillmann, C.: A Statistical Parser of Czech. In: Proceedings of 37th ACL Conference, pp. 505–512. University of Maryland, College Park, USA (1999)
Zeman, D.: Can Subcategorization Help a Statistical Parser?. In: Proceedings of the 19th International Conference on Computational Linguistics (Coling 2002), Taibei, Tchaj-wan, Zhongyang Yanjiuyuan (Academia Sinica) (2002)
Charniak, E.: A maximum-entropy-inspired parser. In: Proceedings of the 1st Meeting of the North American Chapter of the Association for Computational Linguistics (NAACL-2000), Seattle, Washington, USA, pp. 132–139 (April 2000), http://www.cs.brown.edu/people/ec/papers/
Debusmann, R., Duchier, D., Koller, A., Kuhlmann, M., Smolka, G., Thater, S.: A relational syntax-semantics interface based on dependency grammar. In: Proceedings of COLING 2004, Geneva, Switzerland (2004)
Duchier, D., Debusmann, R.: Topological dependency trees: A constraint-based account of linear precedence. In: 39th Annual Meeting of the Association for Computational Linguistics, ACL 2001 (2001)
Debusmann, R., Duchier, D.: A meta-grammatical framework for dependency grammar (2003)
Bojar, O.: Czech Syntactic Analysis Constraint-Based, XDG: One Possible Start. Prague Bulletin of Mathematical Linguistics (2004)
Holan, T.: K syntaktické analýze českých(!) vět. In: MIS 2003, MATFYZ Press (2003)
Yamada, H., Matsumoto, Y.: Statistical dependency analysis with support vector machines. In: Proceedings of the International Workshop on Parsing Technologies (IWPT 2003), Nancy, France (2003)
Bojar, O.: Towards Automatic Extraction of Verb Frames. Prague Bulletin of Mathematical Linguistics, 101–120 (2003)
Kruijff, G.J.M.: 3-phase grammar learning. In: Proceedings of the Workshop on Ideas and Strategies for Multilingual Grammar Development (2003)
Bech, G.: Studien über das deutsche Verbum infinitum. 2nd unrevised edition published 1983 by Max Niemeyer Verlag, Tübingen (Linguistische Arbeiten 139) (1955)
Holan, T., Kuboň, V., Oliva, K., Plátek, M.: Two Useful Measures of Word Order Complexity. In: Polguere, A., Kahane, S. (eds.) Proceedings of the Coling 1998 Workshop: Processing of Dependency-Based Grammars, University of Montreal, Montreal (1998)
Sarkar, A., Zeman, D.: Automatic Extraction of Subcategorization Frames for Czech. In: Proceedings of the 18th International Conference on Computational Linguistics (Coling 2000), Saarbrücken, Germany, Universität des Saarlandes (2000)
Dubey, A., Keller, F.: Probabilistic parsing for German using sister-head dependencies. In: Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics, Sapporo, pp. 96–103 (2003)
Harper, M.P., Helzerman, R.A., Zoltowski, C.B., Yeo, B.L., Chan, Y., Stewart, T., Pellom, B.L.: Implementation Issues in the Development of the PARSEC Parser. SOFTWARE - Practice and Experience 25, 831–862 (1995)
Dienes, P., Koller, A., Kuhlmann, M.: Statistical a-star dependency parsing. In: Duchier, D. (ed.) Prospects and Advances of the Syntax/Semantics Interface, Nancy, pp. 85–89 (2003)
Heinecke, J., Kunze, J., Menzel, W., Schüder, I.: Eliminative parsing with graded constraints. In: Proceedings of COLING-ACL Conference, Montreal, Canada (1998)
Foth, K., Menzel, W., Schröder, I.: Robust parsing with weighted constraints. Natural Language Engineering (2004) (in press)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Bojar, O. (2005). Problems of Inducing Large Coverage Constraint-Based Dependency Grammar for Czech. In: Christiansen, H., Skadhauge, P.R., Villadsen, J. (eds) Constraint Solving and Language Processing. CSLP 2004. Lecture Notes in Computer Science(), vol 3438. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11424574_6
Download citation
DOI: https://doi.org/10.1007/11424574_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-26165-0
Online ISBN: 978-3-540-31928-3
eBook Packages: Computer ScienceComputer Science (R0)