Skip to main content

Problems of Inducing Large Coverage Constraint-Based Dependency Grammar for Czech

  • Conference paper
Constraint Solving and Language Processing (CSLP 2004)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3438))

Included in the following conference series:

  • 307 Accesses

Abstract

This article describes an attempt to implement a constraint-based dependency grammar for Czech, a language with rich morphology and free word order, in the formalism Extensible Dependency Grammar (XDG). The grammar rules are automatically inferred from the Prague Dependency Treebank (PDT) and constrain dependency relations, modification frames and word order, including non-projectivity. Although these simple constraints are adequate from the linguistic point of view, their combination is still too weak and allows an exponential number of solutions for a sentence of n words.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Sgall, P., Hajičová, E., Panevová, J.: The Meaning of the Sentence and Its Semantic and Pragmatic Aspects. Academia/Reidel Publishing Company, Prague, Czech Republic/Dordrecht (1986)

    Google Scholar 

  2. Hajič, J., Panevová, J., Buráňová, E., Urešová, Z., Bémová, A.: A Manual for Analytic Layer Tagging of the Prague Dependency Treebank. Technical Report TR-2001-, ÚFAL MFF UK, Prague, Czech Republic (2001) English translation of the original Czech version

    Google Scholar 

  3. Hajičová, E., Panevová, J., Sgall, P.: A Manual for Tectogrammatic Tagging of the Prague Dependency Treebank. Technical Report TR-2000-09, ÚFAL MFF UK, Prague, Czech Republic (2000) (In Czech)

    Google Scholar 

  4. Žabokrtský, Z., Benešová, V., Lopatková, M., Skwarská, K.: Tektogramaticky anotovaný valenční slovník českých sloves. Technical Report TR-2002-15, ÚFAL/CKL, Prague, Czech Republic (2002)

    Google Scholar 

  5. Collins, M., Hajič, J., Brill, E., Ramshaw, L., Tillmann, C.: A Statistical Parser of Czech. In: Proceedings of 37th ACL Conference, pp. 505–512. University of Maryland, College Park, USA (1999)

    Google Scholar 

  6. Zeman, D.: Can Subcategorization Help a Statistical Parser?. In: Proceedings of the 19th International Conference on Computational Linguistics (Coling 2002), Taibei, Tchaj-wan, Zhongyang Yanjiuyuan (Academia Sinica) (2002)

    Google Scholar 

  7. Charniak, E.: A maximum-entropy-inspired parser. In: Proceedings of the 1st Meeting of the North American Chapter of the Association for Computational Linguistics (NAACL-2000), Seattle, Washington, USA, pp. 132–139 (April 2000), http://www.cs.brown.edu/people/ec/papers/

  8. Debusmann, R., Duchier, D., Koller, A., Kuhlmann, M., Smolka, G., Thater, S.: A relational syntax-semantics interface based on dependency grammar. In: Proceedings of COLING 2004, Geneva, Switzerland (2004)

    Google Scholar 

  9. Duchier, D., Debusmann, R.: Topological dependency trees: A constraint-based account of linear precedence. In: 39th Annual Meeting of the Association for Computational Linguistics, ACL 2001 (2001)

    Google Scholar 

  10. Debusmann, R., Duchier, D.: A meta-grammatical framework for dependency grammar (2003)

    Google Scholar 

  11. Bojar, O.: Czech Syntactic Analysis Constraint-Based, XDG: One Possible Start. Prague Bulletin of Mathematical Linguistics (2004)

    Google Scholar 

  12. Holan, T.: K syntaktické analýze českých(!) vět. In: MIS 2003, MATFYZ Press (2003)

    Google Scholar 

  13. Yamada, H., Matsumoto, Y.: Statistical dependency analysis with support vector machines. In: Proceedings of the International Workshop on Parsing Technologies (IWPT 2003), Nancy, France (2003)

    Google Scholar 

  14. Bojar, O.: Towards Automatic Extraction of Verb Frames. Prague Bulletin of Mathematical Linguistics, 101–120 (2003)

    Google Scholar 

  15. Kruijff, G.J.M.: 3-phase grammar learning. In: Proceedings of the Workshop on Ideas and Strategies for Multilingual Grammar Development (2003)

    Google Scholar 

  16. Bech, G.: Studien über das deutsche Verbum infinitum. 2nd unrevised edition published 1983 by Max Niemeyer Verlag, Tübingen (Linguistische Arbeiten 139) (1955)

    Google Scholar 

  17. Holan, T., Kuboň, V., Oliva, K., Plátek, M.: Two Useful Measures of Word Order Complexity. In: Polguere, A., Kahane, S. (eds.) Proceedings of the Coling 1998 Workshop: Processing of Dependency-Based Grammars, University of Montreal, Montreal (1998)

    Google Scholar 

  18. Sarkar, A., Zeman, D.: Automatic Extraction of Subcategorization Frames for Czech. In: Proceedings of the 18th International Conference on Computational Linguistics (Coling 2000), Saarbrücken, Germany, Universität des Saarlandes (2000)

    Google Scholar 

  19. Dubey, A., Keller, F.: Probabilistic parsing for German using sister-head dependencies. In: Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics, Sapporo, pp. 96–103 (2003)

    Google Scholar 

  20. Harper, M.P., Helzerman, R.A., Zoltowski, C.B., Yeo, B.L., Chan, Y., Stewart, T., Pellom, B.L.: Implementation Issues in the Development of the PARSEC Parser. SOFTWARE - Practice and Experience 25, 831–862 (1995)

    Article  Google Scholar 

  21. Dienes, P., Koller, A., Kuhlmann, M.: Statistical a-star dependency parsing. In: Duchier, D. (ed.) Prospects and Advances of the Syntax/Semantics Interface, Nancy, pp. 85–89 (2003)

    Google Scholar 

  22. Heinecke, J., Kunze, J., Menzel, W., Schüder, I.: Eliminative parsing with graded constraints. In: Proceedings of COLING-ACL Conference, Montreal, Canada (1998)

    Google Scholar 

  23. Foth, K., Menzel, W., Schröder, I.: Robust parsing with weighted constraints. Natural Language Engineering (2004) (in press)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Bojar, O. (2005). Problems of Inducing Large Coverage Constraint-Based Dependency Grammar for Czech. In: Christiansen, H., Skadhauge, P.R., Villadsen, J. (eds) Constraint Solving and Language Processing. CSLP 2004. Lecture Notes in Computer Science(), vol 3438. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11424574_6

Download citation

  • DOI: https://doi.org/10.1007/11424574_6

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-26165-0

  • Online ISBN: 978-3-540-31928-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics