Dependency and Phrasal Parsers of the Czech Language: A Comparison

Horák, Aleš; Holan, Tomáš; Kadlec, Vladimír; Kovář, Vojtěch

doi:10.1007/978-3-540-74628-7_12

Aleš Horák¹,
Tomáš Holan²,
Vladimír Kadlec¹ &
…
Vojtěch Kovář¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4629))

Included in the following conference series:

International Conference on Text, Speech and Dialogue

1735 Accesses
2 Citations

Abstract

In the paper, we present the results of an experiment with comparing the effectiveness of real text parsers of Czech language based on completely different approaches – stochastic parsers that provide dependency trees as their outputs and a meta-grammar parser that generates a resulting chart structure representing a packed forest of phrasal derivation trees.

We describe and formulate main questions and problems accompanying such experiment, try to offer answers to these questions and finally display also factual results of the tests measured on 10 thousand Czech sentences.

This work has been partly supported by the Academy of Sciences of Czech Republic under the projects T100300414, T100300419 and 1ET100300517 and by the Ministry of Education of CR within the Center of basic research LC536 and by the Czech Science Foundation under the project 201/05/2781.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Hajič, J.: Building a syntactically annotated corpus: The Prague Dependency Treebank. In: Issues of Valency and Meaning, Prague, Karolinum, pp. 106–132 (1998)
Google Scholar
Horák, A., Kadlec, V.: New meta-grammar constructs in Czech language parser synt. In: Matoušek, V., Mautner, P., Pavelka, T. (eds.) TSD 2005. LNCS (LNAI), vol. 3658, pp. 85–92. Springer, Heidelberg (2005)
Google Scholar
McDonald, R.: Discriminative learning and spanning tree algorithms for dependency parsing. PhD thesis, University of Pennsylvania (2006)
Google Scholar
Hajič, J., Collins, M., Ramshaw, L., Tillmann, C.: A Statistical Parser for Czech. In: Proceedings ACL 1999, Maryland, USA (1999)
Google Scholar
Holan, T., Žabokrtský, Z.: Combining Czech Dependency Parsers. In: Sojka, P., Kopeček, I., Pala, K. (eds.) TSD 2006. LNCS (LNAI), vol. 4188, pp. 95–102. Springer, Heidelberg (2006)
Chapter Google Scholar
Holan, T.: Genetické učení závislostních analyzátorů. In: Sborník semináře ITAT 2005. UPJŠ, Košice (2005)
Google Scholar
Holan, T.: Tvorba závislostního syntaktického analyzátoru. In: Wiil, U.K. (ed.) MIS 2004. LNCS, vol. 3511, Springer, Heidelberg (2005)
Google Scholar
Hall, K., Novák, V.: Corrective modeling for non-projective dependency parsing, 42–51 (2005)
Google Scholar
Nilsson, J., Nivre, J., Hall, J.: Graph transformations in data-driven dependency parsing. In: Proceedings of the 21st Conference on Computational Linguistics and 44th Annual Meeting of the ACL, Sydney, pp. 257–264 (2006)
Google Scholar
Horák, A., Kadlec, V., Smrž, P.: Enhancing best analysis selection and parser comparison. In: Sojka, P., Kopeček, I., Pala, K. (eds.) TSD 2002. LNCS (LNAI), vol. 2448, pp. 461–467. Springer, Heidelberg (2002)
Chapter Google Scholar
Sedláček, R.: Morphemic Analyser for Czech. PhD thesis, Masaryk University (2005)
Google Scholar
Hajič, J.: Disambiguation of Rich Inflection (Computational Morphology of Czech). Karolinum, Charles University Press, Prague, Czech Republic (2004)
Google Scholar
Horák, A., Smrž, P.: Best analysis selection in inflectional languages. In: Proceedings of the 19th international conference on Computational linguistics, Taipei, Taiwan, Association for Computational Linguistics, pp. 363–368 (2002)
Google Scholar
Hajič, J.: Complex Corpus Annotation: The Prague Dependency Treebank, Bratislava, Slovakia, Jazykovedný ústav Ľ. Štúra, SAV (2004)
Google Scholar
Collins, M.: dep2phr – conversion between dependency and phrase structures (1998), http://ufal.mff.cuni.cz/pdt/Utilities/dep2phr/
Bangalore, S., Sarkar, A., Doran, C., Hockey, B.A.: Grammar & parser evaluation in the XTAG project (1998), http://www.cs.sfu.ca/~anoop/papers/pdf/eval-final.pdf
Sampson, G.: A Proposal for Improving the Measurement of Parse Accuracy. International Journal of Corpus Linguistics 5(01), 53–68 (2000)
Article Google Scholar
Sampson, G., Babarczy, A.: A test of the leaf-ancestor metric for parse accuracy. Natural Language Engineering 9(04), 365–380 (2003)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Faculty of Informatics, Masaryk University, Botanická 68a, 602 00 Brno, Czech Republic
Aleš Horák, Vladimír Kadlec & Vojtěch Kovář
Faculty of Mathematics and Physics, Charles University, Malostranské nám. 25, CZ-11800 Prague, Czech Republic
Tomáš Holan

Authors

Aleš Horák
View author publications
You can also search for this author in PubMed Google Scholar
Tomáš Holan
View author publications
You can also search for this author in PubMed Google Scholar
Vladimír Kadlec
View author publications
You can also search for this author in PubMed Google Scholar
Vojtěch Kovář
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Václav Matoušek Pavel Mautner

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Horák, A., Holan, T., Kadlec, V., Kovář, V. (2007). Dependency and Phrasal Parsers of the Czech Language: A Comparison. In: Matoušek, V., Mautner, P. (eds) Text, Speech and Dialogue. TSD 2007. Lecture Notes in Computer Science(), vol 4629. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74628-7_12

Download citation

DOI: https://doi.org/10.1007/978-3-540-74628-7_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-74627-0
Online ISBN: 978-3-540-74628-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Dependency and Phrasal Parsers of the Czech Language: A Comparison