New Methods for Pruning and Ordering of Syntax Parsing Trees

Kovář, Vojtěch; Horák, Aleš; Kadlec, Vladimír

doi:10.1007/978-3-540-87391-4_18

Vojtěch Kovář¹,
Aleš Horák¹ &
Vladimír Kadlec¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5246))

Included in the following conference series:

International Conference on Text, Speech and Dialogue

965 Accesses
1 Citations

Abstract

Most robust rule-based syntax parsing techniques face the problem of high number of possible syntax trees as the output. There are two possible solutions to this: either release the request for robustness and provide special rules for uncovered phenomena, or equip the parser with filtering and ordering techniques. We describe the implementation and evaluation of the latter approach. In this paper, we present new techniques of pruning and ordering the resulting syntax trees in the Czech parser synt. We describe the principles of the methods and present results of measurements of effectiveness of these methods both per method and in combination, as computed for 10,000 corpus sentences.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Baumann, S., Brinckmann, C., Hansen-Schirra, S., et al.: The muli project: Annotation and analysis of information structure in German and English. In: Proceedings of the LREC 2004 Conference, Lisboa, Portugal (2004)
Google Scholar
Radford, A.: Minimalist Syntax. Cambridge University Press, Chicago (1993)
Google Scholar
Horák, A., Smrž, P.: Best analysis selection in inflectional languages. In: Proceedings of the 19^th international conference on Computational linguistics, Taipei, Taiwan, pp. 363–368. Association for Computational Linguistics (2002)
Google Scholar
Jaeger, T., Gerassimova, V.: Bulgarian word order and the role of the direct object clitic in LFG. In: Butt, M., King, T. (eds.) Proceedings of the LFG02 Conference. CSLI Publications, Stanford (2002)
Google Scholar
Hoffman, B.: The Computational Analysis of the Syntax and Interpretation of Free Word Order in Turkish. Ph.D. thesis, University of Pennsylvania, Philadelphia (1995)
Google Scholar
Horák, A., Kadlec, V.: New Meta-grammar Constructs in Czech Language Parser synt. In: Proceedings of Text, Speech and Dialogue 2005, Karlovy Vary, Czech Republic. LNCS (LNAI), pp. 85–92. Springer, Heidelberg (2005)
Google Scholar
Horák, A., Kadlec, V., Smrž, P.: Enhancing Best Analysis Selection and Parser Comparison. In: Sojka, P., Kopeček, I., Pala, K. (eds.) TSD 2002. LNCS (LNAI), vol. 2448, pp. 461–467. Springer, Heidelberg (2002)
Chapter Google Scholar
Kovář, V., Horák, A.: Reducing the Number of Resulting Parsing Trees for the Czech Language Using the Beautified Chart Method. In: Proceedings of 3^rd Language and Technology Conference, Poznań, Wydawnictwo Poznańskie, pp. 433–437 (2007)
Google Scholar
Kadlec, V.: Syntactic analysis of natural languages based on context-free grammar backbone. Ph.D. thesis, Masaryk University, Czech Republic (2008)
Google Scholar
Pala, K., Rychlý, P., Smrž, P.: DESAM – annotated corpus for Czech. In: Proceedings of SOFSEM 1997. LNCS, vol. 1338, pp. 523–530. Springer, Heidelberg (1997)
Google Scholar

Download references

Author information

Authors and Affiliations

Faculty of Informatics, Masaryk University, Botanická 68a, 602 00, Brno, Czech Republic
Vojtěch Kovář, Aleš Horák & Vladimír Kadlec

Authors

Vojtěch Kovář
View author publications
You can also search for this author in PubMed Google Scholar
Aleš Horák
View author publications
You can also search for this author in PubMed Google Scholar
Vladimír Kadlec
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Petr Sojka Aleš Horák Ivan Kopeček Karel Pala

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kovář, V., Horák, A., Kadlec, V. (2008). New Methods for Pruning and Ordering of Syntax Parsing Trees. In: Sojka, P., Horák, A., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2008. Lecture Notes in Computer Science(), vol 5246. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-87391-4_18

Download citation

DOI: https://doi.org/10.1007/978-3-540-87391-4_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-87390-7
Online ISBN: 978-3-540-87391-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

New Methods for Pruning and Ordering of Syntax Parsing Trees