Abstract
In this paper we present a method for extracting general structures of the verb groups from a tagged and fully disambiguated corpus and consecutive exploitation of these structures for the building a formal grammar in the Prolog DCG fashion. Our goal is to apply them as a rules for the analysis of the Czech verb groups in the non- disambiguated grammatically tagged Czech corpus texts. The problem of the recognition of verb discontinuous constituents in Czech is also approached and obtained statistical data are presented.
This research has been partially supported by the Czech Ministry of Education under the grant VS97028
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Karel Pala, Pavel Rychlý and Pavel Smrž. DESAM — approaches to disambiguation. Technical Report FIMU-RS-97-09, Brno, 1997.
Karel Pala, Pavel Rychlý and Pavel Smrž. DESAM — annotated corpus for Czech. In Proceedings of SOFSEM’97. Springer-Verlag, 1997.
Veronica Dahl. More on Gapping Grammars. In Proceedings of the International Conference on Fifth Generation Computer Systems. Tokyo, 1984.
Bruno Maxmilian Schulze and Oliver Christ. The CQP User’s Manual. Universität Stuttgart, Stuttgart, 1996.
Jan Petr et al. The Grammar of Czech III. Academia, Praha, 1987.
Pavel Ševeček. LEMMA — a Lemmatizer for Czech. Brno, 1996. (manuscript).
Karel Pala and Pavel Ševeček. Valencies of Czech Verbs. Studia Minora Facultatis Philosophicae Universitatis Brunensis, A45, 1997.
Pavel Smrž and Eva Žáčková. New Tools for Disambiguation of Czech Texts. In Proceedings of TSD’98. Masaryk University, Brno, 1998.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1999 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Žáčková, E., Pala, K. (1999). Corpus-Based Rules for Czech Verb Discontinuous Constituents. In: Matousek, V., Mautner, P., Ocelíková, J., Sojka, P. (eds) Text, Speech and Dialogue. TSD 1999. Lecture Notes in Computer Science(), vol 1692. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48239-3_59
Download citation
DOI: https://doi.org/10.1007/3-540-48239-3_59
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-66494-9
Online ISBN: 978-3-540-48239-0
eBook Packages: Springer Book Archive