Abstract
The Prague Dependency Treebank (PDT) project is conceived of as a many-layered scenario in three respects: the stratal annotation scheme, the division of labor, and the level of detail captured at the highest, tectogrammatical layer. The following aspects of the present status of the PDT are discussed in detail: the now-available PDT version 1.0, annotated manually at the morphemic and analytic layers, including recent experience with post-annotation checking; the ongoing effort of tectogrammatical layer annotation, with specific attention to the so-called model collection; and two different areas of exploitation of the PDT, for linguistic research and for information retrieval applications.
Copyright information
© 2001 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Hajičová, E. et al. (2001). The Current Status of the Prague Dependency Treebank. In: Matoušek, V., Mautner, P., Mouček, R., Taušer, K. (eds) Text, Speech and Dialogue. TSD 2001. Lecture Notes in Computer Science, vol 2166. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44805-5_2
DOI: https://doi.org/10.1007/3-540-44805-5_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-42557-1
Online ISBN: 978-3-540-44805-1
eBook Packages: Springer Book Archive