Abstract
We present experiments with automatic annotation of English texts, taken from the Penn Treebank, at the dependency-based tectogrammatical layer, as it is defined in the Prague Dependency Treebank. The proposed analyzer, which is based on machine-learning techniques, outperforms a tool based on hand-written rules, which is used for partial tectogrammatical annotation of English now, in the most important characteristics of tectogrammatical annotation. Moreover, both tools were combined and their combination gives the best results.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Hajič, J., et al.: Prague Dependency Treebank 2.0, Linguistic Data Consortium, Philadelphia LDC Catalog No. LDC2006T01 (2006), http://ufal.mff.cuni.cz/pdt2.0/
Sgall, P., Hajičová, E., Panevová, J.: The Meaning of a Sentence in Its Semantic and Pragmatic Aspects. Academia – Kluwer, Praha – Amsterdam (1986)
Hajičová, E., Panevová, J., Sgall, P.: A Manual for Tectogrammatic Tagging of the Prague Dependency Treebank. ÚFAL/CKL Technical Report TR-2000-09. Charles University, Prague (2000)
Klimeš, V.: Transformation-Based Tectogrammatical Analysis of Czech. In: Sojka, P., Kopeček, I., Pala, K. (eds.) Proceedings of Text, Speech and Dialogue 2006, Springer, Heidelberg (2006)
Kučerová, I., Žabokrtský, Z.: Transforming Penn Treebank Phrase Trees into (Praguian) Tectogrammatical Dependency Trees, Prague Bulletin of Mathematical Linguistic, Prague, vol. 78, pp. 77–94 (2002)
Cinková, S., et al.: Annotation of English on the Tectogrammatical Level. Technical Report No. TR-2006-35. ÚFAL MFF, Charles University, Prague (2006)
Ngai, G., Florian, R.: Transformation-Based Learning in the Fast Lane. In: Proceedings of NAACL 2001, Pittsburgh, PA, pp. 40–47 (2001)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Klimeš, V. (2007). Transformation-Based Tectogrammatical Dependency Analysis of English. In: Matoušek, V., Mautner, P. (eds) Text, Speech and Dialogue. TSD 2007. Lecture Notes in Computer Science(), vol 4629. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74628-7_5
Download citation
DOI: https://doi.org/10.1007/978-3-540-74628-7_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-74627-0
Online ISBN: 978-3-540-74628-7
eBook Packages: Computer ScienceComputer Science (R0)