Abstract
This paper examines the impact of clause detection and intraclausal coordination detection on dependency parsing of Slovene. New methods based on machine learning and heuristic rules are proposed for both detection tasks and are incorporated into a new dependency parsing algorithm, PACID. The Slovene Dependency Treebank was used for evaluation. In parsing, relative error reductions of 6.4% and 9.2% were achieved compared with the dependency parsers MSTP and Malt, respectively.
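The relative error reduction figures above can be read as the share of a baseline parser's errors that the new parser removes. The following is a minimal sketch of that arithmetic, assuming error is measured as one minus parsing accuracy (the abstract does not name the metric). The function name and the accuracy values are hypothetical and are not taken from the paper.

```python
def relative_error_reduction(baseline_accuracy: float, new_accuracy: float) -> float:
    """Return the fraction of the baseline's errors removed by the new system.

    Both arguments are accuracies in [0, 1]; error is taken as 1 - accuracy,
    which is an assumption made for this illustration.
    """
    baseline_error = 1.0 - baseline_accuracy
    new_error = 1.0 - new_accuracy
    if baseline_error <= 0.0:
        raise ValueError("baseline has no errors to reduce")
    return (baseline_error - new_error) / baseline_error


# Hypothetical accuracies chosen only for illustration (not results from the paper).
reduction = relative_error_reduction(0.80, 0.82)
print(f"Relative error reduction: {reduction:.1%}")
```

Under this reading, a small absolute gain in accuracy can correspond to a noticeably larger relative error reduction when the baseline error is already low.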
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Marinčič, D., Gams, M., Šef, T. (2009). Intraclausal Coordination and Clause Detection as a Preprocessing Step to Dependency Parsing. In: Matoušek, V., Mautner, P. (eds) Text, Speech and Dialogue. TSD 2009. Lecture Notes in Computer Science, vol 5729. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04208-9_23
DOI: https://doi.org/10.1007/978-3-642-04208-9_23
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-04207-2
Online ISBN: 978-3-642-04208-9
eBook Packages: Computer Science, Computer Science (R0)