Abstract
Arc contractions in syntactic dependency graphs can be used to decide which graphs are trees. The paper observes that these contractions can be expressed with weighted finite-state transducers (weighted FST) that operate on string-encoded trees. The observation gives rise to a finite-state parsing algorithm that computes the parse forest and extracts the best parses from it. The algorithm is customizable to functional and bilexical dependency parsing, and it can be extended to non-projective parsing via a multi-planar encoding with prior results on high recall. Our experiments support an analysis of projective parsing according to which the worst-case time complexity of the algorithm is quadratic to the sentence length, and linear to the overlapping arcs and the number of functional categories of the arcs. The results suggest several interesting directions towards efficient and high-precision dependency parsing that takes advantage of the flexibility and the demonstrated ambiguity-packing capacity of such a parser.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
It would be interesting to study how the minorization relation compares with the derivation relation of tree adjoining grammars. In both cases, the derived tree is manipulated from inside.
- 2.
This article is published on the occasion of Professor Lauri Carlson’s birthday. As he co-supervised my Ph.D. research together with Kimmo Koskenniemi a decade ago, it is now a great privilege for me to write about these new advances in the research area where we started together.
- 3.
The tree is drawn with the xdag.sty package written by Denys Duchier, Ralph Debusmann and Robert Grabowski. For convenience, the orientation of the tree is flipped in the context of the linguistic example that is typeset with expex.sty.
- 4.
This definition excludes arc deletion that is normally included in the definition of graph minors.
- 5.
These minorization and “majorization” phases could be compared to the forward and backward procedures used in trellis algorithms for Hidden Markov Models.
References
Abney, Steven. 1996. Partial parsing via finite state cascades. In Proceedings of the ESSLLI’96 robust parsing workshop, Prague, Czech Republic.
Debusmann, Ralph, Denys Duchier, and Geert-Jan M. Kruijff. 2004. Extensible dependency grammar: A new methodology. In Proceedings of the COLING 2004 workshop of recent advances in dependency grammar, Geneva, Switzerland, eds. Geert-Jan M. Kruijff and Denys Duchier, 78–84.
Didriksen, Tino. 2010. Constraint grammar manual: 3rd version of the CG formalism variant. GrammarSoft ApS, Denmark. http://beta.visl.sdu.dk/cg3/vislcg3.pdf.
Eisner, Jason. 1997. Bilexical grammars and a cubic-time probabilistic parser. In Proceedings of the 4th international workshop on parsing technologies, 54–65. Cambridge: MIT.
Elworthy, David. 2000. A finite state parser with dependency structure output. In Proceedings of sixth international workshop on parsing technologies (IWPT 2000). Trento: Institute for Scientific and Technological Research.
Gómez-Rodríguez, Carlos, and Joakim Nivre. 2010. A transition-based parser for 2-planar dependency structures. In Proceedings of the 48th annual meeting of the Association for Computational Linguistics (ACL 2010), Uppsala, Sweden, 1492–1501.
Gómez-Rodríguez, Carlos, David Weir, and John Carroll. 2009. Parsing mildly non-projective dependency structures. In Proceedings of the 12th conference of the European chapter of the Association for Computational Linguistics (EACL 2009), 291–299.
Hulden, Mans. 2011. Constraint Grammar parsing with left and right sequential finite transducers. In Proceedings of the 9th international workshop on finite state methods and natural language processing (FSMNLP 2011), 39–47. Blois: Association for Computational Linguistics. http://www.aclweb.org/anthology/W11-4406.
Joshi, Aravind K. 1996. A parser from antiquity: an early application of finite state transducers to natural language parsing. In Proceedings of the ECAI’96 workshop on extended finite state models of language, ed. András Kornai. Studies in natural language processing, 33–34. Cambridge: Cambridge University Press.
Koskenniemi, Kimmo, Pasi Tapanainen, and Atro Voutilainen. 1992. Compiling and using finite-state syntactic rules. In Proceedings of the 14th COLING 1992 conference, Nantes, France. Vol. 1, 156–162.
Kuhlmann, Marco. 2010. In Dependency structures and lexicalized grammars. An algebraic approach, Vol. 6270 of Lecture notes in artificial intelligence, FoLLI publications on logic, language and information. Springer: Berlin.
Kuhlmann, Marco, and Giorgio Satta. 2009. Treebank grammar techniques for non-projective dependency parsing. In Proceedings of the 12th conference of the European chapter of the Association for Computational Linguistics (EACL’09), 478–486.
Lombardo, Vincenzo, and Leonardo Lesmo. 1996. An Earley-type recognizer for dependency grammar. In Proceedings of the 16th COLING conference, Copenhagen, Denmark, Vol. 2, 723–728.
Maruyama, Hiroshi. 1990. Structural disambiguation with constraint propagation. In Proceedings of the 28th ACL 1989 conference, Pittsburgh, Pennsylvania, 31–38.
Mohri, Mehryar. 2009. Weighted automata algorithms. In Handbook of weighted automata, eds. Manfred Droste, Werner Kuich, and Heiko Vogler, 213–254. Berlin: Springer.
Neuhaus, Peter, and Norbert Bröker. 1997. The complexity of recognition of linguistically adequate dependency grammars. In Proceedings of the 35th annual meeting of the Association for Computational Linguistics and the 8th conf. of the European chapter of the Association for Computational Linguistics, Madrid, Spain, 337–343.
Nivre, Joakim. 2006. Constraints on non-projective dependency parsing. In Proceedings of the 11th conference of the European chapter of the Association for Computational Linguistics (EACL 2006), 73–80.
Nivre, Joakim. 2008. Algorithms for deterministic incremental dependency parsing. Computational Linguistics 34: 513–553.
Oflazer, Kemal. 2003. Dependency parsing with an extended finite-state approach. Computational Linguistics 29: 515–544.
Plátek, Martin, Markéta Lopatková, and Karel Oliva. 2003. Restarting automata: motivations and applications. In Workshop ‘Petrinetze’ and 13. Theorietag ‘Formale Sprachen und Automaten’, ed. M. Holzer, 90–96. Institut für Informatik, Technische Universität München.
Roche, Emmanuel. 1997. Parsing with finite-state transducers. In Finite-state language processing, eds. Emmanuel Roche and Yves Schabes, 241–281. Cambridge: MIT Press. Chap. 8.
Tapanainen, Pasi. 1999. Parsing in two frameworks: finite-state and functional dependency grammar. Doctoral diss., University of Helsinki, Finland.
Tesnière, Lucien. 1959. Éléments de syntaxe structurale. Paris: Éditions Klincksieck.
Voutilainen, Atro. 1994. Designing a parsing grammar, Vol. 22 of Publications of the Department of General Linguistics, University of Helsinki. Helsinki: Yliopistopaino.
Yli-Jyrä, Anssi Mikael. 2003. Multiplanarity—a model for dependency structures in treebanks. In TLT 2003. Proceedings of the second workshop on treebanks and linguistic theories, eds. Joakim Nivre and Erhard Hinrichs, Vol. 9 of Mathematical modelling in physics, engineering and cognitive sciences, 189–200. Växjö: Växjö University Press.
Yli-Jyrä, Anssi. 2004. Axiomatization of restricted non-projective dependency trees through finite-state constraints that analyse crossing bracketings. In Proceedings of the COLING 2004 workshop of recent advances in dependency grammar, Geneva, Switzerland, 33–40.
Yli-Jyrä, Anssi. 2005. Approximating dependency grammars through intersection of star-free regular languages. International Journal of Foundations of Computer Science 16: 565–579.
Yli-Jyrä, Anssi. 2008. Transducers from parallel replace rules and modes with generalized lenient composition. In 6th international workshop, finite-state methods and natural language processing, FSMNLP-2007. Revised papers, 197–212. Potsdam: Potsdam University Press.
Yli-Jyrä, Anssi. 2011. An efficient constraint grammar parser based on inward deterministic automata. In Proceedings of the NODALIDA 2011 workshop constraint grammar applications, eds. Eckhard Bick, Kristin Hagen, Kaili Müürisep, and Trond Trosterud, Vol. 14 of NEALT proceedings series, 50–60.
Yli-Jyrä, Anssi, and Kimmo Koskenniemi. 2004. Compiling contextual restrictions on strings into finite-state automata. In Proceedings of the Eindhoven FASTAR days 2004, eds. Loek Cleophas and Bruce W. Watson, Vol. 04-40 of Computer science reports. Eindhoven: Technische Universiteit Eindhoven.
Yli-Jyrä, Anssi, and Matti Nykänen. 2004. A hierarchy of mildly context sensitive dependency grammars. In Proceedings of the 9th conference on formal grammar (FGNancy 2004), eds. Gerald Penn, Gerhard Jäger, Paola Monachesi, and Shuly Wintner, 151–165.
Acknowledgements
The research has been made possible by the Academy of Finland grant number 128536 “Open and Language Independent Automata-Based Resource Production Methods for Common Language Research Infrastructure”, and, more recently, by the FIN-CLARIN project steered by Krister Lindén. Kimmo Koskenniemi, Pasi Tapanainen, Atro Voutilainen and Lauri Carlson supported my first investigations into contractions in finite-state intersection parsing since 1995. More recently, my thinking has benefited from several related discussions with Carlos Gómez-Rodríguez, Jason Eisner, Joakim Nivre, Marco Kuhlmann, and John Hale. During the multi-year creative process, I felt several times need for heavenly empowerment. I look gratefully back to every inspired moment.
I am indebted to the prior anonymous reviewers of the CIAA 2011 and FSMNLP 2011 meetings, as well as Aarne Ranta, Wanjiku Nganga, Jussi Piitulainen, and Miikka Silfverberg for their valuable comments pointing out many areas for further study. The remaining imperfections in the text are mine, of course.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Yli-Jyrä, A. (2012). On Dependency Analysis via Contractions and Weighted FSTs. In: Santos, D., Lindén, K., Ng’ang’a, W. (eds) Shall We Play the Festschrift Game?. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-30773-7_10
Download citation
DOI: https://doi.org/10.1007/978-3-642-30773-7_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-30772-0
Online ISBN: 978-3-642-30773-7
eBook Packages: Computer ScienceComputer Science (R0)