An Overview of Probabilistic Tree Transducers for Natural Language Processing

Knight, Kevin; Graehl, Jonathan

doi:10.1007/978-3-540-30586-6_1

An Overview of Probabilistic Tree Transducers for Natural Language Processing

Kevin Knight¹⁷ &
Jonathan Graehl¹⁷

Conference paper

2409 Accesses
30 Citations

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 3406))

Abstract

Probabilistic finite-state string transducers (FSTs) are extremely popular in natural language processing, due to powerful generic methods for applying, composing, and learning them. Unfortunately, FSTs are not a good fit for much of the current work on probabilistic modeling for machine translation, summarization, paraphrasing, and language modeling. These methods operate directly on trees, rather than strings. We show that tree acceptors and tree transducers subsume most of this work, and we discuss algorithms for realizing the same benefits found in probabilistic string transduction.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Knight, K., Graehl, J.: Machine transliteration. Computational Linguistics 24 (1998)
Google Scholar
Mohri, M., Pereira, F., Riley, M.: The design principles of a weighted finite-state transducer library. Theor. Comput. Sci. 231 (2000)
Google Scholar
Kaplan, R., Kay, M.: Regular models of phonological rule systems. Computational Linguistics 20 (1994)
Google Scholar
Karttunen, L., Gaal, T., Kempe, A.: Xerox finite-state tool. Technical report, Xerox Research Centre Europe (1997)
Google Scholar
van Noord, G., Gerdemann, D.: An extendible regular expression compiler for finite-state approaches in natural language processing. In: Boldt, O., Jürgensen, H. (eds.) WIA 1999. LNCS, vol. 2214, p. 122. Springer, Heidelberg (2001)
Chapter Google Scholar
Kanthak, S., Ney, H.: Fsa: An efficient and flexible C++ toolkit for finite state automata using on-demand computation. In: Proc. ACL (2004)
Google Scholar
Graehl, J.: Carmel finite-state toolkit (1997), http://www.isi.edu/licensed-sw/carmel/
Knight, K., Al-Onaizan, Y.: Translation with finite-state devices. In: Farwell, D., Gerber, L., Hovy, E. (eds.) AMTA 1998. LNCS (LNAI), vol. 1529, pp. 421–437. Springer, Heidelberg (1998)
Chapter Google Scholar
Brown, P., Della Pietra, S., Della Pietra, V., Mercer, R.: The mathematics of statistical machine translation: Parameter estimation. Computational Linguistics 19, 263–311 (1993)
Google Scholar
Kumar, S., Byrne, W.: A weighted finite state transducer implementation of the alignment template model for statistical machine translation. In: Proc. NAACL (2003)
Google Scholar
Och, F., Tillmann, C., Ney, H.: Improved alignment models for statistical machine translation. In: Proc. ACL (1999)
Google Scholar
Yamada, K., Knight, K.: A syntax-based statistical translation model. In: Proc. ACL, pp. 523–530 (2001)
Google Scholar
Wu, D.: Stochastic inversion transduction grammars and bilingual parsing of parallel corpora. Computational Linguistics 23, 377–404 (1997)
Google Scholar
Alshawi, H., Bangalore, S., Douglas, S.: Learning dependency translation models as collections of finite state head transducers. Computational Linguistics 26, 45–60 (2000)
Article MathSciNet Google Scholar
Gildea, D.: Loosely tree-based alignment for machine translation. In: Proc. ACL, Sapporo, Japan (2003)
Google Scholar
Eisner, J.: Learning non-isomorphic tree mappings for machine translation. In: Proc. ACL, companion volume (2003)
Google Scholar
Melamed, I.D.: Multitext grammars and synchronous parsers. In: Proc. NAACL (2003)
Google Scholar
Knight, K., Marcu, D.: Summarization beyond sentence extraction: A probabilistic approach to sentence compression. Artificial Intelligence 139 (2002)
Google Scholar
Pang, B., Knight, K., Marcu, D.: Syntax-based alignment of multiple translations extracting paraphrases and generating new sentences. In: Proc. NAACL (2003)
Google Scholar
Langkilde, I., Knight, K.: Generation that exploits corpus-based statistical knowledge. In: Proc. ACL (1998)
Google Scholar
Bangalore, S., Rambow, O.: Exploiting a probabilistic hierarchical model for generation. In: International Conference on Computational Linguistics (COLING 2000), Saarbrucken, Germany (2000)
Google Scholar
Corston-Oliver, S., Gamon, M., Ringger, E.K., Moore, R.: An overview of Amalgam: A machine-learned generation module. In: Proceedings of the International Natural Language Generation Conference, New York, USA, pp. 33–40 (2002)
Google Scholar
Echihabi, A., Marcu, D.: A noisy-channel approach to question answering. In: Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics, Sapporo, Japan (2003)
Google Scholar
Charniak, E.: Immediate-head parsing for language models. In: Proc. ACL (2001)
Google Scholar
Rounds, W.C.: Mappings and grammars on trees. Mathematical Systems Theory 4, 257–287 (1970)
Article MATH MathSciNet Google Scholar
Thatcher, J.W.: Generalized² sequential machine maps. J. Comput. System Sci. 4, 339–367 (1970)
Article MATH MathSciNet Google Scholar
Graehl, J., Knight, K.: Training tree transducers. In: Proc. NAACL (2004)
Google Scholar
Gécseg, F., Steinby, M.: Tree Automata. Akadémiai Kiadó, Budapest (1984)
MATH Google Scholar
Comon, H., Dauchet, M., Gilleron, R., Jacquemard, F., Lugiez, D., Tison, S., Tommasi, M.: Tree automata techniques and applications (1997), Available on www.grappa.univ-lille3.fr/tata (release October 1, 2002)
Galley, M., Hopkins, M., Knight, K., Marcu, D.: What’s in a translation rule? In: NAACL, Boston, MA (2004)
Google Scholar
Doner, J.: Tree acceptors and some of their applications. Journal of Computer and System Sciences 4, 406–451 (1970)
Article MATH MathSciNet Google Scholar
Hopcroft, J., Ullman, J.: Introduction to Automata Theory, Languages, and Computation. Addison-Wesley Series in Computer Science. Addison-Wesley, London (1979)
MATH Google Scholar
Johnson, M.: PCFG models of linguistic tree representations. Computational Linguistics 24, 613–632 (1998)
Google Scholar
Collins, M.: Three generative, lexicalised models for statistical parsing. In: Proc. ACL (1997)
Google Scholar
Thatcher, J.: Characterizing derivation trees of context-free grammars through a generalization of finite automata theory. J. Comput. Syst. Sci. 1, 317–322 (1967)
Article MATH MathSciNet Google Scholar
Yamasaki, K., Sodeshima, Y.: Fundamental properties of pushdown tree transducers (PDTT) — a top-down case. IEICE Trans. Inf. and Syst. E76-D (1993)
Google Scholar
Engelfriet, J.: Top-down tree transducers with regular look-ahead. Math. Systems Theory 10, 289–303 (1977)
Article MATH MathSciNet Google Scholar
Engelfriet, J.: Bottom-up and top-down tree transformations — a comparison. Math. Systems Theory 9, 198–231 (1975)
Article MATH MathSciNet Google Scholar
Baum, L.E., Eagon, J.A.: An inequality with application to statistical estimation for probabilistic functions of Markov processes and to a model for ecology. Bulletin of the American Mathematical Society 73, 360–363 (1967)
Article MATH MathSciNet Google Scholar
Aho, A.V., Ullman, J.D.: Translations of a context-free grammar. Information and Control 19, 439–475 (1971)
Article MathSciNet Google Scholar
van Noord, G.: The intersection of finite state automata and definite clause grammars. In: Proc. ACL (1995)
Google Scholar
Dijkstra, E.W.: A note on two problems in connexion with graphs. Numerische Mathematik 1, 269–271 (1959)
Article MATH MathSciNet Google Scholar
Eppstein, D.: Finding the k shortest paths. SIAM Journal on Computing 28, 652–673 (1999)
Article MathSciNet Google Scholar
Mohri, M., Riley, M.: An efficient algorithm for the n-best-strings problem. In: Proc. ICSLP (2002)
Google Scholar
Langkilde, I.: Forest-based statistical sentence generation. In: Proc. NAACL (2000)
Google Scholar
Nederhof, M.J., Satta, G.: Parsing non-recursive CFGs. In: Proc. ACL (2002)
Google Scholar
Knuth, D.: A generalization of Dijkstra’s algorithm. Info. Proc. Letters 6 (1977)
Google Scholar
Klein, D., Manning, C.: Parsing and hypergraphs. In: International Workshop on Parsing Technologies (2001)
Google Scholar
Nederhof, M.J.: Weighted deductive parsing and Knuth’s algorithm. Computational Linguistics 29 (2003)
Google Scholar
Shieber, S.M., Schabes, Y.: Synchronous tree-adjoining grammars. In: Proceedings of the 13th International Conference on Computational Linguistics, Helsinki, Finland, vol. 3, pp. 253–258 (1990)
Google Scholar

Download references

Author information

Authors and Affiliations

Information Sciences Institute (ISI) and Computer Science Department, University of Southern California,
Kevin Knight & Jonathan Graehl

Authors

Kevin Knight
View author publications
You can also search for this author in PubMed Google Scholar
Jonathan Graehl
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

National Polytechnic Institute, Center for Computing Research, 07738, Mexico City, México
Alexander Gelbukh

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Knight, K., Graehl, J. (2005). An Overview of Probabilistic Tree Transducers for Natural Language Processing. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2005. Lecture Notes in Computer Science, vol 3406. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30586-6_1

Download citation

DOI: https://doi.org/10.1007/978-3-540-30586-6_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-24523-0
Online ISBN: 978-3-540-30586-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics