Abstract
A novel theoretical framework for describing stochastic grammars is proposed based on a small set of basic random variables that generate tree structures and relate them to surface strings. A number of prominent statistical language models are formulated as stochastic processes over these basic random variables.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
David Magerman. Statistical decision-tree models for parsing. Procs. 33rd Annual Meeting of the Association for Computational Linguistics, pages 276–283, 1995.
Hiyan Alshawi. Head automata and bilingual tiling: Translation with minimal representations. Procs. 34th Annual Meeting of the Association for Computational Linguistics, pages 167–176, 1996.
Michael Collins. Three generative, lexicalized models for statistical parsing. Procs. 35th Annual Meeting of the Association for Computational Linguistics, pages 16–23, 1997.
T. L. Booth and R. A. Thompson. Applying probability measures to abstract languages. IEEE Transactions on Computers, C-22(5), pages 442–450, 1973.
Ted Briscoe and John Carroll. Generalized probabilistic lr parsing of natural language (corpora) with unification-based grammars. Computational Linguistics, 19(1):25–59, 1993.
Ezra Black, Fred Jelinek, John Lafferty, David Magerman, Robert Mercer, and Salim Roukos. Towards history-based grammars: Using richer models for probabilistic parsing. Procs. 28th Annual Meeting of the Association for Computational Linguistics, pages 31–37, 1993.
Rens Bod. Enriching Linguistics with Statistics: Performance Models of Natural Language. ILLC Dissertation Series 1995-14, Amsterdam, 1995.
Yves Schabes. Stochastic lexicalized tree-adjoining grammars. Proc. 14th International Conference on Computational Linguistics, pages 426–432, 1992.
Khalil Sima’an. Computational complexity of probabilistic disambiguations by means of tree-grammars. Procs. 16th International Conference on Computational Linguistics, page at the very end, 1996.
Saul Gorn. Processors for infinite codes of shannon-fano type. Symp. Math. Theory of Automata, pages--–--, 1962.
Lucien Tesniére. Éléments de Syntaxe Structurale. Libraire C. Klincksieck, Paris, 1959.
Jacques Brel. Ne me quitte pas. La Valse á Mille Temps (PHI 6325.205), 1959.
Gerald Gazdar, Ewan Klein, Geoffrey K. Pullum, and Ivan A. Sag. Generalized Phrase Structure Grammar. Basil Blackwell Publishing, Oxford, England, 1985. Also published by Harvard University Press, Cambridge, MA.
Zhiyi Chi. Statistical properties of probabilistic context-free grammars. Computational Linguistics, 25(1):131–160, 1999.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2000 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Samuelsson, C. (2000). A Theory of Stochastic Grammars. In: Christodoulakis, D.N. (eds) Natural Language Processing — NLP 2000. NLP 2000. Lecture Notes in Computer Science(), vol 1835. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45154-4_9
Download citation
DOI: https://doi.org/10.1007/3-540-45154-4_9
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-67605-8
Online ISBN: 978-3-540-45154-9
eBook Packages: Springer Book Archive