Abstract
In this paper, I describe SPATTER, a statistical parser based on decision-tree learning techniques which avoids the difficulties of grammar development simply by having no grammar. Instead, the parser is driven by statistical pattern recognizers, in the form of decision trees, trained on correctly parsed sentences. This approach to grammatical inference results in a parser which constructs a complete parse for every sentence and achieves accuracy rates far better than any previously published result.
This paper discusses work performed at the IBM Speech Recognition Group under Frederick Jelinek and at the BBN Speech Recognition Group under John Makhoul.
Preview
Unable to display preview. Download preview PDF.
References
L. R. Bahl, P. F. Brown, P. V. deSouza, and R. L. Mercer. 1989. A tree-based statistical language model for natural language speech recognition. IEEE Transactions on Acoustics, Speech, and Signal Processing, Vol. 36, No. 7, pages 1001–1008.
J. K. Baker. 1975. Stochastic modeling for automatic speech understanding. Speech Recognition, pages 521–542.
L. E. Baum. 1972. An inequality and associated maximization technique in statistical estimation of probabilistic functions of markov processes. Inequalities, Vol. 3, pages 1–8.
E. Black and et al. 1991. A procedure for quantitatively comparing the syntactic coverage of english grammars. Proceedings of the February 1991 DARPA Speech and Natural Language Workshop, pages 306–311.
E. Black, R. Garside, and G. Leech. 1993. Statistically-driven computer grammars of english: the ibm/lancaster approach. Rodopi, Atlanta, Georgia.
E. Black, J. Lafferty, and S. Roukos. 1992. Development and evaluation of a broadcoverage probabilistic grammar of english-language computer manuals. Proceedings of the Association for Computational Linguistics, 1992.
L. Breiman, J. H. Friedman, R. A. Olshen, and C. J. Stone. 1984. Classification and Regression Trees. Wadsworth and Brooks, Pacific Grove, California.
P. F. Brown, V. Della Pietra, P. V. deSouza, J. C. Lai, and R. L. Mercer. 1992. “Class-based n-gram models of natural language.” Computational Linguistics, 18(4), pages 467–479.
G. Carroll and E. Charniak 1992. Learning probabilistic dependency grammars from labeled text. Working Notes, Fall Symposium Series, AAAI, pages 25–32.
F. Jelinek, J. Lafferty, D. M. Magerman, R. Mercer, A. Ratnaparkhi, and S. Roukos. 1994. Decision tree parsing using a hidden derivation model. Proceedings of the 1994 Human Language Technology Workshop, pages 272–277.
F. Jelinek, J. D. Lafferty, and R. L. Mercer. 1992. Basic methods of probabilistic context-free grammars. Speech Recognition and Understanding: Recent Advances, Trends, and Applications. Proceedings of the NATO Advanced Study Institute, pages 345–360.
J. Kupiec. 1991. A trellis-based algorithm for estimating the parameters of a hidden stochastic context-free grammar. Proceedings of the February 1991 DARPA Speech and Natural Language Workshop, pages 241–246, 1991.
D. M. Magerman. 1994. Natural Language Parsing as Statistical Pattern Recognition. Doctoral dissertation. Stanford University, Stanford, California.
D. M. Magerman. 1995. Statistical decision-tree models for parsing. Proceedings of the Association for Computational Linguistics, 1995.
D. M. Magerman and M. P. Marcus. 1990. Parsing a natural language using mutual information statistics. Proceedings of AAAI-90.
F. Pereira and Y. Schabes. 1992. Inside-outside reestimation from partially bracketed corpora. Proceedings of the February 1992 DARPA Speech and Natural Language Workshop, pages 122–127.
V. Pratt. 1973. A linguistics oriented programming language. Proceedings of the Third International Joint Conference on Artificial Intelligence.
Y. Schabes and R. Waters. 1993. Stochastic lexicalized context-free grammar. Proceedings of the August 1993 International Workshop on Parsing Technologies.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1996 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Magerman, D.M. (1996). Learning grammatical structure using statistical decision-trees. In: Miclet, L., de la Higuera, C. (eds) Grammatical Interference: Learning Syntax from Sentences. ICGI 1996. Lecture Notes in Computer Science, vol 1147. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0033339
Download citation
DOI: https://doi.org/10.1007/BFb0033339
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-61778-5
Online ISBN: 978-3-540-70678-6
eBook Packages: Springer Book Archive