Skip to main content

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 1040))

Included in the following conference series:

  • 192 Accesses

Abstract

We describe a language model employing a new headed-disjuncts formulation of Lafferty et al.'s (1992) probabilistic link grammar, together with (1) an EM training method for estimating the probabilities, and (2) a procedure for learning some simple lexicalized grammar structures. The model in its simplest form is a generalization of n-gram models, but in its general form possesses context-free expressiveness. Unlike the original experiments on probabilistic link grammars, we assume that no hand-coded grammar is initially available (as with n-gram models). We employ untyped links to concentrate the learning on lexical dependencies, and our formulation uses the lexical identities of heads to influence the structure of the parse graph. After learning, the language model consists of grammatical rules in the form of a set of simple disjuncts for each word, plus several sets of probability parameters. The formulation extends cleanly toward learning more powerful context-free grammars. Several issues relating to generalization bias, linguistic constraints, and parameter smoothing are considered. Preliminary experimental results on small artificial corpora are supportive of our approach.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • Baker, James K. 1979. Trainable grammars for speech recognition. In Speech Communication Papers for the 97th Meeting of the Acoustic Society of America, ed. by D. H. Klatt & J. J. Wolf, 547–550.

    Google Scholar 

  • Baum, Leonard E. 1972. An inequality and associated maximization technique in Statistical estimation for probabilistic functions of Markov processes. Inequalities, 3(1):1–8.

    Google Scholar 

  • Brown, Peter F., Stephen A. Della Pietra, Vincent J. Della Pietra, Jennifer C. Lai, & Robert L. Mercer. 1992. An estimate of an upper bound for the entropy of English. Computational Linguistics, 18(1):31–40.

    Google Scholar 

  • Carroll, Glenn & Eugene Charniak. 1992. Learning probabilistic dependency grammars from labelled text. In AAAI Fall Symposium on Probabilistic Approaches to Natural Language, 25–32, Cambridge, MA. Proceedings to appear as AAAI technical report.

    Google Scholar 

  • Dempster, A. P., N. M. Laird, & D. B. Rubin. 1977. Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society B, 39:1–38.

    Google Scholar 

  • Jelinek, Frederick, John D. Lafferty, & Robert L. Mercer. 1992. Basic methods of probabilistic context-free grammars. In Speech Recognition and Understanding: Recent Advances, Trends, and Applications, Nato ASI Series, Vol. F75, 345–360. Springer Verlag.

    Google Scholar 

  • Lafferty, John D., Daniel Sleator, & Davy Temperley. 1992. Grammatical trigrams: A probabilistic model of link grammar. In AAAI Fall Symposium on Probabilistic Approaches to Natural Language, 89–97, Cambridge, MA. Proceedings to appear as AAAI technical report.

    Google Scholar 

  • Lari, K. & S. J. Young. 1990. The estimation of stochastic context-free grammars using the inside-outside algorithm. Computer Speech and Language, 4:35–56.

    Article  Google Scholar 

  • Sleator, Daniel & Davy Temperley. 1991. Parsing English with a link grammar. Technical Report CMU-CS-91-196, Dept. of Computer Science, Carnegie Mellon University.

    Google Scholar 

  • Starosta, Stanley. 1991. Dependency grammar and monostratal transfer. In 2nd Japan-Australia Joint Symposium on Natural Language Processing.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Stefan Wermter Ellen Riloff Gabriele Scheler

Rights and permissions

Reprints and permissions

Copyright information

© 1996 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Fong, E.Wm., Wu, D. (1996). Learning restricted probabilistic link grammars. In: Wermter, S., Riloff, E., Scheler, G. (eds) Connectionist, Statistical and Symbolic Approaches to Learning for Natural Language Processing. IJCAI 1995. Lecture Notes in Computer Science, vol 1040. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-60925-3_46

Download citation

  • DOI: https://doi.org/10.1007/3-540-60925-3_46

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-60925-4

  • Online ISBN: 978-3-540-49738-7

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics