Abstract
In this paper, we present a directed Markov random field model that integrates trigram models, structural language models (SLMs) and probabilistic latent semantic analysis (PLSA) for statistical language modeling. The SLM is essentially a generalization of shift-reduce probabilistic push-down automata and is thus more complex and powerful than probabilistic context-free grammars (PCFGs). The added context sensitivity due to the trigram and PLSA components, together with the violation of tree structure in the topology of the underlying random field model, makes the inference and parameter estimation problems appear intractable. However, an analysis of the behavior of the lexical and semantic enhanced structural language model leads to a generalized inside-outside algorithm, and thus to rigorous, exact EM-type re-estimation of the composite language model parameters.
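For orientation, the classical inside pass for an ordinary PCFG, which the abstract's generalized inside-outside algorithm extends, can be sketched as follows. This is not the paper's algorithm: it assumes a grammar in Chomsky normal form, and the function name `inside_probability`, the dictionary layout, and the toy grammar are all hypothetical choices made for illustration only.

```python
from collections import defaultdict

# Hypothetical toy grammar in Chomsky normal form (illustration only).
TOY_LEXICAL = {("N", "dogs"): 0.5, ("N", "cats"): 0.5, ("V", "chase"): 1.0}
TOY_BINARY = {("S", "N", "VP"): 1.0, ("VP", "V", "N"): 1.0}


def inside_probability(words, lexical, binary, start="S"):
    """Return P(words) under a PCFG in Chomsky normal form.

    lexical maps (A, word) to P(A -> word).
    binary maps (A, B, C) to P(A -> B C).
    """
    n = len(words)
    # beta[(i, j)][A] is the probability that A derives words[i:j].
    beta = defaultdict(lambda: defaultdict(float))

    # Width-1 spans come from the lexical rules.
    for i, w in enumerate(words):
        for (a, word), p in lexical.items():
            if word == w:
                beta[(i, i + 1)][a] += p

    # Wider spans sum over every split point and every binary rule.
    for span in range(2, n + 1):
        for i in range(0, n - span + 1):
            j = i + span
            for k in range(i + 1, j):
                for (a, b, c), p in binary.items():
                    left = beta[(i, k)].get(b, 0.0)
                    right = beta[(k, j)].get(c, 0.0)
                    if left and right:
                        beta[(i, j)][a] += p * left * right

    return beta[(0, n)].get(start, 0.0)


if __name__ == "__main__":
    print(inside_probability(["dogs", "chase", "cats"], TOY_LEXICAL, TOY_BINARY))
```

The three nested loops over span, start position and split point give the cubic dependence on sentence length familiar from PCFG parsing. The composite model in the paper conditions on additional lexical and semantic context, so its inside quantities would carry more indices than the simple `beta` table used here.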
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wang, S., Wang, S., Cheng, L., Greiner, R., Schuurmans, D. (2006). Stochastic Analysis of Lexical and Semantic Enhanced Structural Language Model. In: Sakakibara, Y., Kobayashi, S., Sato, K., Nishino, T., Tomita, E. (eds) Grammatical Inference: Algorithms and Applications. ICGI 2006. Lecture Notes in Computer Science, vol 4201. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11872436_9
DOI: https://doi.org/10.1007/11872436_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-45264-5
Online ISBN: 978-3-540-45265-2
eBook Packages: Computer Science, Computer Science (R0)