Synonyms
Definition
Hidden Markov models (HMMs) form a class of statistical models in which the system being modeled is assumed to be a Markov process with hidden states. From observed output sequences generated by the Markov process, both the output emission probabilities from the hidden states and the transition probabilities between the hidden states can be estimated by using dynamic programming methods. The estimated model parameters can then be used for various sequence analysis purposes.
Motivation and Background
The states of a regular Markov model, named after Russian mathematician Andrey Markov (1865–1922), are directly observable, hence its only parameters are the state transition probabilities. In many real-world cases, however, the states of the system that one wants to model are not directly observable. For instance, in speech recognition the audio is the observable stream, while the goal is to discover the phonemes (the categorical elements of speech) that emitted the...
Recommended Reading
Baum, L. E., Petrie, T., Soules, G., & Weiss, N. (1970). A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains. Annals of Mathematical Statistics,41(1), 164–171.
Burge, C., & Karlin, S. (1997). Prediction of complete gene structures in human genomic DNA. Journal of Molecular Biology,268, 78–94.
Charniak, E. (1993). Statistical language learning. Cambridge, MA: MIT Press.
Church, K. W. (1988). A stochastic parts program and noun phrase parser for unrestricted text. In Proceedings of the second conference on applied natural language processing (pp. 136–143). Austin, TX.
Dempster, A., Laird, N., & Rubin, D. (1977). Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society, Series B, 39(1), 1–38.
Durbin, R., Eddy, S., Krogh, A., & Mitchison, G. (1998). Biological sequence analysis: Probabilistic models of proteins and nucleic acids. Cambridge, UK: Cambridge University Press.
Eddy, S. (2007). HMMER. http://hmmer.janelia.org/.
François, J.-M. (2006). JAHMM. http://www.run.montefiore.ulg.ac.be/~francois/software/jahmm/.
Freitag, D., & McCallum, A. (1999). Information extraction with HMM structures learned by stochastic optimization. In Proceedings of the national conference on artificial intelligence (pp. 584–589). Cambridge, MA: MIT Press.
Jelinek, F. (1998). Statistical methods for speech recognition. Cambridge, MA: MIT Press.
Kanungo, T. (1999). UMDHMM: Hidden Markov model toolkit. In A. Kornai (Ed.), Extended finite state models of language. Cambridge, UK: Cambridge University Press. URL: http://www.kanungo.us/software/software.html.
Kulp, D., Haussler, D., Reese, M. G., & Eeckman, F. H. (1996). A generalized hidden Markov model for the recognition of human genes in DNA. Proceedings of the International Conference on Intelligent Systems for Molecular Biology, 4, 134–142.
Manning, C., & Schütze, H. (1999). Foundations of statistical natural language processing. Cambridge, MA: MIT Press.
Rabiner, L. R. (1989). A tutorial on hidden Markov models and selected applications in speech recognition. Proceedings of the IEEE, 77(2), 257–286.
Viterbi, A. J. (1967). Error bounds for convolutional codes and an asymptotically optimum decoding algorithm. IEEE Transactions on Information Theory, 13(2), 260–269.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer Science+Business Media, LLC
About this entry
Cite this entry
Bosch, A. (2011). Hidden Markov Models. In: Sammut, C., Webb, G.I. (eds) Encyclopedia of Machine Learning. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-30164-8_362
Download citation
DOI: https://doi.org/10.1007/978-0-387-30164-8_362
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-387-30768-8
Online ISBN: 978-0-387-30164-8
eBook Packages: Computer ScienceReference Module Computer Science and Engineering