Minimum Message Length

  • Reference work entry
Encyclopedia of Machine Learning and Data Mining

Abstract

The Minimum Message Length (MML) Principle is an information-theoretic approach to induction, hypothesis testing, model selection, and statistical inference. MML, which provides a formal specification for the implementation of Occam’s Razor, asserts that the ‘best’ explanation of observed data is the shortest. MML was first published by Chris Wallace and David Boulton in 1968.
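As a rough illustration of "the shortest explanation wins", the sketch below scores each candidate hypothesis H by a two-part message length, -log2 P(H) - log2 P(D | H) bits: the cost of stating the hypothesis plus the cost of stating the data given it. The coin-toss data, the two candidate hypotheses, and their prior probabilities are assumptions invented for this example, and the code compares only a fixed pair of hypotheses rather than performing full MML estimation over a continuous parameter space.

```python
import math


def message_length_bits(prior, likelihood):
    """Two-part message length in bits: -log2 P(H) - log2 P(D | H)."""
    return -math.log2(prior) - math.log2(likelihood)


def bernoulli_likelihood(p_heads, heads, tails):
    """Probability of one particular toss sequence with these counts."""
    return (p_heads ** heads) * ((1.0 - p_heads) ** tails)


# Hypothetical data: 18 heads and 2 tails in 20 tosses.
heads, tails = 18, 2

# Hypothetical hypotheses, each as (prior probability, P(heads)).
hypotheses = {
    "fair coin (p=0.5)": (0.9, 0.5),
    "biased coin (p=0.9)": (0.1, 0.9),
}

# Total message length for each hypothesis.
lengths = {
    name: message_length_bits(prior, bernoulli_likelihood(p, heads, tails))
    for name, (prior, p) in hypotheses.items()
}

# The preferred explanation is the one with the shortest total message.
best = min(lengths, key=lengths.get)

for name, bits in lengths.items():
    print(f"{name}: {bits:.2f} bits")
print("shortest message:", best)
```

Under these assumed numbers the biased-coin hypothesis costs more bits to state, because its prior is lower, but it encodes the data so much more cheaply that its total message is shorter.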

Recommended Reading

  • Allison L (2009) MML website. http://www.allisons.org/ll/MML/

  • Dowe DL, Gardner SB, Oppy G (2007) Bayes not bust!: why simplicity is no problem for Bayesians. Brit J Phil Sci 58:709–754

  • Dowty JG (2013) SMML estimators for 1-dimensional continuous data. Comput J. doi:10.1093/comjnl/bxt145

  • Dai H, Korb KB, Wallace CS, Wu X (1997) A study of causal discovery with weak links and small samples. In: Proceedings of the fifteenth international joint conference on artificial intelligence. Morgan Kaufmann, San Francisco, pp 1304–1309

  • Edgoose T, Allison L (1999) MML Markov classification of sequential data. Stat Comput 9(4):269–278

  • Farr GE, Wallace CS (2002) The complexity of strict minimum message length inference. Comput J 45(3):285–292

  • Grünwald P (2008) The minimum description length principle. MIT Press, Cambridge

  • Honkela A, Valpola H (2004) Variational learning and bits-back coding: an information-theoretic view to Bayesian learning. IEEE Trans Neural Netw 15(4):800–810

  • Lanterman AD (2001) Schwarz, Wallace and Rissanen: intertwining themes in theories of model selection. Int Stat Rev 69(2):185–212

  • MML software: www.datamining.monash.edu.au/software, http://allisons.org/ll/Images/People/Wallace/FactorSnob/

  • Neil JR, Wallace CS, Korb KB (1999) Learning Bayesian networks with restricted interactions. In: Laskey KB, Prade H (eds) Proceedings of the fifteenth conference on uncertainty in artificial intelligence (UAI-99), Stockholm, pp 486–493

  • O’Donnell R, Allison L, Korb K (2006) Learning hybrid Bayesian networks by MML. Lecture notes in computer science: AI 2006 – Advances in artificial intelligence, vol 4304. Springer, Berlin/New York, pp 192–203

  • Wallace CS (1990) Classification by minimum-message length inference. In: Akl SG et al (eds) Advances in computing and information-ICCI 1990. No. 468 in lecture notes in computer science. Springer, Berlin

  • Wallace CS (2005) Statistical & inductive inference by MML. Information sciences and statistics. Springer, New York

  • Wallace CS, Boulton DM (1968) An information measure for classification. Comput J 11:185–194

  • Wallace CS, Boulton DM (1975) An information measure for single-link classification. Comput J 18(3):236–238

  • Wallace CS, Dowe DL (1999) Minimum message length and Kolmogorov complexity. Comput J 42(4):330–337

  • Wallace CS, Freeman PR (1987) Estimation and inference by compact coding. J R Stat Soc Ser B 49:240–252

  • Wallace CS, Patrick JD (1993) Coding decision trees. Mach Learn 11:7–22

Author information

Corresponding author

Correspondence to Rohan A. Baxter.

Copyright information

© 2017 Springer Science+Business Media New York

About this entry

Cite this entry

Baxter, R.A. (2017). Minimum Message Length. In: Sammut, C., Webb, G.I. (eds) Encyclopedia of Machine Learning and Data Mining. Springer, Boston, MA. https://doi.org/10.1007/978-1-4899-7687-1_547