Skip to main content

Minimum Message Length Criterion for Second-Order Polynomial Model Discovery

  • Conference paper
  • First Online:
Knowledge Discovery and Data Mining. Current Issues and New Applications (PAKDD 2000)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 1805))

Included in the following conference series:

Abstract

This paper proposes a method based on the Minimum Message Length (MML) Principle for the task of discovering polynomial models up to the second order. The method is compared with a number of other selection criteria in the ability to, in an automated manner, discover a model given the generated data. Of particular interest is the ability of the methods to discover (1) second-order independent variables, (2) independent variables with weak causal relationships with the target variable given a small sample size, and (3) independent variables with weak links to the target variable but strong links from other variables which are not directly linked with the target variable. A common non-backtracking search strategy has been developed and is used with all of the model selection criteria.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. H. Akaike. Information theory and an extension of the Maximum Likelihood principle. In B.N. Petrov and F. Csaki, editors, Proc. 2nd Int. Symp. Information Thy., pages 267–281, 1973.

    Google Scholar 

  2. R. Baxter and D. Dowe. Model selection in linear regression using the MML criterion. Technical Report 276, School of Computer Science and Software Engineering, Monash University, 1996.

    Google Scholar 

  3. H. Bozdogan. Model selection and akaike’s information criterion (AIC): the general theory and its analytical extensions. Psychometrika, 52(3):345–370, 1987.

    Article  MATH  MathSciNet  Google Scholar 

  4. J.H. Conway and N.J.A. Sloane. Sphere Packings, Lattices and Groups. Springer-Verlag, New York, 1988.

    MATH  Google Scholar 

  5. M. Ezekiel. Methods of Correlation Analysis. Wiley, New York, 1930.

    MATH  Google Scholar 

  6. C. Mallows. Some comments on Cp. Technometrics, 15:661–675, 1973.

    Article  MATH  Google Scholar 

  7. A.J. Miller. Subset Selection in Regression. Chapman and Hall, London, 1990.

    MATH  Google Scholar 

  8. J. Rissanen. Modeling by shortest data description. Automatica, 14:465–471, 1978.

    Article  MATH  Google Scholar 

  9. J. Rissanen. Stochastic complexity. Journal of the Royal Statistical Society B, 49(1):223–239, 1987.

    MATH  MathSciNet  Google Scholar 

  10. T. Ryan. Modern Regression Methods. John Wiley & Sons, New York, 1997.

    MATH  Google Scholar 

  11. G. Schwarz. Estimating the dimension of a model. Annals of Statistics, 6:461–464, 1978.

    MATH  MathSciNet  Google Scholar 

  12. V. Vapnik. The Nature of Statistical Learning Theory. Springer, New York, 1995.

    MATH  Google Scholar 

  13. C.S. Wallace. On the selection of the order of a polynomial model, unpublished technical report, Royal Holloway College, 1997.

    Google Scholar 

  14. C.S. Wallace and P.R. Freeman. Estimation and inference by compact coding. Journal of the Royal Statistical Society B, 49(1):240–252, 1987.

    MATH  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2000 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Rumantir, G.W. (2000). Minimum Message Length Criterion for Second-Order Polynomial Model Discovery. In: Terano, T., Liu, H., Chen, A.L.P. (eds) Knowledge Discovery and Data Mining. Current Issues and New Applications. PAKDD 2000. Lecture Notes in Computer Science(), vol 1805. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45571-X_7

Download citation

  • DOI: https://doi.org/10.1007/3-540-45571-X_7

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-67382-8

  • Online ISBN: 978-3-540-45571-4

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics