Data Mining Model Comparison

Giudici, Paolo

doi:10.1007/978-0-387-09823-4_32

Data Mining Model Comparison

Paolo Giudici³

Chapter
First Online: 01 January 2010

16k Accesses
1 Citations

Summary

The aim of this contribution is to illustrate the role of statistical models and, more generally, of statistics, in choosing a Data Mining model. After a preliminary introduction on the distinction between Data Mining and statistics, we will focus on the issue of how to choose a Data Mining methodology. This well illustrates how statistical thinking can bring real added value to a Data Mining analysis, as otherwise it becomes rather difficult to make a reasoned choice. In the third part of the paper we will present, by means of a case study in credit risk management, how Data Mining and statistics can profitably interact.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 349.00; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Akaike, H. A new look at statistical model identification. IEEE Transactions on Automatic Control 1974; 19: 716-723
Article MATH MathSciNet Google Scholar
Bernardo, J.M. and Smith, A.F.M., Bayesian Theory. New York: Wiley, 1994.
Book MATH Google Scholar
Bickel, P.J. and Doksum, K.A., Mathematical Statistics. New Jersey: Prentice and Hall, 1977.
MATH Google Scholar
Castelo, R. and Giudici, P., Improving Markov chain model search for Data Mining. Machine Learning,50:127-158,2003.
Article MATH Google Scholar
Giudici, P., Applied Data Mining. London: Wiley, 2003.
MATH Google Scholar
Giudici P., Castelo R.. Association models for web mining, Data mining and knowledge discovery, 5, 183-196, 2001.
Article MATH Google Scholar
Hand, D.J.,Mannila, H. and Smyth, P., Principles of Data Mining. New York: MIT press, 2001.
Google Scholar
Hand, D. Construction and assessment of classification rules. London: Wiley, 1997.
MATH Google Scholar
Hastie, T., Tibshirani, R., Friedman, J. The elements of statistical learning: Data Mining, inference and prediction. New York: Springer-Verlag, 2001.
MATH Google Scholar
Mood, A.M., Graybill, F.A. and Boes, D.C. Introduction to the theory of Statistics. Tokyo: McGraw Hill, 1991.
Google Scholar
Rokach, L., Averbuch, M., and Maimon, O., Information retrieval system for medical narrative reports. Lecture notes in artificial intelligence, 3055. pp. 217-228, Springer-Verlag (2004).
Google Scholar
Schwarz, G. Estimating the dimension of a model. Annals of Statistics 1978; 62: 461-464.
Article Google Scholar
Zucchini,W. An Introduction to Model Selection. Journal of Mathematical Psychology 2000; 44: 41-61
Article MATH Google Scholar

Download references

Author information

Authors and Affiliations

Faculty of Economics, University of Pavia, Pavia, Italy
Paolo Giudici

Authors

Paolo Giudici
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

, Dept. Industrial Engineering, Tel Aviv University, Ramat Aviv, 69978, Israel
Oded Maimon
, Dept. Information Systems Engineering, Ben-Gurion University of the Negev, Beer-Sheva, 84105, Israel
Lior Rokach

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Giudici, P. (2009). Data Mining Model Comparison. In: Maimon, O., Rokach, L. (eds) Data Mining and Knowledge Discovery Handbook. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-09823-4_32

Download citation

DOI: https://doi.org/10.1007/978-0-387-09823-4_32
Published: 07 July 2010
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-387-09822-7
Online ISBN: 978-0-387-09823-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics