Skip to main content

Improving the Probabilistic Modeling of Market Basket Data

  • Conference paper
Advances in Data Analysis
  • 3753 Accesses

Abstract

Current approaches to market basket simulation neglect the fact that empty transactions are typically not recorded and therefore should not occur in simulated data. This paper suggest how the simulation framework without associations can be extended to avoid empty transactions and explores the possible consequences for several measures of interestingness used in association rule filtering.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • BAHADUR, R.R. (1961): A Representation of the Joint Distribution of Responses to n Dichotomous Items. In: H. Solomon (Ed.): Studies in Item Analysis and Prediction. Standford Mathematical Studies in the Social Sciences VI. Stanford University Press, Stanford.

    Google Scholar 

  • EMERICH, L.J. and PIEDMONT, M.R. (1991): A Method for Generating High-dimensional Multivariate Binary Deviates. Statistical Computing, 45, 302–304.

    Google Scholar 

  • HAHSLER, M., HORNIK, K. and REUTTERER, T. (2006): Implications of Probabilistic Data Modelling for Mining Association Rules. In: M. Spiliopoulou, R. Kruse, C. Borgelt, A. Nürnberger and W. Gaul (Eds.): From Data and Information Analysis to Knowledge Engineering. Springer, Berlin, 598–605.

    Chapter  Google Scholar 

  • HRUSCHKA, H. (1990): Bestimmung der Kaufverbundenheit mit Hilfe eines probabilistischen Messmodells. zfbf, 418–434.

    Google Scholar 

  • LEE, A.J. (1997): Some Simple Methods for Generating Correlated Categorical Deviates. Computational Statistics & Data Analysis, 25, 133–148.

    Article  Google Scholar 

  • LEE, A.J. (1993): Generating Random Binary Deviates Having Fixed Marginal Distributions and Specified Degrees of Association. The American Statistician, 47,3, 209–215.

    Google Scholar 

  • LEISCH, F., WEINGESSEL, A. and HORNIK, K. (1998): On the Generation of Correlated Artificial Binary Data. Technical Report 13, SFB Working Paper Series.

    Google Scholar 

  • OMAN, S.D. and ZUCKER, D.M. (2001): Modelling and Generating Correlated Binary Variables. Biometrica, 88,1, 287–290.

    Article  MathSciNet  MATH  Google Scholar 

  • ORASCH, M., LEISCH, F. and WEINGESSEL, A. (1998): On Specifying Correlation Matrices for Binary Data. Technical Report 53, SFB Working Paper Series.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Buchta, C. (2007). Improving the Probabilistic Modeling of Market Basket Data. In: Decker, R., Lenz, H.J. (eds) Advances in Data Analysis. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-70981-7_47

Download citation

Publish with us

Policies and ethics