Skip to main content

Descriptive Analysis of Gambling Data for Data Mining of Behavioral Patterns

  • Conference paper
  • First Online:
Proceedings of the Second International Conference on Innovations in Computing Research (ICR’23)

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 721))

  • 305 Accesses

Abstract

The use of data analytics methods for behavioral analysis of gamblers has been of interest in the gambling field. Most of the research on this topic has been conducted using self-reported survey data due to the limited availability of quantitative data such as behavioral tracking data. To fill in this gap, we describe a dataset comprising financial payments records for modeling behavioral patterns of gamblers using quantifiable variables. This data has been obtained from a digital payments provider, which acts as an intermediary between customers’ banks and gambling merchants. In this paper, we provide a descriptive analysis of this data comprising its distribution with respect to transaction volume and amounts, outlier analysis, auto-correlation analysis, and stationarity analysis. From this analysis, we conclude that this data is right skewed with the largest number of transactions taking place after 2019. We also conclude that the data is non-stationary and does not exhibit any significant auto-regressive characteristics. Stationarity and seasonality for this data will need to be addressed for applying statistical time-series forecasting models. It is worth noting that this data is limited to customers in the USA and only includes details on money committed to gambling and not detailed betting behavior. It also does not take account other methods of payments available to customers and the possibility of customers having multiple accounts with the same payments provider. Additionally, since merchant IDs and customer IDs have been obfuscated, further analysis on merchants and specific customers could be impacted.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Al-Hashedi, K.G., Magalingam, P.: Financial fraud detection applying data mining techniques: a comprehensive review from 2009 to 2019. Comput. Sci. Rev. 40, 100402 (2021)

    Article  Google Scholar 

  2. Chatfield, C.: Time-Series Forecasting. Chapman and Hall/CRC (2000)

    Google Scholar 

  3. Chatfifield, C., Xing, H.: The analysis of time series: an introduction with r (2019)

    Google Scholar 

  4. Chen, N., Ribeiro, B., Chen, A.: Financial credit risk assessment: a recent review. Artif. Intell. Rev. 45, 1–23 (2016)

    Article  Google Scholar 

  5. Ghaharian, K., et al.: Applications of data science for responsible gambling: a scoping review. International Gambling Studies 0(0), 1–24 (2022). https://doi.org/10.1080/14459795.2022.2135753

  6. Haeusler, J.: Follow the money: using payment behaviour as predictor for future self-exclusion. Int. Gambl. Stud. 16(2), 246–262 (2016)

    Article  Google Scholar 

  7. Hyndman, R.J., Athanasopoulos, G.: Forecasting: principles and practice. OTexts (2018)

    Google Scholar 

  8. Kwiatkowski, D., Phillips, P.C., Schmidt, P., Shin, Y.: Testing the null hypothesis of stationarity against the alternative of a unit root: how sure are we that economic time series have a unit root? J. Econometr. 54(1–3), 159–178 (1992)

    Article  MATH  Google Scholar 

  9. Lim, B., Zohren, S.: Time-series forecasting with deep learning: a survey. Phil. Trans. R. Soc. A 379(2194), 20200209 (2021)

    Article  MathSciNet  Google Scholar 

  10. Muggleton, N., Parpart, P., Newall, P., Leake, D., Gathergood, J., Stewart, N.: The association between gambling and financial, social and health outcomes in big financial data. Nat. Hum. Behav. 5(3), 319–326 (2021)

    Article  Google Scholar 

  11. Mushtaq, R.: Augmented dickey fuller test (2011)

    Google Scholar 

  12. Oates, T., Firoiu, L., Cohen, P.R.: Clustering time series with hidden markov models and dynamic time warping. In: Proceedings of the IJCAI-99 workshop on neural, symbolic and reinforcement learning methods for sequence learning. vol. 17, p. 21. Citeseer (1999)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Piyush Puranik .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Puranik, P., Taghva, K., Ghaharian, K. (2023). Descriptive Analysis of Gambling Data for Data Mining of Behavioral Patterns. In: Daimi, K., Al Sadoon, A. (eds) Proceedings of the Second International Conference on Innovations in Computing Research (ICR’23). Lecture Notes in Networks and Systems, vol 721. Springer, Cham. https://doi.org/10.1007/978-3-031-35308-6_4

Download citation

Publish with us

Policies and ethics