Skip to main content

Discovering Association Rules Using R. A Case Study on Retail’s Database

  • Conference paper
  • First Online:
Computer Science – CACIC 2018 (CACIC 2018)

Abstract

Today, the high competitiveness in retail businesses requires them to seek new strategies to ensure their survival. To this end, organizations have understood that the data located in their transactional databases can be used as raw material to boost business growth, if they can be exploited properly. The research’s main objective is to apply Data Mining techniques for the discovery of association rules from purely commercial transactional data, taking as a study period 10-year in a household appliances and furniture retail entity. The selection’s phase and preparation data are described as well as its cost in man/hours. In the modeling phase, the Apriori and Eclat algorithms implemented in the arules package of the R tool were executed, where both the resulting associations and execution time were compared. The results show relevant patterns in the buying behavior of customers such as those that relate items and accessories’ prices.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Douglas, H.: Retail—origin and meaning of retail by online etymology dictionary. https://goo.gl/zzwvu2. Accessed 25 May 2018

  2. Giménez, G.G.: González giménez y cia. https://goo.gl/MY3oVv. Accessed 05 July 2018

  3. Jiawei, H., Kamber, M.: Data Mining Concepts and Techniques. Morgan Kaufmann, San Francisco (2002)

    MATH  Google Scholar 

  4. Kim, J., Ale, J.: Descubrimiento incremental de las reglas de asociación temporales. In: X Congreso Argentino de Ciencias de la Computación (2004)

    Google Scholar 

  5. Witten, I., Frank, E.: Data Mining: Practical Machine Learning Tools with Java Implementations. Morgan Kaufmann, San Francisco (2000)

    Google Scholar 

  6. Fayyad, U., Irani, K.: Multi-interval discretization of continuous-valued attributes for classification learning. In: 13th International Joint Conference on Artificial Intelligence (IJCAI 1993), pp. 1022–1027 (1993)

    Google Scholar 

  7. Witten, I., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques, 2nd edn. Morgan Kaufmann, San Francisco (2005)

    MATH  Google Scholar 

  8. Agrawal, R., Imielinski, T., Swami, A.: Mining association rules between sets of items in large databases. ACM SIGMOD Rec. 22, 207–216 (1993)

    Article  Google Scholar 

  9. Heaton, J.: Comparing dataset characteristics that favor the Apriori, Eclat or FP-Growth frequent itemset mining algorithms. SoutheastCon, pp. 1–7 (2016)

    Google Scholar 

  10. Schmidt-Thieme, L.: Algorithmic features of Eclat. In: FIMI (2004)

    Google Scholar 

  11. Agrawal, R., Srikant, R.: Fast algorithms for mining association rules. In: Proceedings of the 20th International Conference Very Large Data Bases, VLDB, pp. 487–499 (1994)

    Google Scholar 

  12. Zaki, M., Parthasarathy, S., Ogihara, M., Li, W.: New algorithms for fast discovery of association rules. In: Third International Conference on Knowledge Discovery and Data Mining, pp. 283–286 (1997)

    Google Scholar 

  13. Ultima Hora, P.: González giménez expande sus productos y servicios (2008). https://goo.gl/LcmVvn. Accessed 17 June 2018

  14. Martino, E.: Law num 1352/88. https://bit.ly/2ucWM1u. Accessed 18 June 2018

  15. SET: Lista de pequenos contribuyentes. https://goo.gl/Fpqny5. Accessed 17 July 2018

  16. Sosa-Cabrera, G., García-Torres, M., Gómez, S., Schaerer, C., Divina, F.: Understanding a version of multivariate symmetric uncertainty to assist in feature selection. In: Conference of Computational Interdisciplinary Science (2016)

    Google Scholar 

  17. Moine, J., Gordillo, S., Haedo, A.: Análisis comparativo de metodologías para la gestión de proyectos de minería de datos. In: Congreso Argentino de Ciencias de la Computación (2011)

    Google Scholar 

  18. Báez, J., et al.: Descubriendo reglas de asociación en bases de datos del sector retail usando R. In: Libro de Actas XXIV Congreso Argentino de Ciencias de la Computación, CACIC 2018, pp. 432–441. Red de Universidades con Carreras en Informática, RedUNCI. Facultad de Ciencias Exactas, Universidad Nacional del Centro de la Provincia de Buenos Aires (2018)

    Google Scholar 

  19. Han, J., Pei, J., Kamber, M.: Data Mining: Concepts and Techniques, 3rd edn. Morgan Kaufmann, Burlington (2012)

    MATH  Google Scholar 

  20. Hahsler, M., Buchta, C., Gruen, B., Hornik, K., Johnson, I., Borgelt, C.: arules: mining association rules and frequent itemsets. https://cran.r-project.org/package=arules. Accessed 03 May 2018

  21. BBC: Black friday: por qué el viernes negro se llama así y otras 4 curiosidades sobre el famoso día de compras (2018). https://bbc.in/2AbApMP. Accessed 17 Jan 2019

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Gustavo Sosa-Cabrera .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Acuña, J.M.B., Cabañas, C.A.P., Sosa-Cabrera, G., García-Díaz, M.E. (2019). Discovering Association Rules Using R. A Case Study on Retail’s Database. In: Pesado, P., Aciti, C. (eds) Computer Science – CACIC 2018. CACIC 2018. Communications in Computer and Information Science, vol 995. Springer, Cham. https://doi.org/10.1007/978-3-030-20787-8_14

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-20787-8_14

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-20786-1

  • Online ISBN: 978-3-030-20787-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics