Abstract
Today, the high competitiveness in retail businesses requires them to seek new strategies to ensure their survival. To this end, organizations have understood that the data located in their transactional databases can be used as raw material to boost business growth, if they can be exploited properly. The research’s main objective is to apply Data Mining techniques for the discovery of association rules from purely commercial transactional data, taking as a study period 10-year in a household appliances and furniture retail entity. The selection’s phase and preparation data are described as well as its cost in man/hours. In the modeling phase, the Apriori and Eclat algorithms implemented in the arules package of the R tool were executed, where both the resulting associations and execution time were compared. The results show relevant patterns in the buying behavior of customers such as those that relate items and accessories’ prices.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Douglas, H.: Retail—origin and meaning of retail by online etymology dictionary. https://goo.gl/zzwvu2. Accessed 25 May 2018
Giménez, G.G.: González giménez y cia. https://goo.gl/MY3oVv. Accessed 05 July 2018
Jiawei, H., Kamber, M.: Data Mining Concepts and Techniques. Morgan Kaufmann, San Francisco (2002)
Kim, J., Ale, J.: Descubrimiento incremental de las reglas de asociación temporales. In: X Congreso Argentino de Ciencias de la Computación (2004)
Witten, I., Frank, E.: Data Mining: Practical Machine Learning Tools with Java Implementations. Morgan Kaufmann, San Francisco (2000)
Fayyad, U., Irani, K.: Multi-interval discretization of continuous-valued attributes for classification learning. In: 13th International Joint Conference on Artificial Intelligence (IJCAI 1993), pp. 1022–1027 (1993)
Witten, I., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques, 2nd edn. Morgan Kaufmann, San Francisco (2005)
Agrawal, R., Imielinski, T., Swami, A.: Mining association rules between sets of items in large databases. ACM SIGMOD Rec. 22, 207–216 (1993)
Heaton, J.: Comparing dataset characteristics that favor the Apriori, Eclat or FP-Growth frequent itemset mining algorithms. SoutheastCon, pp. 1–7 (2016)
Schmidt-Thieme, L.: Algorithmic features of Eclat. In: FIMI (2004)
Agrawal, R., Srikant, R.: Fast algorithms for mining association rules. In: Proceedings of the 20th International Conference Very Large Data Bases, VLDB, pp. 487–499 (1994)
Zaki, M., Parthasarathy, S., Ogihara, M., Li, W.: New algorithms for fast discovery of association rules. In: Third International Conference on Knowledge Discovery and Data Mining, pp. 283–286 (1997)
Ultima Hora, P.: González giménez expande sus productos y servicios (2008). https://goo.gl/LcmVvn. Accessed 17 June 2018
Martino, E.: Law num 1352/88. https://bit.ly/2ucWM1u. Accessed 18 June 2018
SET: Lista de pequenos contribuyentes. https://goo.gl/Fpqny5. Accessed 17 July 2018
Sosa-Cabrera, G., García-Torres, M., Gómez, S., Schaerer, C., Divina, F.: Understanding a version of multivariate symmetric uncertainty to assist in feature selection. In: Conference of Computational Interdisciplinary Science (2016)
Moine, J., Gordillo, S., Haedo, A.: Análisis comparativo de metodologías para la gestión de proyectos de minería de datos. In: Congreso Argentino de Ciencias de la Computación (2011)
Báez, J., et al.: Descubriendo reglas de asociación en bases de datos del sector retail usando R. In: Libro de Actas XXIV Congreso Argentino de Ciencias de la Computación, CACIC 2018, pp. 432–441. Red de Universidades con Carreras en Informática, RedUNCI. Facultad de Ciencias Exactas, Universidad Nacional del Centro de la Provincia de Buenos Aires (2018)
Han, J., Pei, J., Kamber, M.: Data Mining: Concepts and Techniques, 3rd edn. Morgan Kaufmann, Burlington (2012)
Hahsler, M., Buchta, C., Gruen, B., Hornik, K., Johnson, I., Borgelt, C.: arules: mining association rules and frequent itemsets. https://cran.r-project.org/package=arules. Accessed 03 May 2018
BBC: Black friday: por qué el viernes negro se llama así y otras 4 curiosidades sobre el famoso día de compras (2018). https://bbc.in/2AbApMP. Accessed 17 Jan 2019
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Acuña, J.M.B., Cabañas, C.A.P., Sosa-Cabrera, G., García-Díaz, M.E. (2019). Discovering Association Rules Using R. A Case Study on Retail’s Database. In: Pesado, P., Aciti, C. (eds) Computer Science – CACIC 2018. CACIC 2018. Communications in Computer and Information Science, vol 995. Springer, Cham. https://doi.org/10.1007/978-3-030-20787-8_14
Download citation
DOI: https://doi.org/10.1007/978-3-030-20787-8_14
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-20786-1
Online ISBN: 978-3-030-20787-8
eBook Packages: Computer ScienceComputer Science (R0)