ABSTRACT
In data mining, association rule mining has been proven to be a valuable strategy. The Apriori algorithm, proposed in 1993, is one of the most classic association rule mining algorithms and has been widely used in various domains to date. However, the algorithm itself requires multiple scans of the dataset, making the efficiency improvement of the algorithm an important topic. In this study, focusing on the association rule mining problem in shopping basket datasets, we address the efficiency and accuracy issues of the traditional Apriori algorithm in this domain and propose a novel improved Apriori algorithm. This algorithm enhances the accuracy and time efficiency of rule mining by introducing deletion and identification modules that exploit the characteristics of shopping basket datasets. Through mathematical reasoning and experimental validation, it is demonstrated that the improved Apriori algorithm achieves higher accuracy and time efficiency, thus proving its superior
- Baralis, Elena, Luca Cagliero, Tania Cerquitelli, Silvia Chiusano, Paolo Garza, Luigi Grimaudo, and Fabio Pulvirenti. "Nemico: Mining network data through cloud-based data mining techniques." In 2014 IEEE/ACM 7th International Conference on Utility and Cloud Computing, pp. 503-504. IEEE, 2014.Google Scholar
- Asif, Muhammad, and Jamil Ahmed. "Analysis of effectiveness of apriori and frequent pattern tree algorithm in software engineering data mining." In 2015 6th International Conference on Intelligent Systems, Modelling and Simulation, pp. 28-33. IEEE, 2015.Google Scholar
- Edastama, Primasatria, Ankur Singh Bist, and Ari Prambudi. "Implementation of data mining on glasses sales using the apriori algorithm." International Journal of Cyber and IT Service Management 1, no. 2, 2021: 159-172.Google ScholarCross Ref
- Chen, Yingchun, Yuchen Dai, Jingliang Xue, and Fang Dong. "Blind identification of network protocols based on improved Apriori algorithm." In IOP Conference Series: Materials Science and Engineering, vol. 740, no. 1, p. 012040. IOP Publishing, 2020.Google Scholar
- Wang, Hongqin. "Correlation analysis of College students’ achievement based on improved Apriori algorithm." In Journal of Physics: Conference Series, vol. 1848, no. 1, p. 012020. IOP Publishing, 2021.Google ScholarCross Ref
- Funcion, Devine Grace Doble. "Apriori algorithm application on the prevalence of computer malware." Indian Journal of Science and Technology 12, no. 17, 2019: 1-6.Google ScholarCross Ref
- Jia, Yubo, Guanghu Xia, Hongdan Fan, Qian Zhang, and Xu Li. "An improved apriori algorithm based on association analysis." In 2012 Third International Conference on Networking and Distributed Computing, pp. 208-211. IEEE, 2012.Google Scholar
- Wang, Xiaoli, Kui Su, and Lirong Su. "Research on Improved Apriori Algorithm Based on Data Mining in Electronic Cases." International Journal of Healthcare Information Systems and Informatics (IJHISI) 14, no. 3, 2019: 16-28.Google ScholarCross Ref
- Wang, C., and X. Zheng. "Application of improved time series Apriori algorithm by frequent itemsets in association rule data mining based on temporal constraint. Evol. Intell. 13 (1), 39–49, 2019." 2020.Google ScholarCross Ref
- Shoujian, Yu, and Zhou Yiyang. "Aprefixed-itemset-based improvement for apriori algorithm." In College Of Computer Science And Technology. Donghua University, 2016.Google Scholar
- Zhu, Xiaoyun, and Shuping Luo. "The influence of computer network technology on national income distribution under the background of social economy." Computer Communications 177, 2021: 166-175.Google ScholarDigital Library
- Sun, Li-na. "An improved apriori algorithm based on support weight matrix for data mining in transaction database." Journal of Ambient Intelligence and Humanized Computing 11, 2020: 495-501.Google ScholarCross Ref
- Raj, Shashi, Dharavath Ramesh, and Krishan Kumar Sethi. "A Spark-based Apriori algorithm with reduced shuffle overhead." The Journal of Supercomputing 77, no. 1, 2021: 133-151.Google ScholarDigital Library
- Guo, Hong, Hong Liu, JiaYou Chen, and Yan Zeng. "Data mining and risk prediction based on apriori improved algorithm for lung cancer." Journal of Signal Processing Systems 93, no. 7, 2021: 795-809.Google ScholarDigital Library
- Wang, ZhiChao, Qing Tian, and Xinxing Duan. "Research on the evaluation index system of college students’ class teaching quality based on association algorithm." Cluster Computing 22, 2019: 13797-13803.Google ScholarCross Ref
- Hou, Haibing, and Shenghui Zhou. "Integration and Optimization of Multimedia Network-Assisted English Teaching Resources Based on Association Rule Algorithm." Mobile Information Systems 2022, 2022.Google Scholar
- Singh, Sudhakar, Rakhi Garg, and P. K. Mishra. "Performance optimization of MapReduce-based Apriori algorithm on Hadoop cluster." Computers & Electrical Engineering 67, 2018: 348-364.Google ScholarDigital Library
- Guo, Wenhao, Xiaoqing Zuo, Jianwei Yu, and Baoding Zhou. "Method for mid-long-term prediction of landslides movements based on optimized apriori algorithm." Applied Sciences 9, no. 18, 2019: 3819.Google ScholarCross Ref
- Cai, Qiuru. "Cause analysis of traffic accidents on urban roads based on an improved association rule mining algorithm." IEEE Access 8, 2020: 75607-75615.Google ScholarCross Ref
- Yulanda, R. D., Wahyuningsih, S., & Amijaya, F. D. T. 2019, July. Association rules with apriori algorithm and hash-based algorithm. In Journal of Physics: Conference Series (Vol. 1277, No. 1, p. 012048). IOP Publishing.Google Scholar
Index Terms
- Research and Application of an Improved Apriori Algorithm in Market Basket Data
Recommendations
Mining Association Rules Based on Apriori Algorithm and Application
IFCSTA '09: Proceedings of the 2009 International Forum on Computer Science-Technology and Applications - Volume 01In the data mining research, mining association rules is an important topic. Apriori algorithm submitted by Agrawal and R. Srikant in 1994 is the most effective algorithm. Aimed at two problems of discovering frequent itemsets in a large database and ...
Real Market Basket Analysis using Apriori and Frequent Pattern Tree Algorithm
IC3INA '21: Proceedings of the 2021 International Conference on Computer, Control, Informatics and Its ApplicationsRecently, data mining has been implemented in various fields, including business and telecommunications. Data mining is a technique for extracting and detecting patterns in massive data sets that combines machine learning, statistics, and database ...
Mining Frequent Itemsets Using Improved Apriori on Spark
ICISDM '19: Proceedings of the 2019 3rd International Conference on Information System and Data MiningFinding the frequent itemset is one of the most investigated extents of data mining. The Apriori algorithm is the most established algorithm for frequent itemset mining, but it has issues regarding scanning frequent databases and generating a large ...
Comments