Abstract
Today mobile network and various smart devices flourish rapidly. Data collected from the mobile devices and network can bring us huge opportunities to understand some significant characteristics of the users which traditional data cannot. Association rules mining is an extremely important topic in data mining that can make the utmost value of massive data effectively. Apriori algorithm and the improved Apriori ones based on Boolean matrix are the representative ones in association rules mining. Nevertheless, these solutions have their problems. In this paper, we have proposed an algorithm called MAR-DPS, which has some deep pruning strategies containing three methods to compress the size of frequent itemsets and reduce the joining numbers in generating new frequent itemsets. It can also select the appropriate method to generate frequent 2-itemsets when facing different data sets. Extensive experimental results on three different data sets have demonstrated that our MAR-DPS can perform much better than other tested algorithms.
Similar content being viewed by others
Notes
Mushroom Dataset: http://archive.ics.uci.edu/ml/datasets/Mushroom.
References
Niemegeers, I. G., & Groot, S. M. H. D. (2002). From personal area networks to personal networks: A user oriented approach. Wireless Personal Communications, 22(2), 175–186.
Han, J., Pei, J., & Kamber, M. (2011). Data mining: Concepts and techniques. London: Elsevier.
Agrawal, R., Imilienski, T., & Swami, A. (1993). Mining association rules between sets of items in large databases. In Proceedings of the ACM SIGMOD international conference on the management of data (pp. 207–216).
Agrawal, R., & Srikant, R. (1994). Fast algorithms for mining association rules. In Proceedings of the international conference on very large databases (pp. 487–499).
Brin, S., Motwani, R., Ullman, J., & Tsur, S. (1997). Dynamic itemset counting and implication rules for market basket data. In ACM SIGMOD conference management of data (1997).
Pei, J., Han, J., & Mao, R. (2000) Closet: An efficient algorithm for mining frequent closed itemsets. In : SIGMOD international workshop on data mining and knowledge discovery (2000).
Zaki, M. J., & Hsiao, C. J. (2002) CHARM: An efficient algorithm for closed it emset mining. In Proceedings of the 2002 SIAM international conference on data mining (pp. 457–473). Society for Industrial and Applied Mathematics.
Borgelt, C. (2003). Efficient implementations of Apriori and Eclat. In Proceddings iEEE ICDM workshop on frequent item set mining implementations. CEUR workshop proceedings (p. 90).
Jinjing, H., Lei, Z. Jiwen, Y., et al. (2008). An improved apriori algorithm. In The third international conference on computer science and education (ICCSE’2008) (pp. 604–608).
Anshu, C., & Raghuvanshi, C. S. (2010). An algorithm for frequent pattern mining based on apriori. International Journal on Computer Science and Engineering, 2(04), 942–947.
Abaya, S. A. (2012). Association rule mining based on Apriori algorithm in minimizing candidate generation. International Journal of Scientific and Engineering Research, 3(7), 1–4.
Chao, L., & Zhaoping, Y. (2006). The improvement apriori algorithm based on Boolean matrix. Computer Engineering, 32(23), 68–69.
Jha, J., & Ragha, L. (2013). Educational data mining using improved apriori algorithm. International Journal of Information and Computation Technology, 3(5), 411–418.
Dutt, S., Choudhary, N., & Singh, D. (2014). An improved apriori algorithm based on matrix data structure. Global Journal of Computer Science and Technology, 14(5), 6–10.
Shoujian, Y. (2017). The improvement of Apriori algorithm based on prefix itemsets. Journal of computer applications and software, 2, 290–294.
Niu, K., Jiao, H., & Gao, Z., et al. (2017). A developed apriori algorithm based on frequent matrix. In Proceedings of the 5th international conference on bioinformatics and computational biology (pp. 55–58). ACM.
Ying, C., & Zhigang, M. (2016). Improved apriori algorithm based on vector matrix optimization frequent items. Journal of Jilin University (Science Edition), 54(2), 349–353.
Hongli, Z. (2015). The improved apriori algorithm in the analysis of college students psychological research. Wuhan: Central China normal university.
Zhigang, Y., & Yueshun, H. (2010). An improved apriori algorithm based on the compressed transaction matrix multiplication. China’s New Technology and New Products, 6, 57–58.
Lvhui, H., Yulan, R., & Zhenlin, H. (2015). Improved apriori algorithm based on classification and database compression. Journal of ChengDu University of Technology Science & Technology Edition, 42(1), 110–114.
Cao, L., et al. (2011). Combined mining: Discovering informative knowledge in complex data. IEEE Transactions on Systems, Man, and Cybernetics Part B (Cybernetics), 41(3), 699–712.
Lv, Y., Chen, Y., Zhang, X., et al. (2017). Social media based transportation research: The state of the work and the networking. IEEE/CAA Journal of Automatica Sinica, 4(1), 19–26.
Acknowledgements
This work was supported by the National Social Science Foundation of China under Grant 16ZDA055; National Natural Science Foundation of China under Grant 91546121, 71231002 and 61202247; EU FP7 IRSES MobileCloud Project 612212; the 111 Project of China under Grant B08004; Engineering Research Center of Information Networks, Ministry of Education; the project of Beijing Institute of Science and Technology Information; the project of CAPINFO Company Limited.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Li, L., Li, Q., Wu, Y. et al. Mining Association Rules Based on Deep Pruning Strategies. Wireless Pers Commun 102, 2157–2181 (2018). https://doi.org/10.1007/s11277-017-5169-0
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11277-017-5169-0