Abstract
Smart grid computing environment is an information platform that has lots of production data, data management, and the real-time and non real-time data. Under such massive data environment, the classic Apriori algorithm of mining association rules has a significant performance bottleneck. After analyzing the Apriori algorithm, the MapReduce programming model is used to realize the parallel Apriori algorithm. In order to improve the mining efficiency further, auxiliary tables and attribute columns are added and parallel strategy is improved in the process of candidate itemsets generation. Simulation experiments show that the improved Apriori algorithm can effectively reduce the algorithm execution time and improve the efficiency of data mining under the massive data environment.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Zhu M (2002) Data mining. Hefei, University of Science and Technology of China Press
Yang X (2014) Improvement and application of association rules algorithm for big data. Comput Modern 12:23–26
Zhu A (2012) Study on improvement and transplantation of Apriori algorithm based on Hadoop. Huazhong University of Science and Technology, Wuhan
Li L, Zhang M (2011) Research on algorithms of mining association rule under cloud computing environment. Comput Technol Dev 21(2):43–46
Chen Y (2011) Data mining technology and application. Tsinghua University Press, Beijing
Xu Z, Liu M, Zhang S, Lu J, Ou Y (2004) Three kinds of optimization methods of Apriori algorithm. Comput Eng Appl 40(36):190–192
Xie G, Luo S (2010) Research based on the application of Hadoop MapReduce model. Microcomput Appl 29(8):4–7
Che B (2013) Key technology research-based the Hadoop of massive data processing. University of Electronic Science and Technology of China, Chengdu
Qian G, Jia R, Zhang R, Li L (2008) One optimized method of Apriori algorithm. Comput Eng 34(23):196–198
Liu H, Guo R, Jiang H (2009) Research and improvement of Apriori algorithm for mining association rules. Comput Appl Softw 26(1):146–149
Zhang S (2011) An Apriori-based algorithm of association rules based on cloud computing. Commun Technol 44(6):141–143
Yang C (2010) Research of data mining based on Hadoop. Chongqing University, Chongqing
Zhang Q (2015) An improved algorithm for finding frequent itemsets in association rule mining. Stat Decis 4:32–35
Chen L (2012) Parallel association rules algorithm based on Hadoop. Nanjing University, Nanjing
Mao W (2014) The research of parallel association rules mining algorithms based on cloud platform. East China University of Science and Technology, Shanghai
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Xu, Y., Zhan, R., Tan, G., Chen, L., Tian, B. (2020). An Improved Apriori Algorithm Research in Massive Data Environment. In: Xu, Z., Choo, KK., Dehghantanha, A., Parizi, R., Hammoudeh, M. (eds) Cyber Security Intelligence and Analytics. CSIA 2019. Advances in Intelligent Systems and Computing, vol 928. Springer, Cham. https://doi.org/10.1007/978-3-030-15235-2_113
Download citation
DOI: https://doi.org/10.1007/978-3-030-15235-2_113
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-15234-5
Online ISBN: 978-3-030-15235-2
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)