An Improved Apriori Algorithm Research in Massive Data Environment

Xu, Yu; Zhan, Ranzhi; Tan, Gang; Chen, Lu; Tian, BoJin

doi:10.1007/978-3-030-15235-2_113

Conference paper
First Online: 25 April 2019

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 928)

Abstract

The smart grid computing environment is an information platform that holds large volumes of production and management data, both real-time and non-real-time. In such a massive data environment, the classic Apriori algorithm for mining association rules suffers from a significant performance bottleneck. After analyzing the Apriori algorithm, the MapReduce programming model is used to implement a parallel Apriori algorithm. To improve mining efficiency further, auxiliary tables and attribute columns are added, and the parallel strategy is improved in the candidate itemset generation step. Simulation experiments show that the improved Apriori algorithm effectively reduces execution time and improves data mining efficiency in the massive data environment.
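
For orientation, the general idea of a MapReduce-style Apriori pass can be sketched in plain Python. This is only an illustrative sketch under stated assumptions, not the authors' implementation: it simulates the map and reduce phases in a single process rather than on Hadoop, the function names (`map_phase`, `reduce_phase`, `generate_candidates`, `apriori`) are hypothetical, and it does not reproduce the auxiliary tables, attribute columns, or improved parallel strategy of the paper, whose details are not given in the abstract.

```python
from collections import Counter
from itertools import combinations


def map_phase(split, candidates):
    """Mapper (simulated): emit (candidate, 1) for every candidate itemset
    contained in a transaction of this data split."""
    for transaction in split:
        items = set(transaction)
        for cand in candidates:
            if cand <= items:
                yield cand, 1


def reduce_phase(pairs, min_count):
    """Reducer (simulated): sum the counts per candidate and keep only
    candidates whose support count reaches min_count."""
    totals = Counter()
    for cand, n in pairs:
        totals[cand] += n
    return {c: n for c, n in totals.items() if n >= min_count}


def generate_candidates(frequent, k):
    """Classic Apriori join and prune: union pairs of frequent (k-1)-itemsets
    into k-itemsets, then drop any whose (k-1)-subsets are not all frequent."""
    out = set()
    for a, b in combinations(frequent, 2):
        union = a | b
        if len(union) == k and all(
            frozenset(sub) in frequent for sub in combinations(union, k - 1)
        ):
            out.add(union)
    return out


def apriori(splits, min_count):
    """Run level-wise passes; each pass is one simulated map/reduce job.
    `splits` is a list of transaction lists, standing in for input splits."""
    candidates = {frozenset([i]) for split in splits for t in split for i in t}
    k, result = 1, {}
    while candidates:
        pairs = (p for split in splits for p in map_phase(split, candidates))
        frequent = reduce_phase(pairs, min_count)
        result.update(frequent)
        k += 1
        candidates = generate_candidates(set(frequent), k)
    return result
```

Calling `apriori(splits, min_count)` with transactions partitioned into a few lists returns a dictionary that maps each frequent itemset (a `frozenset`) to its support count. In a real deployment each split would be processed by a separate mapper and the counts merged by reducers; here the same data flow is only simulated sequentially.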

Author information

Authors and Affiliations

State Grid Chongqing Electric Power Company Information Communication Branch, Chongqing, 400000, China
Yu Xu, Ranzhi Zhan, Gang Tan, Lu Chen & BoJin Tian

Corresponding author

Correspondence to Lu Chen.

Editor information

Editors and Affiliations

Shanghai University, Shanghai, China
Zheng Xu
University of Texas at San Antonio, San Antonio, TX, USA
Kim-Kwang Raymond Choo
University of Guelph, Guelph, ON, Canada
Ali Dehghantanha
Kennesaw State University, Marietta, GA, USA
Reza Parizi
Manchester Metropolitan University, Stockport, UK
Mohammad Hammoudeh

Cite this paper

Xu, Y., Zhan, R., Tan, G., Chen, L., Tian, B. (2020). An Improved Apriori Algorithm Research in Massive Data Environment. In: Xu, Z., Choo, KK., Dehghantanha, A., Parizi, R., Hammoudeh, M. (eds) Cyber Security Intelligence and Analytics. CSIA 2019. Advances in Intelligent Systems and Computing, vol 928. Springer, Cham. https://doi.org/10.1007/978-3-030-15235-2_113

DOI: https://doi.org/10.1007/978-3-030-15235-2_113
Published: 25 April 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-15234-5
Online ISBN: 978-3-030-15235-2
eBook Packages: Intelligent Technologies and Robotics; Intelligent Technologies and Robotics (R0)
