Skip to main content

An Improved Apriori Algorithm Research in Massive Data Environment

  • Conference paper
  • First Online:

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 928))

Abstract

Smart grid computing environment is an information platform that has lots of production data, data management, and the real-time and non real-time data. Under such massive data environment, the classic Apriori algorithm of mining association rules has a significant performance bottleneck. After analyzing the Apriori algorithm, the MapReduce programming model is used to realize the parallel Apriori algorithm. In order to improve the mining efficiency further, auxiliary tables and attribute columns are added and parallel strategy is improved in the process of candidate itemsets generation. Simulation experiments show that the improved Apriori algorithm can effectively reduce the algorithm execution time and improve the efficiency of data mining under the massive data environment.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Zhu M (2002) Data mining. Hefei, University of Science and Technology of China Press

    Google Scholar 

  2. Yang X (2014) Improvement and application of association rules algorithm for big data. Comput Modern 12:23–26

    Google Scholar 

  3. Zhu A (2012) Study on improvement and transplantation of Apriori algorithm based on Hadoop. Huazhong University of Science and Technology, Wuhan

    Google Scholar 

  4. Li L, Zhang M (2011) Research on algorithms of mining association rule under cloud computing environment. Comput Technol Dev 21(2):43–46

    MathSciNet  Google Scholar 

  5. Chen Y (2011) Data mining technology and application. Tsinghua University Press, Beijing

    Google Scholar 

  6. Xu Z, Liu M, Zhang S, Lu J, Ou Y (2004) Three kinds of optimization methods of Apriori algorithm. Comput Eng Appl 40(36):190–192

    Google Scholar 

  7. Xie G, Luo S (2010) Research based on the application of Hadoop MapReduce model. Microcomput Appl 29(8):4–7

    Google Scholar 

  8. Che B (2013) Key technology research-based the Hadoop of massive data processing. University of Electronic Science and Technology of China, Chengdu

    Google Scholar 

  9. Qian G, Jia R, Zhang R, Li L (2008) One optimized method of Apriori algorithm. Comput Eng 34(23):196–198

    Google Scholar 

  10. Liu H, Guo R, Jiang H (2009) Research and improvement of Apriori algorithm for mining association rules. Comput Appl Softw 26(1):146–149

    MathSciNet  Google Scholar 

  11. Zhang S (2011) An Apriori-based algorithm of association rules based on cloud computing. Commun Technol 44(6):141–143

    Google Scholar 

  12. Yang C (2010) Research of data mining based on Hadoop. Chongqing University, Chongqing

    Google Scholar 

  13. Zhang Q (2015) An improved algorithm for finding frequent itemsets in association rule mining. Stat Decis 4:32–35

    Google Scholar 

  14. Chen L (2012) Parallel association rules algorithm based on Hadoop. Nanjing University, Nanjing

    Google Scholar 

  15. Mao W (2014) The research of parallel association rules mining algorithms based on cloud platform. East China University of Science and Technology, Shanghai

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Lu Chen .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Xu, Y., Zhan, R., Tan, G., Chen, L., Tian, B. (2020). An Improved Apriori Algorithm Research in Massive Data Environment. In: Xu, Z., Choo, KK., Dehghantanha, A., Parizi, R., Hammoudeh, M. (eds) Cyber Security Intelligence and Analytics. CSIA 2019. Advances in Intelligent Systems and Computing, vol 928. Springer, Cham. https://doi.org/10.1007/978-3-030-15235-2_113

Download citation

Publish with us

Policies and ethics