Abstract
Big data mining based on cloud computing is the hot topic of the industry research, this paper proposed an improved distributed Apriori algorithm. More importantly, In view of the poor performance of running Apriori algorithm in large data, the algorithm of association rule data mining based on Apriori algorithm is put forward, and the improved distributed Apriori algorithm based on Hadoop platform is proposed. The algorithm focuses on the application of association rules algorithm based on Hadoop in mass data mining. This paper describes the idea of improved Apriori algorithm on Hadoop platform, and presents the experimental test. The experimental results show that the improved algorithm of association rules based on Hadoop can effectively improve the Apriori algorithm for association rules of operation efficiency, and reduce the redundant association rules, and has the efficient advantage in dealing with massive data.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Rai, N., Jain, S., Jain, A.: Mining interesting positive and negative association rule based on improved genetic algorithm. Int. J. Adv. Comput. Sci. Appl. 5(1), 160–165 (2014)
Gupta, M.K., Sikka, G.: Association rules extraction using multi-objective feature of genetic algorithm. In: Proceedings of the World Congress on Engineering and Computer Science, pp. 23–25 (2013)
Zhao, L., Jin, X., Sun, L., et al.: Association rule mining method based on niching genetic algorithm. Comput. Eng. 34(10), 163–165 (2013)
Lammel, R.: Google’s MapReduce programming model - revisited. Sci. Comput. Prog. (S0167-6423) 70(1), 1–30 (2012)
Mccreadie, R.M.C., Macdonald, C., Ounis, I.: On single-pass indexing with MapReduce. In: Association Computing Machinery, New York, USA (2009)
He, B., Yang, K., Fang, R., Lu, M., Govindaraju, N.K., Luo, Q., Sander, P.V.: Relational joins on graphics processors. In: ACM SIGMOD 2014 (2014)
Mei, S., Kun, Z.: Big-data analytics: challenges, key technologi and prospects. ZTE Commun. 11(2), 11–17 (2013)
Yang, X., Liu, Z., Fu, Y.: MapReduce as a programming model for association rules algorithm on Hadoop. In: The 3rd International Conference on Information Sciences and Interaction Sciences (ICIS), pp. 99–102 (2010)
Dean, J., Ghemawat, S.: MapReduce: a flexible data processing tool. Commun. ACM 53(1), 72–77 (2013)
Lin, J., Dyer, C.: Data-Intensive Text Processing with MapReduce (2010)
Yadav, C., Wang, S., Jkumar, M.: An approach to improve Apriori algorithm based on association rule mining. In: 2013 Fourth International Conference on Computing, Communications and Networking Technologies (ICCCNT), pp. 1–9. IEEE, USA (2013)
Abadeh, M.S., Hamid, M., Jafar, H.: Design and analysis of genetic fuzzy systems for intrusion detection in computer networks. Expert Syst. Appl. 38, 7067–7075 (2014)
Wu, K., Hao, J., Wang, C.: Application of fuzzy association rules in intrusion detection. In: International Conference on Internet Computing and Information Services, pp. 269–272 (2011)
Wu, Y., Qin, Y., Song, J.: Research overview of intrusion detection algorithms based on association rules. Comput. Eng. Des. 32(3), 834–838 (2013)
Acknowledgements
The work is supported in part by Department of Education of Guangdong Province under Grant 2015KQNCX188.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Qiu, L. (2018). Research on Data Mining Algorithm of Association Rules Based on Hadoop. In: Li, K., Li, W., Chen, Z., Liu, Y. (eds) Computational Intelligence and Intelligent Systems. ISICA 2017. Communications in Computer and Information Science, vol 873. Springer, Singapore. https://doi.org/10.1007/978-981-13-1648-7_25
Download citation
DOI: https://doi.org/10.1007/978-981-13-1648-7_25
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-1647-0
Online ISBN: 978-981-13-1648-7
eBook Packages: Computer ScienceComputer Science (R0)