Skip to main content
Log in

GPU-based bees swarm optimization for association rules mining

  • Published:
The Journal of Supercomputing Aims and scope Submit manuscript

Abstract

Association rules mining (ARM) is a well-known combinatorial optimization problem aiming at extracting relevant rules from given large-scale datasets. According to the state of the art, the bio-inspired methods proved their efficiency by generating acceptable solutions in a reasonable time when dealing with small and medium size instances. Unfortunately, to cope with large instances such as the webdocs benchmark, these methods require more and more powerful processors and are time expensive. Nowadays, computing power is no longer a real issue. It can be provided by the power of emerging technologies such as graphics processing units (GPUs) that are massively multi-threaded processors. In this paper, we investigate the use of GPUs to speed up the computation. We propose two GPU-based bees swarm algorithms for association rules mining (single evaluation in GPU, SE-GPU and multiple evaluation in GPU, ME-GPU). SE-GPU aims at evaluating one rule at a time where each thread is associated with one transaction, whereas ME-GPU evaluates multiple rules in parallel on GPU where each thread is associated with several transactions. To validate our approaches, the two algorithms have been executed to solve well-known large ARM instances. Real experiments have been carried out on an Intel Xeon 64 bit quad-core processor E5520 coupled to an Nvidia Tesla C2075 GPU device. The results show that our approaches improve the execution time up to 100\(\times \) over the sequential mono-core bees swarm optimization-ARM algorithm. Moreover, the proposed approaches have been compared with CPU multi-core ones (1–8 cores). The results show that they are faster than the multi-core versions whatever is the number of used cores.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

References

  1. Adil SH, Qamar S (2009) Implementation of association rule mining using CUDA. In: International conference onemerging technologies (ICET 2009). IEEE, New York

  2. Agrawal R, Shafer JC (1996) Parallel mining of association rules. IEEE Trans Knowl Data Eng 8(6):962–969

    Article  Google Scholar 

  3. Agrawal R, Imielinski T, Swami A (1993) Mining association rules between sets of items in large databases. ACM SIGMOD Rec 22(2) (ACM)

  4. Arabnia HR, Oliver MA (1989) A transputer network for fast operations on digitised images. Int J Eurograph Assoc (Comput Graph Forum) 8(1):3–12

    Article  Google Scholar 

  5. Bendjoudi A, Chekini M, Gharbi M, Mehdi M, Benatchba K, Sitayeb-Benbouzid F, Melab N (2013) Parallel B&B algorithm on hybrid multicore/GPU architecture. In: IEEE international conference on high performance computing and communications, vol 15. IEEE, New York

  6. Boley D et al (1999) Partitioning-based clustering for web document categorization. Decis Support Syst 27(3):329–341

    Article  Google Scholar 

  7. Brin S et al (1997) Dynamic itemset counting and implication rules for market basket data. ACM SIGMOD Rec 26(2) (ACM)

  8. Cano A, Luna JM, Ventura S (2013) High performance evaluation of evolutionary-mined association rules on GPUs. J Supercomput 1–24

  9. Cecilia JM, García JM, Nisbet A, Amos M, Ujaldón M (2013) Enhancing data parallelism for ant colony optimization on gpus. J Parallel Distrib Comput 73(1):42–51

    Article  Google Scholar 

  10. Cui Q, Xiaobo G (2013) Research on parallel association rules mining on GPU. In: Proceedings of the 2nd international conference on green communications and networks 2012 (GCN 2012), vol 2. Springer, Berlin

  11. Djenouri Y, Drias H, Habbas Z (2014) Bees swarm optimisation using multiple strategies for association rule mining. Int J Bio-Inspir Comput 6(4):239–249

    Article  Google Scholar 

  12. Fang W et al (2009) Frequent itemset mining on graphics processors. In: Proceedings of the fifth international workshop on data management on new hardware. ACM, New York

  13. Gada FV, Yadav KR, Shah RD (2013) Data mining in medical application for better detection of disease. IJCER 2(2):173–176

    Google Scholar 

  14. Garg R, Mishra PK (2011) Exploiting parallelism in association rule mining algorithms. Int J Adv Technol 2.2:222–232

  15. Goethals B, Zaki MJ (2003) Frequent itemset mining implementations repository. This site contains a wide-variety of algorithms for mining frequent, closed, and maximal itemsets. http://fimi.cs.helsinki.fi

  16. Guvenir HA, Uysal I (2000) Bilkent university function approximation repository. http://funapp.cs.bilkent.edu.tr/DataSets. Accessed 12 March 2012

  17. Han J, Pei J, Yin Y (2000) Mining frequent patterns without candidate generation. ACM SIGMOD Rec 29(2) (ACM)

  18. Han J, Kamber M, Pei J (2006) Data mining: concepts and techniques. Morgan Kaufmann, Burlington

  19. Kacprzyk J, Zadrozny S (2013) Derivation of linguistic summaries is inherently difficult: can association rule mining help? In: Towards advanced data analysis by combining soft computing and statistics, pp 291–303. Springer, Berlin

  20. Khademolghorani F, Baraani A, Zamanifar K (2014) Efficient mining of association rules based on gravitational search algorithm. Int J Comput Sci 8

  21. Khabzaoui M, Dhaenens C, Talbi E-G (2005) Parallel genetic algorithms for multi-objective rule mining. In: The 6th MIC 2005

  22. Kozawa Y, Amagasa T, Kitagawa H (2014) Probabilistic frequent itemset mining on a GPU cluster. IEICE Trans Inf Syst 97(4):779–789

    Article  Google Scholar 

  23. Kuo RJ, Chao CM, Chiu YT (2011) Application of particle swarm optimization to association rule mining. Appl Soft Comput 11(1):326–336

    Article  Google Scholar 

  24. Li H, Zhang N (2010) Mining maximal frequent itemsets on graphics processors. In: 2010 seventh international conference on fuzzy systems and knowledge discovery (FSKD), vol 3. IEEE, New York

  25. Lin KW, Deng DJ (2010) A novel parallel algorithm for frequent pattern mining with privacy preserved in cloud computing environments. Int J Ad Hoc Ubiquitous Comput 6(4):205–215

    Article  Google Scholar 

  26. Liu K, Hogan WR, Crowley RS (2011) Natural language processing methods and systems for biomedical ontology learning. J Biomed Inform 44(1):163–179

    Article  Google Scholar 

  27. Liu Z, Chen S, Cai J, Zhang E, Lan L, Zheng J, Du J (2013) Traditional Chinese medicine syndrome-related herbal prescriptions in treatment of malignant tumors. J Tradit Chin Med 33(1):19–26

    Article  Google Scholar 

  28. Luke S (2011) Essentials of meta-heuristics, pp 11–52

  29. Luper D, Cameron D, Miller JA, Arabnia HR (2007) Spatial and temporal target association through semantic analysis and GPS data mining. In: International conference on information and knowledge engineering (IKE’07), USA, pp 251–257 (ISBN 1-60132-050-7)

  30. Martínez-Ballesteros M, Nepomuceno-Chamorro IA, Riquelme JC (2013) Discovering gene association networks by multi-objective evolutionary quantitative association rules. J Comput Syst Sci

  31. Mata J, José-Luis A, Riquelme J-C (2002) An evolutionary algorithm to discover numeric association rules. In: Proceedings of the 2002 ACM symposium on applied computing. ACM, New York

  32. Melab N, Talbi E-G (2001) A parallel genetic algorithm for rule mining. In: Proceedings of the 15th international parallel and distributed processing symposium. IEEE Computer Society, London

  33. Melab N, Chakroun I, Bendjoudi A (2013) Graphics processing unit-accelerated bounding for branch-and-bound applied to a permutation problem using data access optimization. Concurr Comput Pract Exp

  34. Moslehi P et al (2014) Multi-objective numeric association rules mining via ant colony optimization for continuous domains without specifying minimum support and minimum confidence. Int J Comput Sci 8

  35. Mishra BSP et al (2011) Parallel multi-objective genetic algorithms for associative classification rule mining. In: Proceedings of the 2011 international conference on communication, computing and security. ACM, New York

  36. Olafsson S, Li X, Wu S (2008) Operations research and data mining. Eur J Oper Res 187(3):1429–1448

    Article  MATH  MathSciNet  Google Scholar 

  37. Orlando S et al (2002) Adaptive and resource-aware mining of frequent sets. In: Proceedings of the 2002 IEEE international conference on data mining, ICDM 2003. IEEE, New York

  38. Park JS, Chen M-S, Yu PS (1995) An effective hash-based algorithm for mining association rules, vol 24(2). ACM, New York

  39. Park JS, Chen M-S, Yu PS (1995) Efficient parallel data mining for association rules. In: Proceedings of the fourth international conference on information and knowledge management. ACM, New York

  40. Parthasarathy S et al (2001) Parallel data mining for association rules on shared-memory systems. Knowl Inf Syst 3.1:1–29

  41. Rahbarinia B, Pedram MM, Arabnia HR, Alavi Z (2010) A multi-objective scheme to hide sequential patterns. In: The proceedings of the 2010 international conference on computer and automation engineering, ICCAE, Singapore, 26–28 February 2010, pp 153–158 (E-ISBN 978-1-4244-5586-7, doi:10.1109/ICCAE.2010.5451977)

  42. Romero C et al (2012) Association rule mining using genetic programming to provide feedback to instructors from multiple-choice quiz data. Expert Syst

  43. Silvestri C, Orlando S (2012) gpuDCI: exploiting GPUs in frequent itemset mining. In: 20th euromicro international conference on parallel, distributed and network-based processing (PDP). IEEE, New York

  44. Sun J, Pan W (2009) Parallel association rule mining for image data. In: International conference on photonics and image in agriculture engineering (PIAGENG 2009). International Society for Optics and Photonics, London

  45. Talia D, Trunfio P, Verta O (2005) Weka4ws: a wsrf-enabled weka toolkit for distributed data mining on grids. In: Knowledge discovery in databases: PKDD 2005, pp 309–320. Springer, Berlin

  46. Wang ZG, Wang CS (2012) A parallel association-rule mining algorithm. In: Web information systems and mining, pp 125–129. Springer, Berlin

  47. Weiss RM (2010) GPU-accelerated data mining with swarm intelligence. Doctoral dissertation, Honors thesis. Department of Computer Science, Macalester College. http://metislogic.net/thesis.pdf

  48. Yan X, Zhang C, Zhang S (2009) Genetic algorithm-based strategy for identifying association rules without specifying actual minimum support. Expert Syst Appl 36(2):3066–3076

    Article  Google Scholar 

  49. Yang J, Yang Y (2010) A parallel algorithm for mining association rules. In: 2nd international conference on networking and digital society (ICNDS), vol 1. IEEE, New York

  50. Zaïane OR, El-Hajj M, Lu P (2001) Fast parallel association rule mining without candidacy generation. In: Proceedings of the IEEE international conference on data mining, ICDM 2001. IEEE, New York

  51. Zhang F, Zhang Y, Bakos J (2011) GPApriori: GPU-accelerated frequent itemset mining. In: IEEE international conference on cluster computing (CLUSTER). IEEE, New York

  52. Zhou J, Yu K-M, Wu B-C (2010) Parallel frequent patterns mining algorithm on GPU. In: IEEE international conference on systems man and cybernetics (SMC). IEEE, New York

  53. Zhou Y, Tan Y (2009) GPU-based parallel particle swarm optimization. In: IEEE congress on evolutionary computation, 2009, CEC’09, pp 1493–1500. IEEE, New York

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ahcene Bendjoudi.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Djenouri, Y., Bendjoudi, A., Mehdi, M. et al. GPU-based bees swarm optimization for association rules mining. J Supercomput 71, 1318–1344 (2015). https://doi.org/10.1007/s11227-014-1366-8

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11227-014-1366-8

Keywords

Navigation