
FHUQI-Miner: Fast high utility quantitative itemset mining

Applied Intelligence

Abstract

High utility itemset mining is a popular pattern mining task that aims to reveal all sets of items that yield a high profit in a transaction database. Although this task is useful for understanding customer behavior, an important limitation is that high utility itemsets provide no information about the purchase quantities of items. Recently, some algorithms were designed to address this limitation by finding quantitative high utility itemsets, but they can have very long execution times because the search space is larger. This paper addresses the performance problem by proposing a novel, efficient algorithm for high utility quantitative itemset mining, called FHUQI-Miner (Fast High Utility Quantitative Itemset Miner). It performs a depth-first search and adopts two novel search space reduction strategies, named Exact Q-items Co-occurrence Pruning Strategy (EQCPS) and Range Q-items Co-occurrence Pruning Strategy (RQCPS). Experimental results show that the proposed algorithm is much faster than the state-of-the-art HUQI-Miner algorithm on sparse datasets.
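To make the task concrete, the following is a minimal brute-force sketch of utility computation. It is not FHUQI-Miner and uses none of its pruning strategies. It assumes the usual definitions from the utility mining literature, which the abstract does not spell out: the utility of an itemset in a transaction is the sum of quantity times unit profit over its items, and a quantitative item additionally constrains the purchase quantity to an exact value or a range. The toy database, the profit table, and all function names are hypothetical.

```python
from itertools import combinations

# Hypothetical toy data: each transaction maps item -> purchase quantity.
transactions = [
    {"A": 2, "B": 1},
    {"A": 2, "C": 3},
    {"A": 1, "B": 1, "C": 1},
]
# Hypothetical unit profit of each item.
unit_profit = {"A": 5, "B": 2, "C": 1}


def itemset_utility(itemset, db, profit):
    """Utility of an itemset: sum of quantity * unit profit over all
    transactions that contain every item of the itemset."""
    total = 0
    for t in db:
        if all(i in t for i in itemset):
            total += sum(t[i] * profit[i] for i in itemset)
    return total


def q_itemset_utility(q_itemset, db, profit):
    """Utility of a quantitative itemset (assumed definition).

    q_itemset maps item -> (low, high) quantity bounds; low == high
    models an exact q-item, low < high a range q-item. Only transactions
    where every item occurs with a quantity inside its bounds count.
    """
    total = 0
    for t in db:
        if all(i in t and lo <= t[i] <= hi for i, (lo, hi) in q_itemset.items()):
            total += sum(t[i] * profit[i] for i in q_itemset)
    return total


def high_utility_itemsets(db, profit, min_util):
    """Enumerate every itemset and keep those with utility >= min_util.
    Exponential in the number of items; for illustration only."""
    items = sorted(profit)
    result = {}
    for k in range(1, len(items) + 1):
        for combo in combinations(items, k):
            u = itemset_utility(combo, db, profit)
            if u >= min_util:
                result[combo] = u
    return result


if __name__ == "__main__":
    print(high_utility_itemsets(transactions, unit_profit, 15))
    print(q_itemset_utility({"A": (2, 2), "B": (1, 1)}, transactions, unit_profit))
```

Every item can be paired with many exact quantities and quantity ranges, so the quantitative variant has far more candidates than plain itemset enumeration. That growth is the larger search space the abstract refers to, and it is why pruning strategies matter for this task.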




Corresponding author

Correspondence to Philippe Fournier-Viger.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article


Cite this article

Nouioua, M., Fournier-Viger, P., Wu, CW. et al. FHUQI-Miner: Fast high utility quantitative itemset mining. Appl Intell 51, 6785–6809 (2021). https://doi.org/10.1007/s10489-021-02204-w


  • DOI: https://doi.org/10.1007/s10489-021-02204-w
