Maintenance algorithm for high average-utility itemsets with transaction deletion

Lin, Jerry Chun-Wei; Shao, Yina; Fournier-Viger, Philippe; Djenouri, Youcef; Guo, Xiangmin

doi:10.1007/s10489-018-1180-8

Maintenance algorithm for high average-utility itemsets with transaction deletion

Published: 18 April 2018

Volume 48, pages 3691–3706, (2018)
Cite this article

Applied Intelligence Aims and scope Submit manuscript

Jerry Chun-Wei Lin^1,2,
Yina Shao¹,
Philippe Fournier-Viger³,
Youcef Djenouri⁴ &
…
Xiangmin Guo⁵

297 Accesses
17 Citations
Explore all metrics

Abstract

High-utility itemset mining (HUIM) is an extension of traditional association-rule mining that can find profitable itemsets for decision-making. It faces, however, a limitation since the utility of an itemset increases along with the size of it. High-average utility itemset mining (HAUIM) provides a fair measure to find the average-utility of an itemset, which is more reasonable to design the sales strategies for making the efficient decision. Traditional algorithms of HAUIM mostly focus on mining high average-utility itemsets (HAUIs) from the static database. When the database size is changed, for example, transaction insertion/deletion, the discovered information is required to be updated, thus the updated database is necessary to be re-scanned for identifying the set of HAUIs in the batch manner. In this paper, we present an updating algorithm called FUP-HAUIMD to maintain the discovered HAUIs with transaction deletion. When some transactions in the database are deleted, the designed FUP-HAUIMD algorithm can easily update the discovered HAUIs without scanning the database all the time. The designed FUP-HAUIMD algorithm divides the itemsets into four cases based on the modified fast updated (MFUP) concept. The average-utility (AU)-list structure is further utilized to keep the necessary ramification for later mining progress. Experiments are then conducted to compare the designed FUP-HAUIMD algorithm with the state-of-the-art baseline algorithm running on the batch mode, and the developed approach shows better performance in terms of runtime, number of examined patterns, and scalability.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Incrementally updating the high average-utility patterns with pre-large concept

Article Open access 30 June 2020

Maintaining the discovered high-utility itemsets with transaction modification

Article 31 July 2015

Dynamic maintenance model for high average-utility pattern mining with deletion operation

Article Open access 22 June 2021

References

Agrawal R, Srikant R (1994) Fast algorithms for mining association rules in large databases. The International Conference on Very Large Data Bases, pp 487–499
Agrawal R, Srikant R (1994) Quest synthetic data generator. http://www.Almaden.ibm.com/cs/quest/syndata.html
Ahmed CF, Tanbeer SK, Jeong BS, Lee YK (2009) Efficient tree structures for high utility pattern mining in incremental databases. IEEE Trans Knowl Data Eng 21(12):1708–1721
Article Google Scholar
Erwin A, Gopalan RP, Achuthan NR (2008) Efficient mining of high utility itemsets from large datasets. Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining, pp 554–561
Cheung DW, Wong CY, Han J, Ng VT (1996) Maintenance of discovered association rules in large databases: an incremental updating techniques. Proceedings of the Twelfth International Conference, pp 106–114
Cheung DW, Lee SD, Kao B (1997) A general incremental technique for maintaining discovered association rules. The International Conference on Database Systems for Advanced Applications, pp 185–194
Chen MS, Park JS, Yu PS (1998) Efficient data mining for path traversal patterns. IEEE Trans Knowl Data Eng 10(2):209–221
Article Google Scholar
Creighton C, Hanash S (2003) Mining gene expression databases for association rules. Bioinformatics 19 (1):79–86
Article Google Scholar
Deng Z, Lv SL (2014) Fast mining frequent itemsets using nodesets. Expert Syst Appl 41(10):4505–4512
Article Google Scholar
Fournier-Viger P, Lin JCW, Gomariz A, Gueniche T, Soltani A, Deng Z, Lam HT (2016) The SPMF open-source data mining library version 2. Joint European Conference on Machine Learning and Knowledge Discovery in Databases, pp 36–40
Han J, Pei J, Yin Y, Mao R (2004) Mining frequent patterns without candidate generation: a frequent-pattern tree approach. Data Min Knowl Discov 8(1):53–87
Article MathSciNet Google Scholar
Hong TP, Lin CW, Wu YL (2008) Incrementally fast updated frequent pattern trees. Expert Syst Appl 34(4):2424–2435
Article Google Scholar
Hong TP, Lin CW, Wu YL (2009) Maintenance of fast updated frequent pattern trees for record deletion. Comput Stat Data Anal 53(7):2485–2499
Article MathSciNet MATH Google Scholar
Hong TP, Lee CH, Wang SL (2011) Effective utility mining with the measure of average utility. Expert Syst Appl 38(7):8259–8265
Article Google Scholar
Liu Y, Liao WK, Choudhary A (2005) A fast high utility itemsets mining algorithm. International Workshop on Utility-Based Data Mining, pp 90–99
Liu Y, Liao WK, Choudhary A (2005) A two-phase algorithm for fast discovery of high utility itemsets. Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining, pp 689– 695
Lucchese C, Orlando S, Perego R (2006) Fast and memory efficient mining of frequent closed itemsets. IEEE Trans Knowl Data Eng 18(1):21–36
Article Google Scholar
Lin CW, Hong TP, Lu WH (2009) The Pre-FUFP algorithm for incremental mining. Expert Syst Appl 36(5):9498–9505
Article Google Scholar
Lin CW, Lan GC, Hong TP (2009) An incremental mining algorithm for high utility itemsets. Expert Syst Appl 39(8):7173–7180
Article Google Scholar
Lin CW, Hong TP, Lu WH (2010) Efficiently mining high average utility itemsets with a tree structure. Asian Conference on Intelligent Information and Database Systems, pp 131–139
Lin CW, Hong TP, Lu WH (2011) An effective tree structure for mining high utility itemsets. Expert Syst Appl 38(6):7419– 7424
Article Google Scholar
Liu J, Wang K, Fung BCM (2012) Direct discovery of high utility itemsets without candidate generation. IEEE International Conference on Data Mining, pp 984–989
Liu M, Qu J (2012) Mining high utility itemsets without candidate generation. ACM International Conference on Information and Knowledge Management, pp 55–64
Lan GC, Hong TP, Tseng VS (2012) Efficient mining high average-utility itemsets with an improved upper-bound strategy. Int J Inform Technol Decis Making 11(5):1009–1030
Article Google Scholar
Lu T, Vo B, Nguyen HT, Hong TP (2014) A new method for mining high average utility itemsets. Computer Information Systems and Industrial Management, pp 33–42
Lin CW, Hong TP, Lan GC, Wong JW, Lin WY (2015) Efficient updating of discovered high-utility itemsets for transaction deletion in dynamic databases. Adv Eng Inform 29(1):16–27
Article Google Scholar
Liu J, Wang K, Fung BCM (2016) Mining high utility patterns in one phase without generating candidates. IEEE Trans Knowl Data Eng 28(5):1245–1257
Article Google Scholar
Lin CW, Li T, Fournier-Viger P, Hong TP, Zhan J, Voznak M (2016) An efficient algorithm to mine high average-utility itemsets. Adv Eng Inform 30(2):233–243
Article Google Scholar
Lin JCW, Ren S, Fournier-Viger P, Hong TP (2017) EHAUPM: Efficient high average-utility pattern mining with tighter upper-bound models. IEEE Access 5:12927–12940
Article Google Scholar
Lin JCW, Ren S, Fournier-Viger P, Hong TP (2017) EHAUPM: efficient high average-utility pattern mining with tighter upper-bound models. IEEE Access 5:12927–12940
Article Google Scholar
Tseng VS, Shie BE, Wu CW, Yu PS (2013) Efficient algorithms for mining high utility itemsets from transactional databases. IEEE Trans Knowl Data Eng 25(8):1772–1786
Article Google Scholar
Yao H, Hamilton HJ, Butz CJ (2004) A foundational approach to mining itemset utilities from databases. SIAM International Conference on Data Mining, pp 215–221
Yen SJ, Lee YS (2007) Mining high utility quantitative association rules. International Conference on Data Warehousing and Knowledge Discovery, pp 283–292

Download references

Acknowledgments

This research was partially supported by the National Natural Science Foundation of China (NSFC) under grant No. 61503092, by the Shenzhen Technical Project under grants No. KQJSCX20170726103424709 and JCYJ20170307151733005, by the National Science Funding of Guangdong Province under grant No. 2016A030313659, and by the Science Research Project of Guangdong Province under grant No. 2017A020220011.

Author information

Authors and Affiliations

School of Computer Science and Technology, Harbin Institute of Technology (Shenzhen), Shenzhen, China
Jerry Chun-Wei Lin & Yina Shao
Department of Computing, Mathematics and Physics, Western Norway University of Applied Sciences (HVL), Bergen, Norway
Jerry Chun-Wei Lin
School of Natural Sciences and Humanities, Harbin Institute of Technology (Shenzhen), Shenzhen, China
Philippe Fournier-Viger
IMADA Lab, Southern Denmark University, Odense, Denmark
Youcef Djenouri
School of Architecture and Urban Planning, Harbin Institute of Technology (Shenzhen), Shenzhen, China
Xiangmin Guo

Authors

Jerry Chun-Wei Lin
View author publications
You can also search for this author in PubMed Google Scholar
Yina Shao
View author publications
You can also search for this author in PubMed Google Scholar
Philippe Fournier-Viger
View author publications
You can also search for this author in PubMed Google Scholar
Youcef Djenouri
View author publications
You can also search for this author in PubMed Google Scholar
Xiangmin Guo
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jerry Chun-Wei Lin.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Lin, J.CW., Shao, Y., Fournier-Viger, P. et al. Maintenance algorithm for high average-utility itemsets with transaction deletion. Appl Intell 48, 3691–3706 (2018). https://doi.org/10.1007/s10489-018-1180-8

Download citation

Published: 18 April 2018
Issue Date: October 2018
DOI: https://doi.org/10.1007/s10489-018-1180-8

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Maintenance algorithm for high average-utility itemsets with transaction deletion

Abstract

Access this article

Similar content being viewed by others

Incrementally updating the high average-utility patterns with pre-large concept

Maintaining the discovered high-utility itemsets with transaction modification

Dynamic maintenance model for high average-utility pattern mining with deletion operation

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Maintenance algorithm for high average-utility itemsets with transaction deletion

Abstract

Access this article

Similar content being viewed by others

Incrementally updating the high average-utility patterns with pre-large concept

Maintaining the discovered high-utility itemsets with transaction modification

Dynamic maintenance model for high average-utility pattern mining with deletion operation

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation