Updating high-utility pattern trees with transaction modification

Lin, Chun-Wei; Zhang, Binbin; Gan, Wensheng; Chen, Bo-Wei; Rho, Seungmin; Hong, Tzung-Pei

doi:10.1007/s11042-014-2178-9

Updating high-utility pattern trees with transaction modification

Published: 30 July 2014

Volume 75, pages 4887–4912, (2016)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

Chun-Wei Lin^1,2,
Binbin Zhang³,
Wensheng Gan¹,
Bo-Wei Chen⁴,
Seungmin Rho⁵ &
…
Tzung-Pei Hong^6,7

311 Accesses
1 Citation
Explore all metrics

Abstract

Traditional association-rule mining only concerns the occurrence frequencies of the items in a binary database. In real-world applications, customers may buy several copies of the purchased items. Other factors such as profit, quantity, or price should be concerned to measure the utilities of the purchased items. High-utility itemsets mining was thus proposed to consider the factors of quantity and profit. Two-phase model was the most commonly way to keep the transaction-weighted utilization downward closure property, thus reducing the numerous candidates in utility mining. Most methods for finding high-utility itemsets are used to handle a static database. In practical applications, transactions are changed whether insertion, deletion, or modification. Some itemsets may arise as the new high-utility itemsets or become invalid knowledge in the updated database. In this paper, a maintenance Fast Updated High Utility Pattern tree for transaction MODification (FUP-HUP-tree-MOD) algorithm is thus proposed to effective maintain and update the built HUP tree for mining high-utility itemsets in dynamic databases without candidate generation. Experiments are conducted to show better performance of the proposed algorithm compared to the two-phase algorithm and the HUP tree algorithm in batch mode.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

An efficient join operations for utility list-based high-utility mining approaches using hybrid search technique

Article 12 April 2024

Data distribution tailoring revisited: cost-efficient integration of representative data

Article 12 April 2024

Privacy-preserving data (stream) mining techniques and their impact on data mining accuracy: a systematic literature review

Article Open access 22 February 2023

References

Abdullah Z, Herawan T, Deris M (2010) Mining significant least association rules using fast slp-growth algorithm. Lect Notes Comput Sci 6059:324–336
Article Google Scholar
Agrawal R, Imielinski T, Swami A (1993) Database mining: a performance perspective. IEEE Trans Knowl Data Eng 5:914–925
Article Google Scholar
Agrawal R, Srikant R (1994) “Fast algorithms for mining association rules in large databases.” The Int Conf Very Large Data Bases: 487–499
Chan R, Yang Q, Shen YD (2003) “Mining high utility itemsets,” IEEE Int Conf Data Min: 19–26
Chen MS, Han J, Yu PS (1996) Data mining: an overview from a database perspective. IEEE Trans Knowl Data Eng 8:866–883
Article Google Scholar
Cheung DWL, Han J, Ng V, Wong CY (1996) “Maintenance of discovered association rules in large databases: An incremental updating technique.” Int Conf Data Eng:106–114
Cheung DWL, Lee SD, Kao B (1997) “A general incremental technique for maintaining discovered association rules.” The Int Conf Database Syst Adv Appl: 185–194
Frequent itemset mining dataset repository. Available: http://fimi.ua.ac.be/data/ (2012)
Gharib TF, Nassar H, Taha M, Abrahamd A (2010) An efficient algorithm for incremental mining of temporal association rules. Data Knowl Eng 69:800–815
Article Google Scholar
Han J, Pei J, Yin Y, Mao R (2004) Mining frequent patterns without candidate generation: a frequent-pattern tree approach. Data Min Knowl Disc 8:53–87
Article MathSciNet Google Scholar
Hong TP, Lin CW, Wu YL (2008) Incrementally fast updated frequent pattern trees. Expert Syst Appl 34:2424–2435
Article Google Scholar
Hong TP, Lin CW, Wu YL (2008) “An efficient fufp-tree mainteance algorithm for record modification,”. Int J Innov Comput Inf Control 4:2875–2887
Google Scholar
Li YC, Yeh JS, Chang CC (2005) Direct candidates generation: a novel algorithm for discovering complete share-frequent itemsets. Fuzzy Syst Knowl Discov 3614:551–560
Article Google Scholar
Li YC, Yeh JS, Chang CC (2005) “Direct candidates generation: A novel algorithm for discovering complete share-frequent itemsets.” Lect Notes Comput Sci: 551–560
Li YC, Yeh JS, Chang CC (2005) “A fast algorithm for mining share-frequent itemsets.” Lect Notes Comput Sci: 417–428
Lin CW, Hong TP, Lu WH (2009) The pre-fufp algorithm for incremental mining. Expert Syst Appl 36:9498–9505
Article Google Scholar
Lin CW, Hong TP, Lu WH (2010) “Maintaining high utility pattern trees in dynamic databases.” Int Conf Comput Eng Appl: 304–308
Lin CW, Hong TP, Lu WH (2011) An effective tree structure for mining high utility itemsets. Expert Syst Appl 38:7419–7424
Article Google Scholar
Lin CW, Lan GC, Hong TP, Kong L (2014) Mining high utility itemsets based on transaction deletion. Lect Notes Electr Eng 260:983–990
Article Google Scholar
Lin CW, Lan GC, Hong TP (2012) An incremental mining algorithm for high utility itemsets. Expert Syst Appl 39:7173–7180
Article Google Scholar
Liu Y, Liao WK, Choudhary A (2005) “A two-phase algorithm for fast discovery of high utility itemsets.” Adv Knowl Discov Data Min: 689–695
Liu M, Qu J (2012) “Mining high utility itemsets without candidate generation>” ACM Int Conf Inf Knowl Manag: 55–64
Liu J, Wang K, Fung BCM (2012) “Direct discovery of high utility itemsets without candidate generation.” IEEE Int Conf Data Min: 984–989
Microsoft. Example database foodmart of microsoft analysis services. Available: http://msdn.microsoft.com/en-us/library/aa217032(SQL.80).aspx
Nath B, Bhattacharyya DK, Ghosh A (2013) “Incremental association rule mining: A survey,” WIREs Data Mining Knowledge Discovery, vol. 3
Song W, Liu Y, Li J (2013) “Mining high utility itemsets by dynamically pruning the tree structure.” Appl Intell: 1–15
Tseng VS, Bai-En S, Cheng-Wei W, Yu PS (2013) Efficient algorithms for mining high utility itemsets from transactional databases. IEEE Trans Knowl Data Eng 25:1772–1786
Article Google Scholar
Wu CW, Lin YF, Yu PS, Tseng VS (2013) “Mining high utility episodes in complex event sequences.” ACM Int Conf Knowl Discov Data Min: 536–544
Yao H, Hamilton HJ (2006) Mining itemset utilities from transaction databases. Data Knowl Eng 59:603–626
Article Google Scholar
Yao H, Hamilton HJ, Butz CJ (2004) “A foundational approach to mining itemset utilities from databases.” SIAM Int Conf Data Min: 211–225
Yuna U, Ryanga H, Ryub KH (2014) High utility itemset mining with techniques for reducing overestimated utilities and pruning candidates. Expert Syst Appl 41:3861–3878
Article Google Scholar

Download references

Acknowledgments

This research was partially supported by the Shenzhen Peacock Project, China, under grant KQC201109020055A, by the Natural Scientific Research Innovation Foundation in Harbin Institute of Technology under grant HIT.NSRIF.2014100, and by the Shenzhen Strategic Emerging Industries Program under grant ZDSY20120613125016389.

Author information

Authors and Affiliations

Innovative Information Industry Research Center (IIIRC), School of Computer Science and Technology, Harbin Institute of Technology Shenzhen Graduate School, Shenzhen, China
Chun-Wei Lin & Wensheng Gan
Shenzhen Key Laboratory of Internet Information Collaboration, School of Computer Science and Technology, Harbin Institute of Technology Shenzhen Graduate School, Shenzhen, China
Chun-Wei Lin
Medical School, Shenzhen University, Shenzhen, China
Binbin Zhang
Department of Electrical Engineering, National Cheng Kung University, Tainan, Taiwan
Bo-Wei Chen
Department of Multimedia, Sungkyul University, Anyang-si, South Korea
Seungmin Rho
Department of Computer Science and Information Engineering, National University of Kaohsiung, Kaohsiung, Taiwan, Republic of China
Tzung-Pei Hong
Department of Computer Science and Engineering, National Sun Yat-sen University, Kaohsiung, Taiwan, Republic of China
Tzung-Pei Hong

Authors

Chun-Wei Lin
View author publications
You can also search for this author in PubMed Google Scholar
Binbin Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Wensheng Gan
View author publications
You can also search for this author in PubMed Google Scholar
Bo-Wei Chen
View author publications
You can also search for this author in PubMed Google Scholar
Seungmin Rho
View author publications
You can also search for this author in PubMed Google Scholar
Tzung-Pei Hong
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Binbin Zhang.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Lin, CW., Zhang, B., Gan, W. et al. Updating high-utility pattern trees with transaction modification. Multimed Tools Appl 75, 4887–4912 (2016). https://doi.org/10.1007/s11042-014-2178-9

Download citation

Received: 28 May 2014
Revised: 12 June 2014
Accepted: 01 July 2014
Published: 30 July 2014
Issue Date: May 2016
DOI: https://doi.org/10.1007/s11042-014-2178-9

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Updating high-utility pattern trees with transaction modification

Abstract

Access this article

Similar content being viewed by others

An efficient join operations for utility list-based high-utility mining approaches using hybrid search technique

Data distribution tailoring revisited: cost-efficient integration of representative data

Privacy-preserving data (stream) mining techniques and their impact on data mining accuracy: a systematic literature review

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Updating high-utility pattern trees with transaction modification

Abstract

Access this article

Similar content being viewed by others

An efficient join operations for utility list-based high-utility mining approaches using hybrid search technique

Data distribution tailoring revisited: cost-efficient integration of representative data

Privacy-preserving data (stream) mining techniques and their impact on data mining accuracy: a systematic literature review

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation