Mining fuzzy high average-utility itemsets using fuzzy utility lists and efficient pruning approach

Hajihoseini, Manijeh; Sohrabi, Mohammad Karim

doi:10.1007/s00500-022-07123-7

Fuzzy systems and their mathematics
Published: 25 May 2022

Volume 26, pages 6063–6086, (2022)

Soft Computing

Manijeh Hajihoseini¹ &
Mohammad Karim Sohrabi¹

Abstract

The fuzzy high average-utility itemset (FHAUI) mining problem accounts for the effect of itemset length on calculated utility, in addition to the number of occurrences of each item in each transaction (the internal utility of the item) and the unit profit of each item (its external utility). FHAUI mining avoids generating large fuzzy high utility itemsets that consist of fuzzy low utility items. Fuzzy theory has been combined with the high average-utility itemset mining problem to make the results easier for users to understand and to provide more useful information, such as determining the overlapping range of the set of discovered items. In this paper, by extending the MHAI method (Yun and Kim in Future Gener Comp Syst 68:346–360, 2017) to fuzzy itemsets using the fuzzy approach of Lan et al. (Appl Soft Comput 30:767–777, 2015), a method called HiFAM is presented for efficient exploration of FHAUIs. The algorithm introduces a fuzzy average-utility list (FAUL) structure, which extends the HAI-list of Yun and Kim (Future Gener Comp Syst 68:346–360, 2017), to summarize the required information of the dataset in a compact form and to explore the FHAUIs without candidate generation. After the FAULs of the items (1-itemsets) are created, the proposed algorithm does not need to re-scan the database, and the FAULs of all (k + 1)-itemsets can be obtained by combining the FAULs of k-itemsets and (k − 1)-itemsets. HiFAM extracts the complete set of FHAUIs through a depth-first exploration process. A pruning technique is also used to prevent the exploration of unpromising itemsets, that is, itemsets whose exploration cannot lead to the discovery of FHAUIs. This pruning strategy effectively reduces the memory consumption and time complexity of the proposed method. Experiments on real and synthetic datasets show the efficiency of the proposed method.
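To make the list-combination idea in the abstract concrete, the following is a minimal sketch of a crisp (non-fuzzy) utility-list join in the style of the list-based miners cited in the references (for example, Liu and Qu 2012). It is not the FAUL structure or the HiFAM algorithm itself: fuzzy membership degrees, upper bounds, and pruning are all omitted, and the toy database, profit table, and every function name are assumptions made only for illustration. The sketch shows how the list of a longer itemset can be derived from the lists of shorter itemsets without re-scanning the database, and how dividing by itemset length yields the average utility.

```python
from typing import Dict, List, Optional

# Hypothetical toy database: each transaction maps item -> purchased quantity
# (the internal utility). Transaction ids (tids) are the list positions.
transactions: List[Dict[str, int]] = [
    {"a": 2, "b": 1},
    {"a": 1, "c": 3},
    {"a": 1, "b": 2, "c": 1},
]
# Hypothetical unit profits (the external utility of each item).
profit: Dict[str, int] = {"a": 5, "b": 2, "c": 1}

# A simplified utility list: tid -> utility of the itemset in that transaction.
UtilityList = Dict[int, int]


def build_item_lists(db: List[Dict[str, int]],
                     unit_profit: Dict[str, int]) -> Dict[str, UtilityList]:
    """Scan the database once and build the list of every 1-itemset."""
    lists: Dict[str, UtilityList] = {}
    for tid, transaction in enumerate(db):
        for item, quantity in transaction.items():
            lists.setdefault(item, {})[tid] = quantity * unit_profit[item]
    return lists


def join(prefix: Optional[UtilityList],
         px: UtilityList, py: UtilityList) -> UtilityList:
    """Build the list of Pxy from the lists of Px and Py.

    `prefix` is the list of the shared prefix P, or None when P is empty.
    The prefix utility is counted in both Px and Py, so it is subtracted once.
    """
    joined: UtilityList = {}
    for tid, ux in px.items():
        if tid in py:
            shared = prefix[tid] if prefix is not None else 0
            joined[tid] = ux + py[tid] - shared
    return joined


def average_utility(ul: UtilityList, length: int) -> float:
    """Total utility of the itemset divided by its number of items."""
    if length <= 0:
        raise ValueError("itemset length must be positive")
    return sum(ul.values()) / length


item_lists = build_item_lists(transactions, profit)      # single database scan
ab = join(None, item_lists["a"], item_lists["b"])        # 2-itemset {a, b}
ac = join(None, item_lists["a"], item_lists["c"])        # 2-itemset {a, c}
abc = join(item_lists["a"], ab, ac)                      # 3-itemset from k and k-1 lists
```

In this toy data the itemset {a, b} has total utility 21 and average utility 10.5, while {a, b, c} has total utility 10 and an average utility of about 3.33. The longer itemset is penalized by its length, which is the effect the average-utility measure is designed to capture.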

Data availability

Our manuscript has no associated data.

References

Agrawal R, Srikant R (1994) Fast algorithms for mining association rules. In: Proc. 20th int conf very large data bases, VLDB'94. pp 487–499
Ahmed CF, Tanbeer SK, Jeong BS, Lee YK (2009) Efficient tree structures for high utility pattern mining in incremental databases. IEEE T Knowl Data En 21(12):1708–1721
Alcalá-Fdez J, Alcalá R, Gacto MJ, Herrera F (2009) Learning the membership function contexts for mining fuzzy association rules by using genetic algorithms. Fuzzy Sets Syst 160(7):905–921
Bai A, Deshpande PS, Dhabu M (2018) Selective database projections based approach for mining high-utility itemsets. IEEE Access 6:14389–14409
Cao L (2010) Domain-driven data mining: Challenges and prospects. IEEE T Knowl Data En 22(6):755–769
Cao L, Zhao Y, Zhang H, Luo D, Zhang C, Park EK (2010) Flexible frameworks for actionable knowledge discovery. IEEE T Knowl Data En 22(9):1299–1312
Cao L, Dong X, Zheng Z (2016) e-NSP: efficient negative sequential pattern mining. Artif Intell 235:156–182
Chan KC, Au WH (1997a) An effective algorithm for mining interesting quantitative association rules. In: Proc 1997 ACM symposium on applied computing, ACM
Chan KC, Au WH (1997b) Mining fuzzy association rules. In: proc 6th int conf Information and knowledge management, ACM
Chan R, Yang Q, Shen YD (2003) Mining high utility itemsets. In Proc. 3rd IEEE Int. Conf. Data Mining, 2003, (Washington, D.C., USA, 2003). pp 19–22
Chen CH, Hong TP, Tseng VS (2009) An improved approach to find membership functions and multiple minimum supports in fuzzy data mining. Expert Syst Appl 36(6):10016–10024
Chen CH, Hong TP, Tseng VS (2012) Fuzzy data mining for time-series data. Appl Soft Comput 12(1):536–542
Chen CH, Li AF, Lee YC (2014) Actionable high-coherent-utility fuzzy itemset mining. Soft Comput 18(12):2413–2424
Dawar S, Goyal V, Bera D (2017) A hybrid framework for mining high-utility itemsets in a sparse transaction database. Appl Intell 47:809–827
Deng ZH (2016) DiffNodesets: an efficient structure for fast mining frequent itemsets. Appl Soft Comput 41:214–223
Deng ZH, Lv SL (2014) Fast mining frequent itemsets using nodesets. Expert Syst Appl 41(10):4505–4512
Deng ZH, Wang ZH (2010) A new fast vertical method for mining frequent itemsets. Int J Comput Int Syst 3(6):733–744
Deng ZH, Wang ZH, Jiang JJ (2012) A new algorithm for fast mining frequent itemsets using n-lists. Sci China Inform Sci 55(9):2008–2030
Erwin A, Gopalan RP, Achuthan NR (2007) A bottom-up projection based algorithm for mining high utility itemsets. In: Proc. 2nd Int. Workshop Integrating artificial intelligence and data mining, 2007, (Gold Coast, Australia, 2007), vol 84. pp 3–11
Erwin A, Gopalan RP, Achuthan NR (2008) Efficient mining of high utility itemsets from large datasets, Pacific-Asia Conf. Advances in knowledge discovery and data mining, 2008, (Osaka, Japan, 2008), In: Washio T, Suzuki E, Ting KM, Inokuchi A (eds) Advances in knowledge discovery and data mining. PAKDD 2008. LNCS, vol 5012. Springer, Berlin, Heidelberg. pp 554–561
Fournier-Viger P, Wu CW, Zida S, Tseng VS (2014) FHM: faster high-utility itemset mining using estimated utility co-occurrence pruning. In: Int. symposium on methodologies for intelligent systems, (Roskilde, Denmark, 2013), In: Andreasen T, Christiansen H, Cubero JC, Raś ZW (eds) Foundations of intelligent systems. ISMIS, LNCS, vol 8502. Springer, Cham, pp 83–92
Gan W, Lin JCW, Fournier-Viger P, Chao HC, Tseng VS, Yu P (2019) A survey of utility-oriented pattern mining. IEEE T Knowl Data En. https://doi.org/10.1109/TKDE.2019.2942594
Gyenesei A (2001) A fuzzy approach for mining quantitative association rules. Acta Cybern 15(2):305–320
Han J, Pei J, Yin Y (2000) Mining frequent patterns without candidate generation. In: Proc. ACM SIGMOD Int. Conf. Management of data, 2000 (Dallas, TX, USA, 2000), ACM SIGMOD Record 29. pp 1–12
Hong TP, Kuo CS, Chi SH (1999) Mining association rules from quantitative data. Intell Data Anal 3(5):363–376
Hong TP, Lin KY, Wang SL (2003) Fuzzy data mining for interesting generalized association rules. Fuzzy Sets Syst 138(2):255–269
Hong TP, Kuo CS, Chi SH (2004) A fuzzy AprioriTid mining algorithm with reduced computational time. Appl Soft Comput 5(1):1–10
Hong TP, Lee CH, Wang SL (2011) Effective utility mining with the measure of average utility. Expert Syst Appl 38(7):8259–8265
Hong TP, Chiang MJ, Wang SL (2002) Mining from quantitative data with linguistic minimum supports and confidences. In: Proceedings of the 2002 IEEE international conference on fuzzy systems (FUZZ-IEEE'02), IEEE
Hong TP, Lin CW, Lin TC, Wang SL (2012) Incremental multiple fuzzy frequent pattern tree. In: Proc IEEE int conf fuzzy systems (FUZZ-IEEE)
Huang TCK (2012) Mining the change of customer behavior in fuzzy time-interval sequential patterns. Appl Soft Comput 12(3):1068–1086
Krishnamoorthy S (2015) Pruning strategies for mining high utility itemsets. Expert Syst Appl 42(5):2371–2381
Kuok CM, Fu A, Wong MH (1998) Mining fuzzy association rules in databases. ACM SIGMOD Rec 27(1):41–46
Lai CP, Chung PC, Tseng VS (2010) A novel algorithm for mining fuzzy high utility itemsets. Int J Innov Comput Inf Control (ICIC) 6(10):4347–4361
Lan GC, Hong TP, Tseng VS (2012a) Efficiently mining high average-utility itemsets with an improved upper-bound strategy. Int J Inf Tech Decis 11(05):1009–1030
Lan GC, Hong TP, Tseng VS (2012b) A projection-based approach for discovering high average-utility itemsets. J Inf Sci Eng 28(1):193–209
Lan GC, Hong TP, Tseng VS (2014) An efficient projection-based indexing approach for mining high utility itemsets. Knowl Inf Syst 38:85–107
Lan GC, Hong TP, Lin YH, Wang SL (2015) Fuzzy utility mining with upper-bound measure. Appl Soft Comput 30:767–777
Lee YC, Hong TP, Wang TC (2008) Multi-level fuzzy mining with multiple minimum supports. Expert Syst Appl 34(1):459–468
Lee H, Shao B, Kang U (2015) Fast graph mining with HBase. Inf Sci 315:56–66
Li YC, Yeh JS, Chang CC (2008) Isolated items discarding strategy for discovering high utility itemsets. Data Knowl Eng 64:198–217
Lin CW, Hong TP (2014) Mining fuzzy frequent itemsets based on UBFFP trees. J Intell Fuzzy Syst 27(1):535–548
Lin JCW, Hong TP, Lu WH (2010a) Linguistic data mining with fuzzy FP-trees. Expert Syst Appl 37(6):4560–4567
Lin JCW, Hong TP, Lu WH (2010b) An efficient tree-based fuzzy data mining approach. Int J Fuzzy Syst 12(2):150–157
Lin JCW, Hong TP, Lu WH (2011) An effective tree structure for mining high utility itemsets. Expert Syst Appl 38(6):7419–7424
Lin JCW, Hong TP, Lin TC (2015) A CMFFP-tree algorithm to mine complete multiple fuzzy frequent itemsets. Appl Soft Comput 28:431–439
Lin JCW, Ren S, Fournier-Viger P, Hong TP (2017) EHAUPM: efficient high average-utility pattern mining with tighter upper bounds. IEEE Access 5:12927–12940
Liu M, Qu J (2012) Mining high utility itemsets without candidate generation. In: Proc. 21st ACM Int. Conf. Information and knowledge management, Maui, Hawaii, USA. pp 55–64
Liu Y, Liao WK, Choudhary A (2005) A two-phase algorithm for fast discovery of high utility itemsets. In: Ho T, Cheung D, Liu H (eds) Pacific-Asia Conf. Advances in knowledge discovery and data mining, 2005, Hanoi, Vietnam. Advances in knowledge discovery and data mining 2005, LNCS, vol 3518, Springer, Heidelberg. pp 689–695
Liu L, Wang K, Fung BCM (2012) Direct discovery of high utility itemsets without candidate generation. In: Proc. 12th IEEE Int. Conf. Data mining, 2012, Washington, D.C., USA. pp 984–989
Lu T, Vo B, Nguyen HT, Hong TP (2014) A new method for mining high average utility itemsets. In: Saeed K, Snášel V (eds) Computer information systems and industrial management. CISIM 2014. Lecture notes in computer science, vol 8838. Springer, Berlin, Heidelberg
Papadimitriou S, Mavroudi S (2005) The fuzzy frequent pattern tree. In: Proc 9th WSEAS int conf computers, ICCOMP'05. pp 1–7
IBM Quest Data Mining Project, Quest synthetic data generation code (1996) Available at http://www.almaden.ibm.com/cs/quest/syndata.htm
Rahmati B, Sohrabi MK (2019) A systematic survey of high utility itemset mining. Int J Inf Tech Decis 18(4):1113–1185
Ryang H, Yun U (2017) Indexed list-based high utility pattern mining with utility upper-bound reduction and pattern combination techniques. Knowl Inf Syst 51:627–659
Sim ATH, Indrawan M, Zutshi S, Srinivasan B (2010) Logic-based pattern discovery. IEEE T Knowl Data En 22(6):798–811
Sohrabi MK (2018) Gossip-based information fusion protocol for distributed frequent itemset mining. Enterp Inf Syst-UK 12(6):674–694
Sohrabi MK, Barforoush AA (2012) Efficient colossal pattern mining in high dimensional datasets. Knowl-Based Syst 33:41–52
Sohrabi MK, Barforoush AA (2013) Parallel frequent itemset mining using systolic arrays. Knowl-Based Syst 37:462–471
Sohrabi MK, Roshani R (2017) Frequent itemset mining using cellular learning automata. Comput Hum Behav 68:244–253
Sohrabi MK, Ghods V (2014) Top-down vertical itemset mining. In: Proc. 6th Int. Conf. Graphic and image processing, ICGIP. pp 94431V–94431V7
Sohrabi MK, Ghods V (2016) CUSE: A novel cube-based approach for sequential pattern mining. In: Proc 4th int symposium on Computational and Business Intelligence (ISCBI), 2016, Olten, Switzerland. pp 186–190
Srikant R, Agrawal R (1996) Mining quantitative association rules in large relational tables. ACM SIGMOD Rec 25(2):1–12
Tran MT, Le B, Vo B (2015) Combination of dynamic bit vectors and transaction information for mining frequent closed sequences efficiently. Eng Appl Artif Intel 38:183–189
Truong T, Duong H, Le HB, Fournier-Viger P (2019) Efficient vertical mining of high average-utility itemsets based on novel upper-bounds. IEEE T Knowl Data En 31(2):301–314
Tseng VS, Shie BE, Wu CW, Yu PS (2013) Efficient algorithms for mining high utility itemsets from transactional databases. IEEE T Knowl Data En 25:1772–1786
Tseng VS, Wu CW, Shie BE, Yu PS (2010) UP-Growth: An efficient algorithm for high utility itemset mining. In Proc. 16th ACM SIGKDD Int. Conf. Knowledge discovery and data mining, 2010, Washington, D.C., USA. pp 253–262
Wang CM, Chen SH, Huang YF (2009) A fuzzy approach for mining high utility quantitative itemsets. In: 2009 IEEE int conf fuzzy systems, FUZZ-IEEE
Wu JMT, Lin JCW, Pirouz M, Fournier-Viger P (2018) TUB-HAUPM: tighter upper bound for mining high average-utility patterns. IEEE Access 6:18655–18669
Yao H, Hamilton HJ (2006) Mining itemset utilities from transaction databases. Data Knowl Eng 59:603–626
Yue JS, Tseng E, Yeung D, Shi D (2000) Mining fuzzy association rules with weighted items. In: Proc 2000 IEEE int conf systems, man and cybernetics. pp 1906–1911
Yun U, Kim D (2017) Mining of high average-utility itemsets using novel list structure and pruning strategy. Future Gener Comp Syst 68:346–360
Yun U, Lee G (2016) Incremental mining of weighted maximal frequent itemsets from dynamic databases. Expert Syst Appl 54:304–327
Zida S, Fournier-Viger P, Lin JCW, Wu C, Tseng VS (2017) EFIM: a fast and memory efficient algorithm for high-utility itemset mining. Knowl Inf Syst 51:595–625

Funding

The authors have not disclosed any funding.

Author information

Authors and Affiliations

Department of Computer Engineering, Semnan Branch, Islamic Azad University, Semnan, Iran
Manijeh Hajihoseini & Mohammad Karim Sohrabi

Corresponding author

Correspondence to Mohammad Karim Sohrabi.

Ethics declarations

Conflict of interest

Manijeh Hajihoseini (first author) declares that she has no conflict of interest. Mohammad Karim Sohrabi (second author) declares that he has no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

About this article

Cite this article

Hajihoseini, M., Sohrabi, M.K. Mining fuzzy high average-utility itemsets using fuzzy utility lists and efficient pruning approach. Soft Comput 26, 6063–6086 (2022). https://doi.org/10.1007/s00500-022-07123-7

Accepted: 07 April 2022
Published: 25 May 2022
Issue Date: July 2022
DOI: https://doi.org/10.1007/s00500-022-07123-7
