Skip to main content
Log in

High Utility Item-set Mining from retail market data stream with various discount strategies using EGUI-tree

  • Original Research
  • Published:
Journal of Ambient Intelligence and Humanized Computing Aims and scope Submit manuscript

Abstract

High Utility Item-set Mining (HUIM) is the futuristic remodel version of Frequent Item-set Mining (FIM). It discovers customer purchase trends in the retail market. This knowledge is useful to retailers to incorporate various innovative schemes in their businesses to attract the customers such as discounts, cross-marketing, seasonal sale offers…etc. Even though many HUIM algorithms are available to detect profitable patterns, most of them cannot apply to all kinds of retail market data sets due to certain assumptions. The first assumption is that the items always produce a positive profit. Even though purchased items’ overall profit could be positive, few items may have negative profit. Another assumption is they are built for static transactional data. The data is gathered up to the point of time and is used for analysis. It is helpful to make decisions at some intervals like quarterly, half-yearly, yearly. But, to take decisions at any time by analyzing the present sales trend, it is required to process the data stream. This paper presents an innovative idea named Extended Global Utility Item-sets Tree(EGUI-tree) to extract High utility item-sets in the retail market data stream with positive and negative profit items. The sliding window-based technique is applied to the data stream to pick up the very recent data to process. An experimental study on real-world datasets shows that the proposed EGUI-tree algorithm is faster and scalable.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

References

  • Agrawal R, Imielinski T, Swami A (1993) Database mining: a performance perspective. IEEE Trans Knowl Data Eng 5(6):914–925

    Article  Google Scholar 

  • Bansal R, Dawar S, Goyal V (2015) An efficient algorithm for mining high-utility itemsets with discount notion. Springer, Berlin, pp 84–98

    Google Scholar 

  • Borah A, Nath B (2019) Rare pattern mining: challenges and future perspectives. Complex Intell Syst 5:1–23

    Article  Google Scholar 

  • Fournier V, Philippe L, Chun-Wei R, Uday K, Yun S, Thomas R (2017) A survey of sequential pattern mining. Data Sci Pattern Recognit 1:54–77

    Google Scholar 

  • Fournier-Viger P, Chun-Wei Lin J, Truong-Chi T, Nkambou R (2019) A survey of high utility itemset mining. In: Fournier-Viger P, Lin JW, Nkambou R, Vo B, Tseng V (eds) High-utility pattern mining. Studies in big data. Springer, Berlin

    Chapter  Google Scholar 

  • Gan W, Lin JCW, Fournier-Viger P, Chao HC, Tseng VS, Yu PS (2021) A survey of utility-oriented pattern mining. IEEE Trans Knowl Data Eng 33(4):1306–1327. https://doi.org/10.1109/TKDE.2019.2942594

    Article  Google Scholar 

  • Han J, Pei J, Yin Y (2000) Mining frequent patterns without candidate generation. SIGMOD Rec 29(2):1–12

    Article  Google Scholar 

  • Krishnamoorthy S (2015) Pruning strategies for mining high utility itemsets. Expert Syst Appl 42:2371–2381

    Article  Google Scholar 

  • Lee V, Jin R, Agrawal G (2014) Frequent pattern mining in data streams. In: Aggarwal C, Han J (eds) Frequent pattern mining. Springer, Cham

    Google Scholar 

  • Li H, Huang H, Lee S (2011) Fast and memory efficient mining of high-utilityitemsets from data streams: with and without negative item profits. Knowl Inf Syst 28:495–522

    Article  Google Scholar 

  • Lin JC-W, Fournier-Viger P, Gan W (2016a) FHN: an efficient algorithm for mining high-utility itemsets with negative unit profits. Knowl Based Syst 30:109–126

    Google Scholar 

  • Lin C-W, Gan W, Viger F, Philippe H, Tzung-Pei H, Tsengs V (2016b) Fast algorithms for mining high-utilityitemsets with various discount strategies. Adv Eng Inform 30:109–126

    Article  Google Scholar 

  • Rakesh A, Ramakrishnan S (1994) Fast algorithms for mining association rules in large databases. In: Proceedings of the 20th International Conference on Very Large Data Bases (VLDB ’94), pp 487–499

  • Singh K, Shakya HK, Singh A (2018) Mining of high-utility item sets with negative utility. Expert Syst 35(8):e12296

    Article  Google Scholar 

  • Singh K, Singh SS, Kumar A, Biswas B (2019) High utility itemsets mining with negative utility value: a survey. J Intell Fuzzy Syst 35(6):6551–6562

    Article  Google Scholar 

  • Truong-Chi T, Fournier-Viger P (2019) A survey of high utility sequential pattern mining. In: Fournier-Viger P, Lin JW, Nkambou R, Vo B, Tseng V (eds) High-utility pattern mining. Studies in big data. Springer, Berlin

    Google Scholar 

  • Tseng V, Wu C-W, Viger F, Philippe Y (2015) Efficient algorithms for mining top-K High utility item sets. IEEE Trans Knowl Data Eng 28:1–1

    Google Scholar 

  • Yun U, Lee G, Yoon E (2017) Efficient high utility pattern mining for establishing manufacturing plans with sliding window control. IEEE Trans Industr Electron 64(9):7239–7249

    Article  Google Scholar 

  • Zhang C, Almpanidis G, Wang W, Liu C (2018) An Empirical Evaluation of High Utility Itemset Mining Algorithms. Expert Syst Appl 101:91–115

    Article  Google Scholar 

  • Zhang C, Han M, Sun R, Du S, Shen M (2020) A Survey of key technologies for high utility patterns mining. IEEE Access 8:55798–55814

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Pandillapalli Amaranatha Reddy.

Additional information

Publisher’s Note

Publisher's Note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Amaranatha Reddy, P., Hazarath Murali Krishna Prasad, M. High Utility Item-set Mining from retail market data stream with various discount strategies using EGUI-tree. J Ambient Intell Human Comput 14, 871–882 (2023). https://doi.org/10.1007/s12652-021-03341-3

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s12652-021-03341-3

Keywords

Navigation