Extracting Top-k High Utility Patterns from Multi-level Transaction Databases

Le, Tuan M.; Nguyen, Trinh D. D.; Nguyen, Loan T. T.; Kozierkiewicz, Adrianna; Tung, N. T.

doi:10.1007/978-981-99-5834-4_24

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 13995))

Included in the following conference series:

Asian Conference on Intelligent Information and Database Systems

302 Accesses
1 Citations

Abstract

Several approaches have been introduced to solve the problem of high utility pattern mining (HUPM). However, the proposed algorithms require a minimum utility threshold before execution. This task is impractical for end users as they do not know utility distributions in the transaction datasets. The output will contain too many patterns if this value is too low. In contrast, if the threshold is set too high, the result would be empty or insufficient for analysis. Recently, HUPM was extended to work with hierarchical transaction datasets. With the search space of the mining task expanded, selecting a proper threshold is far more challenging. To address this issue, we propose a top-\(k\) high utility pattern mining method from multi-level transactions databases. The users only need to specify a \(k\) value, denotes the desired number of patterns of interest. To the best of our knowledge, the method proposed in our work is the first to address this mining topic. Experiments on both real and synthetic hierarchical datasets were extensively conducted to evaluate the performance of the proposed algorithm.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 59.99; Price excludes VAT (USA)

Softcover Book: USD 79.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
Source: https://github.com/arunkjn/foodmart-mysql.

References

Fournier-Viger, P., Lin, J.C.W., Vo, B., Chi, T.T., Zhang, J., Le, H.B.: A survey of itemset mining. Wiley Interdiscip. Rev. Data Min. Knowl. Discov. 7(4) (2017)
Google Scholar
Agrawal, R., Srikant, R.: Fast algorithms for mining association rules in large databases. In: 20th International Conference on Very Large Data Bases (VLDB’94), Morgan Kaufmann Publishers Inc., pp. 487–499 (1994)
Google Scholar
Yao, H., Hamilton, H.J., Butz, G.J.: A foundational approach to mining itemset utilities from databases. In: SIAM International Conference on Data Mining, pp. 482–486 (2004)
Google Scholar
Fournier-Viger, P., Lin, J.C.-W., Truong-Chi, T., Nkambou, R.: A survey of high utility itemset mining. In: High-Utility Pattern Mining: Theory, Algorithms and Applications, Fournier-Viger, P., Lin, J.C.-W., Nkambou, R., Vo, B., Tseng, V.S. (eds.), Springer International Publishing, Cham, pp. 1–45 (2019)
Google Scholar
Cagliero, L., Chiusano, S., Garza, P., Ricupero, G.: Discovering high-utility itemsets at multiple abstraction levels. In: Kirikova, M., et al. (eds.) European Conference on Advances in Databases and Information Systems, pp. 224–234. Springer International Publishing, Cham (2017)
Google Scholar
Nouioua, M., Wang, Y., Fournier-Viger, P., Lin, J.C.-W., Wu, J.M.-T.: TKC: Mining top-k cross-level high utility itemsets. In: 2020 International Conference on Data Mining Workshops (ICDMW), pp. 673–682 (2020)
Google Scholar
Tung, N.T., Nguyen, L.T.T., Nguyen, T.D.D., Vo, B.: An efficient method for mining multi-level high utility Itemsets. Appl. Intell. 52(5), 5475–5496 (2022)
Article Google Scholar
Tung, N.T., Nguyen, L.T.T., Nguyen, T.D.D., Fourier-Viger, P., Nguyen, N.T., Vo, B.: Efficient mining of cross-level high-utility itemsets in taxonomy quantitative databases. Inf. Sci. (Ny) 587, 41–62 (2022)
Article Google Scholar
Nguyen, T. D.D., Nguyen, L.T.T., Kozierkiewicz, A., Pham, T., Vo, B.: An efficient approach for mining high-utility itemsets from multiple abstraction levels. In: Intelligent Information and Database Systems., Springer International Publishing, pp. 92–103 (2021). https://doi.org/10.1007/978-3-030-73280-6_8
Baralis, E., Cagliero, L., Cerquitelli, T., D’Elia, V., Garza, P.: Expressive generalized itemsets. Inf. Sci. (Ny) 278, 327–343 (2014)
Article MathSciNet MATH Google Scholar
Liu, M., Qu, J.: Mining high utility itemsets without candidate generation. In: ACM International Conference Proceeding Series, pp. 55–64 (2012)
Google Scholar
Fournier-Viger, P., Wu, C.W., Zida, S., Tseng, V.S.: FHM: Faster high-utility itemset mining using estimated utility co-occurrence pruning. In: International Symposium on Methodologies for Intelligent Systems, pp. 83–92 (2014)
Google Scholar
Nguyen, L.T.T., Nguyen, P., Nguyen, T.D.D., Vo, B., Fournier-Viger, P., Tseng, V.S.: Mining high-utility itemsets in dynamic profit databases. Knowledge-Based Syst. 175, 130–144 (2019)
Article Google Scholar
Tseng, V.S., Wu, C.W., Fournier-Viger, P., Yu, P.S.: Efficient algorithms for mining top-K high utility itemsets. IEEE Trans. Knowl. Data Eng. 28(1), 54–67 (2016)
Article Google Scholar
Ryang, H., Yun, U.: Top-k high utility pattern mining with effective threshold raising strategies. Knowledge-Based Syst. 76, 109–126 (2015)
Article Google Scholar
Krishnamoorthy, S.: Mining top-k high utility itemsets with effective threshold raising strategies. Expert Syst. Appl. 117, 148–165 (2019)
Article Google Scholar
Fournier-Viger, P., Yang, Y., Lin, J.C.-W., Luna, J.M., Ventura, S.: Mining cross-level high utility itemsets. In: 33rd International Conference on Industrial, p. 12. Springer, Engineering and Other Applications of Applied Intelligent Systems (2020)
Google Scholar
Liu, Y., Liao, W.K., Choudhary, A.: A two-phase algorithm for fast discovery of high utility itemsets. In: 9th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining, in PAKDD’05, vol. 3518. Springer-Verlag, pp. 689–695 (2005)
Google Scholar
Fournier-Viger, P., et al.: The SPMF open-source data mining library version 2. In: Joint European Conference on Machine Learning and Knowledge Discovery in Databases, pp. 36–40 (2016)
Google Scholar

Download references

Acknowledgment

This research is funded by Vietnam National University HoChiMinh City (VNU-HCM) under grant number B2023-28-02.

Author information

Authors and Affiliations

School of Computer Science and Engineering, International University, Ho Chi Minh City, Vietnam
Tuan M. Le & Loan T. T. Nguyen
Vietnam National University, Ho Chi Minh City, Vietnam
Tuan M. Le & Loan T. T. Nguyen
Faculty of Information Technology, Industrial University of Ho Chi Minh City, Ho Chi Minh City, Vietnam
Trinh D. D. Nguyen
Faculty of Computer Science and Management, Wroclaw University of Science and Technology, Wrocław, Poland
Adrianna Kozierkiewicz
Faculty of Information Technology, HUTECH University, Ho Chi Minh City, Vietnam
N. T. Tung

Authors

Tuan M. Le
View author publications
You can also search for this author in PubMed Google Scholar
Trinh D. D. Nguyen
View author publications
You can also search for this author in PubMed Google Scholar
Loan T. T. Nguyen
View author publications
You can also search for this author in PubMed Google Scholar
Adrianna Kozierkiewicz
View author publications
You can also search for this author in PubMed Google Scholar
N. T. Tung
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Loan T. T. Nguyen .

Editor information

Editors and Affiliations

Wrocław University of Science and Technology, Wrocław, Poland
Ngoc Thanh Nguyen
King Mongkut’s Institute of Technology Ladkrabang, Bangkok, Thailand
Siridech Boonsang
Iwate Prefectural University Iwate, Iwate, Japan
Hamido Fujita
Wroclaw University of Science and Technology, Wrocław, Poland
Bogumiła Hnatkowska
National University of Kaohsiung, Kaohsiung, Taiwan
Tzung-Pei Hong
King Mongkut’s Institute of Technology Ladkrabang, Bangkok, Thailand
Kitsuchart Pasupa
Malaysia Japan International Institute of Technology, Kuala Lumpur, Malaysia
Ali Selamat

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Le, T.M., Nguyen, T.D.D., Nguyen, L.T.T., Kozierkiewicz, A., Tung, N.T. (2023). Extracting Top-k High Utility Patterns from Multi-level Transaction Databases. In: Nguyen, N.T., et al. Intelligent Information and Database Systems. ACIIDS 2023. Lecture Notes in Computer Science(), vol 13995. Springer, Singapore. https://doi.org/10.1007/978-981-99-5834-4_24

Download citation

DOI: https://doi.org/10.1007/978-981-99-5834-4_24
Published: 05 September 2023
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-5833-7
Online ISBN: 978-981-99-5834-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics