A novel approach to discover frequent weighted subgraphs using the average measure

Le, Ngoc-Thao; Vo, Bay; Yun, Unil; Le, Bac

doi:10.1007/s10489-023-04501-y

A novel approach to discover frequent weighted subgraphs using the average measure

Published: 08 March 2023

Volume 53, pages 19491–19504, (2023)
Cite this article

Applied Intelligence Aims and scope Submit manuscript

Ngoc-Thao Le^1,2,3,
Bay Vo³,
Unil Yun⁴ &
…
Bac Le ORCID: orcid.org/0000-0002-4306-6945^1,2

273 Accesses
1 Altmetric
Explore all metrics

Abstract

Mining a weighted single large graph has recently attracted many researchers. The WeGraMi algorithm is considered the state-of-the-art among current approaches. It uses a MaxMin measure to calculate weights for all mined subgraphs. However, if all values in the domain have the same role and the user needs an average value of that domain, then we have to use another measure. In this paper, we introduce a novel algorithm called AWeGraMi (Average Weighted Graph Mining) to solve the above problem, and our method calculates the weight based on the average of all values in the domain. We also apply the MaxMin measure as an upper-bound to prune the search space. The new algorithm can mine all frequent weighted subgraphs effectively. Our experiments on the directed and undirected datasets have shown that AWeGraMi has better performance in comparison to post-processing GraMi for all three criteria: search space (the number of candidate subgraphs), running time, and memory consumption.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 3

An efficient and scalable approach for mining subgraphs in a single large graph

Article 06 April 2022

Frequent Closed Subgraph Mining: A Multi-thread Approach

Optimized Candidate Generation for Frequent Subgraph Mining in a Single Graph

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

References

Nguyen LBQ, Vo B, Le N-T, Snasel V, Zelinka I (2020) Fast and scalable algorithms for mining subgraphs in a single large graph. Eng Appl Artif Intell 90:103539
Article Google Scholar
Le N-T, Vo B, Nguyen LBQ, Fujita H, Le B (2020) Mining weighted subgraphs in a single large graph. Inf Sci 514:149–165
Article MathSciNet MATH Google Scholar
Le N-T, Vo B, Nguyen LBQ, Le B (2022) “OWGraMi: Efficient Method for Mining Weighted Subgraphs in a Single Graph”, Expert Systems with Applications
Elseidy M, Abdelhamid E, Skiadopoulos S, Kalnis P (2014) Grami: Frequent subgraph and pattern mining in a single large graph. Proc VLDB Endowm 7(7):517–528
Article Google Scholar
Yan X, Han J (2002) “gspan: Graph-based substructure pattern mining,” in 2002 IEEE International Conference on Data Mining, 2002. Proceedings., pp. 721–724.
Nabti C, Seba H (2016) Subgraph isomorphism search in massive graph databases. Université de Lyon, Doctoral dissertation
Book Google Scholar
Lin JC-W, Ren S, Fournier-Viger P (2018) MEMU: more efficient algorithm to mine high average-utility patterns with multiple minimum average-utility thresholds. IEEE Access 6:7593–7609
Article Google Scholar
Gan W, Lin JC-W, Zhang J, Yu PS (2020) Utility mining across multi-sequences with individualized thresholds. ACM Transac Data Sci 1(2):1–29
Article Google Scholar
Nguyen LBQ (2022) “Efficient Methods for Mining Subgraphs in a Single Large Graph,” Doctoral dissertation, VSB – Technical University of Ostrava
Lin JC-W, Li T, Fournier-Viger P, Zhang J, Guo X, “Mining of high average-utility patterns with item-level thresholds”, J Int Technol, vol. 20, no. 1, pp. 187–194, 2019.
Agrawal R, Srikant R (1994) Fast algorithms for mining association rules. Proc 20th Int Conf Large Data Bases, VLDB 1215:487–499
Google Scholar
Grahne G, Zhu J (2005) Fast algorithms for frequent itemset mining using fp-trees. IEEE Trans Knowl Data Eng 17(10):1347–1362
Article Google Scholar
Zaki MJ, Hsiao C-J (2005) Efficient algorithms for mining closed itemsets and their lattice structure. IEEE Trans Knowl Data Eng 17(4):462–478
Article Google Scholar
Vo B, Hong T-P, Le B (2012) DBV-Miner: A Dynamic Bit-Vector approach for fast mining frequent closed itemsets. Expert Syst Appl 39(8):7196–7206
Article Google Scholar
Bui H, Vo B, Nguyen H, Nguyen-Hoang T-A, Hong T-P (2018) A weighted N-list-based method for mining frequent weighted itemsets. Expert Syst Appl 96:388–405
Article Google Scholar
Vo B, Pham S, Le T, Deng Z-H (2017) A novel approach for mining maximal frequent patterns. Expert Syst Appl 73:178–186
Article Google Scholar
Nguyen LTT, Vu VV, Lam MTH, Duong TTM, Manh LT, Nguyen TTT, Vo B, Fujita H (2019) An efficient method for mining high utility closed itemsets. Inf Sci 495:78–99
Article Google Scholar
Nguyen LTT, Nguyen P, Nguyen TDD, Vo B, Fournier-Viger P, Tseng VS (2019) Mining high-utility itemsets in dynamic profit databases. Knowled-Based Syst 175:130–144
Article Google Scholar
Fournier-Viger P, Zhang Y, Lin JC-W, Fujita H, Koh YS (2019) Mining local and peak high utility itemsets. Inf Sci 481:344–367
Article MathSciNet Google Scholar
Wang X, Xu Y, Zhan H (2020) Extending association rules with graph patterns. Expert Syst Appl 141:112897
Article Google Scholar
Nguyen LBQ, Zelinka I, Snasel V, Nguyen LTT, Vo B (2022) “Subgraph mining in a large graph: A review,” Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, p. e1454
Nguyen LBQ, Nguyen LTT, Vo B, Zelinka I, Lin JCW, Yun U, Nguyen HS (2022) An efficient and scalable approach for mining subgraphs in a single large graph. Appl Intell 52:1–15
Article Google Scholar
Talukder N, Zaki MJ (2016) “Parallel graph mining with dynamic load balancing,” in 2016 IEEE International Conference on Big Data (Big Data). pp. 3352–3359
Abdelhamid E, Abdelaziz I, Kalnis P, Khayyat Z, Jamour F (2016) “Scalemine: Scalable parallel frequent subgraph mining in a single large graph,” in SC’16: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis. pp. 716–727
Qiao F, Zhang X, Li P, Ding Z, Jia S, Wang H (2018) A parallel approach for frequent subgraph mining in a single large graph using spark. Appl Sci 8(2):230
Article Google Scholar
Fournier-Viger P et al (2020) A survey of pattern mining in dynamic graphs. Wiley Interdiscip Rev: Data Mining Knowled Discover 10(6):e1372
Google Scholar
Pedrycz W (2020) The benefits and drawbacks of data mining technologies. Wiley Interdiscip Rev: Data Mining Knowled Discover 10(1):e1344
Google Scholar
Nguyen LBQ, Zelinka I, Diep QB (2021) CCGraMi: An Effective Method for Mining Frequent Subgraphs in a Single Large Graph. MENDEL 27(2):90–99
Article Google Scholar
Nguyen LBQ, Nguyen LTT, Zelinka I, Snasel V, Nguyen HS, Vo B (2021) “A Method for Closed Frequent Subgraph Mining in a Single Large Graph”, IEEE Access
Teixeira CHC, Fonseca AJ, Serafini M, Siganos G, Zaki MJ, Aboulnaga A (2015) “Arabesque: a system for distributed graph mining”, In: Proceedings of the 25th Symposium on Operating Systems Principles, pp. 425–440
Talukder N, Zaki MJ (2016) A distributed approach for graph mining in massive networks. Data Min Knowl Disc 30(5):1024–1052
Article MathSciNet MATH Google Scholar
Zhao X, Chen Y, Xiao C, Ishikawa Y, Tang J (2016) Frequent subgraph mining based on Pregel. Comput J 59(8):1113–1128
Article Google Scholar
Kuramochi M, Karypis G (2005) Finding frequent patterns in a large sparse graph. Data Min Knowl Disc 11(3):243–271
Article MathSciNet Google Scholar
Kuramochi M, Karypis G (2004) “Grew-a scalable frequent subgraph discovery algorithm,” in Fourth IEEE International Conference on Data Mining (ICDM’04). pp. 439–442
Chen C, Yan X, Zhu F, Han J (2007) “gapprox: Mining frequent approximate patterns from a massive network”, In: Seventh IEEE International Conference on Data Mining (ICDM 2007). pp. 445–450
Guzmán-Ponce A, Marcial-Romero JR, Valdovinos-Rosas RM, Sánchez-Garreta JS (2020) Weighted complete graphs for condensing data. Electron Notes Theoret Comput Sci 354:45–60
Article MathSciNet MATH Google Scholar
Liu N, Li D, Zhang Y, Li X (2020) Large-scale graph processing systems: a survey. Front Inform Technol Electr Eng 21(3):384–404
Article Google Scholar
Sun L, Huang X, Li R, Choi B, Xu J (2020) “Index-based intimate-core community search in large weighted graphs,” IEEE Transactions on Knowledge and Data Engineering
Ramalingeswara Rao T, Ghosh SK, Goswami A (2021) Mining user–user communities for a weighted bipartite network using spark GraphFrames and Flink Gelly. J Supercomput 77(6):5984–6035
Article Google Scholar
Preti G, Lissandrini M, Mottin D, Velegrakis Y (2021) Mining patterns in graphs with multiple weights. Distrib Paral Datab 39(2):281–319
Article Google Scholar
Lin JC-W, Gan W, Fournier-Viger P, Hong T-P, Zhan J (2016) Efficient mining of high-utility itemsets using multiple minimum utility thresholds. Knowl-Based Syst 113:100–115
Article Google Scholar
Liu X, Wang X (2021) Cohesive subgraph identification in weighted bipartite graphs. Appl Sci 11(19):9051
Article Google Scholar
Raayatpanah MA, Khodayifar S, Weise T, Pardalos P (2022) A novel approach to subgraph selection with multiple weights on arcs. J Comb Optim 44(1):242–268
Article MathSciNet MATH Google Scholar
Hu Y, Xiao F (2022) “Time Series Forecasting Based on Fuzzy Cognitive Visibility Graph and Weighted Multi-Subgraph Similarity,” IEEE Transactions on Fuzzy Systems
Li M-W, Xu D-Y, Geng J, Hong W-C (2022) A hybrid approach for forecasting ship motion using CNN–GRU–AM and GCWOA. Appl Soft Comput 114:108084
Article Google Scholar

Download references

Author information

Authors and Affiliations

Faculty of Information Technology, University of Science, Ho Chi Minh City, Vietnam
Ngoc-Thao Le & Bac Le
Vietnam National University, Ho Chi Minh City, Vietnam
Ngoc-Thao Le & Bac Le
Faculty of Information Technology, HUTECH University, Ho Chi Minh City, Vietnam
Ngoc-Thao Le & Bay Vo
Department of Computer Engineering, Sejong University, Seoul, Republic of Korea
Unil Yun

Authors

Ngoc-Thao Le
View author publications
You can also search for this author in PubMed Google Scholar
Bay Vo
View author publications
You can also search for this author in PubMed Google Scholar
Unil Yun
View author publications
You can also search for this author in PubMed Google Scholar
Bac Le
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Bac Le.

Ethics declarations

Competing interests

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Le, NT., Vo, B., Yun, U. et al. A novel approach to discover frequent weighted subgraphs using the average measure. Appl Intell 53, 19491–19504 (2023). https://doi.org/10.1007/s10489-023-04501-y

Download citation

Accepted: 02 February 2023
Published: 08 March 2023
Issue Date: August 2023
DOI: https://doi.org/10.1007/s10489-023-04501-y

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A novel approach to discover frequent weighted subgraphs using the average measure

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

An efficient and scalable approach for mining subgraphs in a single large graph

Frequent Closed Subgraph Mining: A Multi-thread Approach

Optimized Candidate Generation for Frequent Subgraph Mining in a Single Graph

References

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

Navigation

A novel approach to discover frequent weighted subgraphs using the average measure

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

An efficient and scalable approach for mining subgraphs in a single large graph

Frequent Closed Subgraph Mining: A Multi-thread Approach

Optimized Candidate Generation for Frequent Subgraph Mining in a Single Graph

Explore related subjects

References

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now

Search

Navigation