Skip to main content
Log in

A novel approach to discover frequent weighted subgraphs using the average measure

  • Published:
Applied Intelligence Aims and scope Submit manuscript

Abstract

Mining a weighted single large graph has recently attracted many researchers. The WeGraMi algorithm is considered the state-of-the-art among current approaches. It uses a MaxMin measure to calculate weights for all mined subgraphs. However, if all values in the domain have the same role and the user needs an average value of that domain, then we have to use another measure. In this paper, we introduce a novel algorithm called AWeGraMi (Average Weighted Graph Mining) to solve the above problem, and our method calculates the weight based on the average of all values in the domain. We also apply the MaxMin measure as an upper-bound to prune the search space. The new algorithm can mine all frequent weighted subgraphs effectively. Our experiments on the directed and undirected datasets have shown that AWeGraMi has better performance in comparison to post-processing GraMi for all three criteria: search space (the number of candidate subgraphs), running time, and memory consumption.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3 
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

References

  1. Nguyen LBQ, Vo B, Le N-T, Snasel V, Zelinka I (2020) Fast and scalable algorithms for mining subgraphs in a single large graph. Eng Appl Artif Intell 90:103539

    Article  Google Scholar 

  2. Le N-T, Vo B, Nguyen LBQ, Fujita H, Le B (2020) Mining weighted subgraphs in a single large graph. Inf Sci 514:149–165

    Article  MathSciNet  MATH  Google Scholar 

  3. Le N-T, Vo B, Nguyen LBQ, Le B (2022) “OWGraMi: Efficient Method for Mining Weighted Subgraphs in a Single Graph”, Expert Systems with Applications

  4. Elseidy M, Abdelhamid E, Skiadopoulos S, Kalnis P (2014) Grami: Frequent subgraph and pattern mining in a single large graph. Proc VLDB Endowm 7(7):517–528

    Article  Google Scholar 

  5. Yan X, Han J (2002) “gspan: Graph-based substructure pattern mining,” in 2002 IEEE International Conference on Data Mining, 2002. Proceedings., pp. 721–724.

  6. Nabti C, Seba H (2016) Subgraph isomorphism search in massive graph databases. Université de Lyon, Doctoral dissertation

    Book  Google Scholar 

  7. Lin JC-W, Ren S, Fournier-Viger P (2018) MEMU: more efficient algorithm to mine high average-utility patterns with multiple minimum average-utility thresholds. IEEE Access 6:7593–7609

    Article  Google Scholar 

  8. Gan W, Lin JC-W, Zhang J, Yu PS (2020) Utility mining across multi-sequences with individualized thresholds. ACM Transac Data Sci 1(2):1–29

    Article  Google Scholar 

  9. Nguyen LBQ (2022) “Efficient Methods for Mining Subgraphs in a Single Large Graph,” Doctoral dissertation, VSB – Technical University of Ostrava

  10. Lin JC-W, Li T, Fournier-Viger P, Zhang J, Guo X, “Mining of high average-utility patterns with item-level thresholds”, J Int Technol, vol. 20, no. 1, pp. 187–194, 2019.

  11. Agrawal R, Srikant R (1994) Fast algorithms for mining association rules. Proc 20th Int Conf Large Data Bases, VLDB 1215:487–499

    Google Scholar 

  12. Grahne G, Zhu J (2005) Fast algorithms for frequent itemset mining using fp-trees. IEEE Trans Knowl Data Eng 17(10):1347–1362

    Article  Google Scholar 

  13. Zaki MJ, Hsiao C-J (2005) Efficient algorithms for mining closed itemsets and their lattice structure. IEEE Trans Knowl Data Eng 17(4):462–478

    Article  Google Scholar 

  14. Vo B, Hong T-P, Le B (2012) DBV-Miner: A Dynamic Bit-Vector approach for fast mining frequent closed itemsets. Expert Syst Appl 39(8):7196–7206

    Article  Google Scholar 

  15. Bui H, Vo B, Nguyen H, Nguyen-Hoang T-A, Hong T-P (2018) A weighted N-list-based method for mining frequent weighted itemsets. Expert Syst Appl 96:388–405

    Article  Google Scholar 

  16. Vo B, Pham S, Le T, Deng Z-H (2017) A novel approach for mining maximal frequent patterns. Expert Syst Appl 73:178–186

    Article  Google Scholar 

  17. Nguyen LTT, Vu VV, Lam MTH, Duong TTM, Manh LT, Nguyen TTT, Vo B, Fujita H (2019) An efficient method for mining high utility closed itemsets. Inf Sci 495:78–99

    Article  Google Scholar 

  18. Nguyen LTT, Nguyen P, Nguyen TDD, Vo B, Fournier-Viger P, Tseng VS (2019) Mining high-utility itemsets in dynamic profit databases. Knowled-Based Syst 175:130–144

    Article  Google Scholar 

  19. Fournier-Viger P, Zhang Y, Lin JC-W, Fujita H, Koh YS (2019) Mining local and peak high utility itemsets. Inf Sci 481:344–367

    Article  MathSciNet  Google Scholar 

  20. Wang X, Xu Y, Zhan H (2020) Extending association rules with graph patterns. Expert Syst Appl 141:112897

    Article  Google Scholar 

  21. Nguyen LBQ, Zelinka I, Snasel V, Nguyen LTT, Vo B (2022) “Subgraph mining in a large graph: A review,” Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, p. e1454

  22. Nguyen LBQ, Nguyen LTT, Vo B, Zelinka I, Lin JCW, Yun U, Nguyen HS (2022) An efficient and scalable approach for mining subgraphs in a single large graph. Appl Intell 52:1–15

    Article  Google Scholar 

  23. Talukder N, Zaki MJ (2016) “Parallel graph mining with dynamic load balancing,” in 2016 IEEE International Conference on Big Data (Big Data). pp. 3352–3359

  24. Abdelhamid E, Abdelaziz I, Kalnis P, Khayyat Z, Jamour F (2016) “Scalemine: Scalable parallel frequent subgraph mining in a single large graph,” in SC’16: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis. pp. 716–727

  25. Qiao F, Zhang X, Li P, Ding Z, Jia S, Wang H (2018) A parallel approach for frequent subgraph mining in a single large graph using spark. Appl Sci 8(2):230

    Article  Google Scholar 

  26. Fournier-Viger P et al (2020) A survey of pattern mining in dynamic graphs. Wiley Interdiscip Rev: Data Mining Knowled Discover 10(6):e1372

    Google Scholar 

  27. Pedrycz W (2020) The benefits and drawbacks of data mining technologies. Wiley Interdiscip Rev: Data Mining Knowled Discover 10(1):e1344

    Google Scholar 

  28. Nguyen LBQ, Zelinka I, Diep QB (2021) CCGraMi: An Effective Method for Mining Frequent Subgraphs in a Single Large Graph. MENDEL 27(2):90–99

    Article  Google Scholar 

  29. Nguyen LBQ, Nguyen LTT, Zelinka I, Snasel V, Nguyen HS, Vo B (2021) “A Method for Closed Frequent Subgraph Mining in a Single Large Graph”, IEEE Access

  30. Teixeira CHC, Fonseca AJ, Serafini M, Siganos G, Zaki MJ, Aboulnaga A (2015) “Arabesque: a system for distributed graph mining”, In: Proceedings of the 25th Symposium on Operating Systems Principles, pp. 425–440

  31. Talukder N, Zaki MJ (2016) A distributed approach for graph mining in massive networks. Data Min Knowl Disc 30(5):1024–1052

    Article  MathSciNet  MATH  Google Scholar 

  32. Zhao X, Chen Y, Xiao C, Ishikawa Y, Tang J (2016) Frequent subgraph mining based on Pregel. Comput J 59(8):1113–1128

    Article  Google Scholar 

  33. Kuramochi M, Karypis G (2005) Finding frequent patterns in a large sparse graph. Data Min Knowl Disc 11(3):243–271

    Article  MathSciNet  Google Scholar 

  34. Kuramochi M, Karypis G (2004) “Grew-a scalable frequent subgraph discovery algorithm,” in Fourth IEEE International Conference on Data Mining (ICDM’04). pp. 439–442

  35. Chen C, Yan X, Zhu F, Han J (2007) “gapprox: Mining frequent approximate patterns from a massive network”, In: Seventh IEEE International Conference on Data Mining (ICDM 2007). pp. 445–450

  36. Guzmán-Ponce A, Marcial-Romero JR, Valdovinos-Rosas RM, Sánchez-Garreta JS (2020) Weighted complete graphs for condensing data. Electron Notes Theoret Comput Sci 354:45–60

    Article  MathSciNet  MATH  Google Scholar 

  37. Liu N, Li D, Zhang Y, Li X (2020) Large-scale graph processing systems: a survey. Front Inform Technol Electr Eng 21(3):384–404

    Article  Google Scholar 

  38. Sun L, Huang X, Li R, Choi B, Xu J (2020) “Index-based intimate-core community search in large weighted graphs,” IEEE Transactions on Knowledge and Data Engineering

  39. Ramalingeswara Rao T, Ghosh SK, Goswami A (2021) Mining user–user communities for a weighted bipartite network using spark GraphFrames and Flink Gelly. J Supercomput 77(6):5984–6035

    Article  Google Scholar 

  40. Preti G, Lissandrini M, Mottin D, Velegrakis Y (2021) Mining patterns in graphs with multiple weights. Distrib Paral Datab 39(2):281–319

    Article  Google Scholar 

  41. Lin JC-W, Gan W, Fournier-Viger P, Hong T-P, Zhan J (2016) Efficient mining of high-utility itemsets using multiple minimum utility thresholds. Knowl-Based Syst 113:100–115

    Article  Google Scholar 

  42. Liu X, Wang X (2021) Cohesive subgraph identification in weighted bipartite graphs. Appl Sci 11(19):9051

    Article  Google Scholar 

  43. Raayatpanah MA, Khodayifar S, Weise T, Pardalos P (2022) A novel approach to subgraph selection with multiple weights on arcs. J Comb Optim 44(1):242–268

    Article  MathSciNet  MATH  Google Scholar 

  44. Hu Y, Xiao F (2022) “Time Series Forecasting Based on Fuzzy Cognitive Visibility Graph and Weighted Multi-Subgraph Similarity,” IEEE Transactions on Fuzzy Systems

  45. Li M-W, Xu D-Y, Geng J, Hong W-C (2022) A hybrid approach for forecasting ship motion using CNN–GRU–AM and GCWOA. Appl Soft Comput 114:108084

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Bac Le.

Ethics declarations

Competing interests

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Le, NT., Vo, B., Yun, U. et al. A novel approach to discover frequent weighted subgraphs using the average measure. Appl Intell 53, 19491–19504 (2023). https://doi.org/10.1007/s10489-023-04501-y

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10489-023-04501-y

Keywords

Navigation