Skip to main content

Frequent Closed Subgraph Mining: A Multi-thread Approach

  • Conference paper
  • First Online:
Intelligent Information and Database Systems (ACIIDS 2022)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 13757))

Included in the following conference series:

Abstract

Frequent subgraph mining (FSM) is an interesting research field and has attracted a lot of attention from many researchers in recent years, in which closed subgraph mining is a new topic with many practical applications. In the field of graph mining, GraMi (GRAph MIning) is considered state-of-the-art, and many algorithms have been developed based on the improvement of this approach. In 2021, we proposed the CloGraMi algorithm based on GraMi to mine closed frequent subgraphs from a large graph rapidly and efficiently. However, with NP time complexity and extremely high cost in terms of running time, graph mining is always a challenging problem for all researchers. In this paper, we propose a parallel processing strategy aiming to improve the execution speed of our CloGraMi algorithm. Our experiments on six datasets, including both undirected and directed graphs, with different sizes, including large, medium and small, show that the new algorithm significantly reduces running time and improves performance, and has better performance compared to the original algorithm.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 89.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 119.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Elseidy, M., Abdelhamid, E., Skiadopoulos, S., Kalnis, P.: Grami: frequent subgraph and pattern mining in a single large graph. Proc. VLDB Endow. 7(7), 517–528 (2014)

    Article  Google Scholar 

  2. Nguyen, L.B.Q., Vo, B., Le, N.-T., Snasel, V., Zelinka, I.: Fast and scalable algorithms for mining subgraphs in a single large graph. Eng. Appl. Artif. Intell. 90, 103539 (2020)

    Article  Google Scholar 

  3. Nguyen, L.B.Q., Zelinka, I., Snasel, V., Nguyen, L.T.T., Vo, B.: Subgraph mining in a large graph: a review. Wiley Interdiscip. Rev. Data Min. Knowl. Discov. e1454 (2022)

    Google Scholar 

  4. Velampalli, S., Jonnalagedda, V.R.M.: Frequent subgraph mining algorithms: framework, classification, analysis, comparisons. In: Satapathy, S.C., Bhateja, V., Raju, K.S., Janakiramaiah, B. (eds.) Data Engineering and Intelligent Computing. AISC, vol. 542, pp. 327–336. Springer, Singapore (2018). https://doi.org/10.1007/978-981-10-3223-3_31

    Chapter  Google Scholar 

  5. Borrego, A., Ayala, D., Hernández, I., Rivero, C.R., Ruiz, D.: CAFE: knowledge graph completion using neighborhood-aware features. Eng. Appl. Artif. Intell. 103, 104302 (2021)

    Article  Google Scholar 

  6. Fox, J., Roughgarden, T., Seshadhri, C., Wei, F., Wein, N.: Finding cliques in social networks: a new distribution-free model. SIAM J. Comput. 49(2), 448–464 (2020)

    Article  MathSciNet  MATH  Google Scholar 

  7. Song, Q., Wu, Y., Lin, P., Dong, L.X., Sun, H.: Mining summaries for knowledge graph search. IEEE Trans. Knowl. Data Eng. 30(10), 1887–1900 (2018)

    Article  Google Scholar 

  8. Chehreghani, M.H., Abdessalem, T., Bifet, A., Bouzbila, M.: Sampling informative patterns from large single networks. Futur. Gener. Comput. Syst. 106, 653–658 (2020)

    Article  Google Scholar 

  9. Chen, Y., Zhao, X., Lin, X., Wang, Y., Guo, D.: Efficient mining of frequent patterns on uncertain graphs. IEEE Trans. Knowl. Data Eng. 31(2), 287–300 (2018)

    Article  Google Scholar 

  10. Iqbal, R., Doctor, F., More, B., Mahmud, S., Yousuf, U.: Big data analytics and computational intelligence for cyber-physical systems: recent trends and state of the art applications. Futur. Gener. Comput. Syst. 105, 766–778 (2020)

    Article  Google Scholar 

  11. Demetrovics, J., Quang, H.M., Anh, N.V., Thi, V.D.: An optimization of closed frequent subgraph mining algorithm. Cybern. Inf. Technol. 17(1), 3–15 (2017)

    MathSciNet  Google Scholar 

  12. Nguyen, L.B.Q., Nguyen, L.T.T., Zelinka, I., Snasel, V., Nguyen, H.S., Vo, B.: A method for closed frequent subgraph mining in a single large graph. IEEE Access (2021)

    Google Scholar 

  13. Karabadji, N.E.I., Aridhi, S., Seridi, H.: A closed frequent subgraph mining algorithm in unique edge label graphs. In: International Conference on Machine Learning and Data Mining in Pattern Recognition, pp. 43–57 (2016)

    Google Scholar 

  14. Yan, X., Han, J.: Closegraph: mining closed frequent graph patterns. In: Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 286–295 (2003)

    Google Scholar 

  15. Bendimerad, A., Plantevit, M., Robardet, C.: Mining exceptional closed patterns in attributed graphs. Knowl. Inf. Syst. 56(1), 1–25 (2017). https://doi.org/10.1007/s10115-017-1109-2

    Article  MATH  Google Scholar 

  16. Acosta-Mendoza, N., Gago-Alonso, A., Carrasco-Ochoa, J.A., Martínez-Trinidad, J.F., Medina-Pagola, J.E.: Mining generalized closed patterns from multi-graph collections. In: Iberoamerican Congress on Pattern Recognition, pp. 10–18 (2017)

    Google Scholar 

  17. Jia, Y., Zhang, J., Huan, J.: An efficient graph-mining method for complicated and noisy data with real-world applications. Knowl. Inf. Syst. 28(2), 423–447 (2011)

    Article  Google Scholar 

  18. Nejad, S.J., AhmadiAbkenari, F., Bayat, P.: A combination of frequent pattern mining and graph traversal approaches for aspect elicitation in customer reviews. IEEE Access 8, 151908–151925 (2020)

    Article  Google Scholar 

  19. Jie, F., Wang, C., Chen, F., Li, L., Wu, X.: A framework for subgraph detection in interdependent networks via graph block-structured optimization. IEEE Access 8, 157800–157818 (2020)

    Article  Google Scholar 

  20. Guan, H., Zhao, Q., Ren, Y., Nie, W.: View-based 3D model retrieval by joint subgraph learning and matching. IEEE Access 8, 19830–19841 (2020)

    Article  Google Scholar 

  21. Karwa, V., Raskhodnikova, S., Smith, A., Yaroslavtsev, G.: Private analysis of graph structure. ACM Trans. Database Syst. 39(3), 1–33 (2014)

    Article  MathSciNet  MATH  Google Scholar 

  22. Nguyen, L., et al.: An efficient and scalable approach for mining subgraphs in a single large graph. Appl. Intell. 1–15 (2022)

    Google Scholar 

  23. Nguyen, L.B.Q., Zelinka, I., Diep, Q.B.: CCGraMi: an effective method for mining frequent subgraphs in a single large graph. MENDEL 27(2), 90–99 (2021)

    Article  Google Scholar 

  24. Ullmann, J.R.: An algorithm for subgraph isomorphism. J. ACM 23(1), 31–42 (1976)

    Article  MathSciNet  Google Scholar 

  25. Le, N.-T., Vo, B., Nguyen, L.B.Q., Fujita, H., Le, B.: Mining weighted subgraphs in a single large graph. Inf. Sci. (Ny) 514, 149–165 (2020)

    Article  MathSciNet  MATH  Google Scholar 

  26. Seeland, M., Girschick, T., Buchwald, F., Kramer, S.: Online structural graph clustering using frequent subgraph mining. In: Joint European Conference on Machine Learning and Knowledge Discovery in Databases, pp. 213–228 (2010)

    Google Scholar 

  27. Abdelhamid, E., Abdelaziz, I., Kalnis, P., Khayyat, Z., Jamour, F.: Scalemine: scalable parallel frequent subgraph mining in a single large graph. In: SC’16: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, pp. 716–727 (2016)

    Google Scholar 

  28. Yan, X., Han, J.: gspan: Graph-based substructure pattern mining. In: 2002 IEEE International Conference on Data Mining, Proceedings, pp. 721–724 (2002)

    Google Scholar 

  29. Dhulipala, L., Blelloch, G.E., Shun, J.: Theoretically efficient parallel graph algorithms can be fast and scalable. ACM Trans. Parallel Comput. 8(1), 1–70 (2021)

    Article  MathSciNet  Google Scholar 

  30. Thomas, L.T., Valluri, S.R., Karlapalem, K.: Margin: Maximal frequent subgraph mining. ACM Trans. Knowl. Discov. from Data 4(3), 1–42 (2010)

    Article  Google Scholar 

  31. Farag, A., Abdelkader, H., Salem, R.: Parallel graph-based anomaly detection technique for sequential data. J. King Saud Univ. Inf. Sci. 34(1), 1446–1454 (2022)

    Google Scholar 

  32. Teixeira, C.H.C., Fonseca, A.J., Serafini, M., Siganos, G., Zaki, M.J., Aboulnaga, A.: Arabesque: a system for distributed graph mining. In: Proceedings of the 25th Symposium on Operating Systems Principles, pp. 425–440 (2015)

    Google Scholar 

  33. Qiao, F., Zhang, X., Li, P., Ding, Z., Jia, S., Wang, H.: A parallel approach for frequent subgraph mining in a single large graph using spark. Appl. Sci. 8(2), 230 (2018)

    Article  Google Scholar 

  34. Kuramochi, M., Karypis, G.: Finding frequent patterns in a large sparse graph. Data Min. Knowl. Discov. 11(3), 243–271 (2005)

    Article  MathSciNet  Google Scholar 

  35. Kepner, J.: Keynote talk: large scale parallel sparse matrix streaming graph/network analysis. In: Proceedings of the 34th ACM Symposium on Parallelism in Algorithms and Architectures, p. 61 (2022)

    Google Scholar 

  36. Bouhenni, S., Yahiaoui, S., Nouali-Taboudjemat, N., Kheddouci, H.: A survey on distributed graph pattern matching in massive graphs. ACM Comput. Surv. 54(2), 1–35 (2021)

    Article  Google Scholar 

  37. Güvenoglu, B., Bostanoglu, B.E.: A qualitative survey on frequent subgraph mining. Open Comput. Sci. 8(1), 194–209 (2018)

    Article  Google Scholar 

  38. FournierViger, P., et al.: A survey of pattern mining in dynamic graphs. Wiley Interdiscip. Rev. Data Min. Knowl. Discov. 10(6), e1372 (2020)

    Google Scholar 

Download references

Acknowledgement

This work was supported by Institute for Computational Science and Technology (ICST) – Ho Chi Minh City and the Department of Science and Technology (DOST) – Ho Chi Minh City under grant no. 23/2021/HĐ-QKHCN.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Bay Vo .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Nguyen, L.B.Q., Le, NT., Nguyen, H.S., Pham, T., Vo, B. (2022). Frequent Closed Subgraph Mining: A Multi-thread Approach. In: Nguyen, N.T., Tran, T.K., Tukayev, U., Hong, TP., Trawiński, B., Szczerbicki, E. (eds) Intelligent Information and Database Systems. ACIIDS 2022. Lecture Notes in Computer Science(), vol 13757. Springer, Cham. https://doi.org/10.1007/978-3-031-21743-2_6

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-21743-2_6

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-21742-5

  • Online ISBN: 978-3-031-21743-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics