Abstract
Frequent subgraph mining (FSM) is an interesting research field and has attracted a lot of attention from many researchers in recent years, in which closed subgraph mining is a new topic with many practical applications. In the field of graph mining, GraMi (GRAph MIning) is considered state-of-the-art, and many algorithms have been developed based on the improvement of this approach. In 2021, we proposed the CloGraMi algorithm based on GraMi to mine closed frequent subgraphs from a large graph rapidly and efficiently. However, with NP time complexity and extremely high cost in terms of running time, graph mining is always a challenging problem for all researchers. In this paper, we propose a parallel processing strategy aiming to improve the execution speed of our CloGraMi algorithm. Our experiments on six datasets, including both undirected and directed graphs, with different sizes, including large, medium and small, show that the new algorithm significantly reduces running time and improves performance, and has better performance compared to the original algorithm.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Elseidy, M., Abdelhamid, E., Skiadopoulos, S., Kalnis, P.: Grami: frequent subgraph and pattern mining in a single large graph. Proc. VLDB Endow. 7(7), 517–528 (2014)
Nguyen, L.B.Q., Vo, B., Le, N.-T., Snasel, V., Zelinka, I.: Fast and scalable algorithms for mining subgraphs in a single large graph. Eng. Appl. Artif. Intell. 90, 103539 (2020)
Nguyen, L.B.Q., Zelinka, I., Snasel, V., Nguyen, L.T.T., Vo, B.: Subgraph mining in a large graph: a review. Wiley Interdiscip. Rev. Data Min. Knowl. Discov. e1454 (2022)
Velampalli, S., Jonnalagedda, V.R.M.: Frequent subgraph mining algorithms: framework, classification, analysis, comparisons. In: Satapathy, S.C., Bhateja, V., Raju, K.S., Janakiramaiah, B. (eds.) Data Engineering and Intelligent Computing. AISC, vol. 542, pp. 327–336. Springer, Singapore (2018). https://doi.org/10.1007/978-981-10-3223-3_31
Borrego, A., Ayala, D., Hernández, I., Rivero, C.R., Ruiz, D.: CAFE: knowledge graph completion using neighborhood-aware features. Eng. Appl. Artif. Intell. 103, 104302 (2021)
Fox, J., Roughgarden, T., Seshadhri, C., Wei, F., Wein, N.: Finding cliques in social networks: a new distribution-free model. SIAM J. Comput. 49(2), 448–464 (2020)
Song, Q., Wu, Y., Lin, P., Dong, L.X., Sun, H.: Mining summaries for knowledge graph search. IEEE Trans. Knowl. Data Eng. 30(10), 1887–1900 (2018)
Chehreghani, M.H., Abdessalem, T., Bifet, A., Bouzbila, M.: Sampling informative patterns from large single networks. Futur. Gener. Comput. Syst. 106, 653–658 (2020)
Chen, Y., Zhao, X., Lin, X., Wang, Y., Guo, D.: Efficient mining of frequent patterns on uncertain graphs. IEEE Trans. Knowl. Data Eng. 31(2), 287–300 (2018)
Iqbal, R., Doctor, F., More, B., Mahmud, S., Yousuf, U.: Big data analytics and computational intelligence for cyber-physical systems: recent trends and state of the art applications. Futur. Gener. Comput. Syst. 105, 766–778 (2020)
Demetrovics, J., Quang, H.M., Anh, N.V., Thi, V.D.: An optimization of closed frequent subgraph mining algorithm. Cybern. Inf. Technol. 17(1), 3–15 (2017)
Nguyen, L.B.Q., Nguyen, L.T.T., Zelinka, I., Snasel, V., Nguyen, H.S., Vo, B.: A method for closed frequent subgraph mining in a single large graph. IEEE Access (2021)
Karabadji, N.E.I., Aridhi, S., Seridi, H.: A closed frequent subgraph mining algorithm in unique edge label graphs. In: International Conference on Machine Learning and Data Mining in Pattern Recognition, pp. 43–57 (2016)
Yan, X., Han, J.: Closegraph: mining closed frequent graph patterns. In: Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 286–295 (2003)
Bendimerad, A., Plantevit, M., Robardet, C.: Mining exceptional closed patterns in attributed graphs. Knowl. Inf. Syst. 56(1), 1–25 (2017). https://doi.org/10.1007/s10115-017-1109-2
Acosta-Mendoza, N., Gago-Alonso, A., Carrasco-Ochoa, J.A., Martínez-Trinidad, J.F., Medina-Pagola, J.E.: Mining generalized closed patterns from multi-graph collections. In: Iberoamerican Congress on Pattern Recognition, pp. 10–18 (2017)
Jia, Y., Zhang, J., Huan, J.: An efficient graph-mining method for complicated and noisy data with real-world applications. Knowl. Inf. Syst. 28(2), 423–447 (2011)
Nejad, S.J., AhmadiAbkenari, F., Bayat, P.: A combination of frequent pattern mining and graph traversal approaches for aspect elicitation in customer reviews. IEEE Access 8, 151908–151925 (2020)
Jie, F., Wang, C., Chen, F., Li, L., Wu, X.: A framework for subgraph detection in interdependent networks via graph block-structured optimization. IEEE Access 8, 157800–157818 (2020)
Guan, H., Zhao, Q., Ren, Y., Nie, W.: View-based 3D model retrieval by joint subgraph learning and matching. IEEE Access 8, 19830–19841 (2020)
Karwa, V., Raskhodnikova, S., Smith, A., Yaroslavtsev, G.: Private analysis of graph structure. ACM Trans. Database Syst. 39(3), 1–33 (2014)
Nguyen, L., et al.: An efficient and scalable approach for mining subgraphs in a single large graph. Appl. Intell. 1–15 (2022)
Nguyen, L.B.Q., Zelinka, I., Diep, Q.B.: CCGraMi: an effective method for mining frequent subgraphs in a single large graph. MENDEL 27(2), 90–99 (2021)
Ullmann, J.R.: An algorithm for subgraph isomorphism. J. ACM 23(1), 31–42 (1976)
Le, N.-T., Vo, B., Nguyen, L.B.Q., Fujita, H., Le, B.: Mining weighted subgraphs in a single large graph. Inf. Sci. (Ny) 514, 149–165 (2020)
Seeland, M., Girschick, T., Buchwald, F., Kramer, S.: Online structural graph clustering using frequent subgraph mining. In: Joint European Conference on Machine Learning and Knowledge Discovery in Databases, pp. 213–228 (2010)
Abdelhamid, E., Abdelaziz, I., Kalnis, P., Khayyat, Z., Jamour, F.: Scalemine: scalable parallel frequent subgraph mining in a single large graph. In: SC’16: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, pp. 716–727 (2016)
Yan, X., Han, J.: gspan: Graph-based substructure pattern mining. In: 2002 IEEE International Conference on Data Mining, Proceedings, pp. 721–724 (2002)
Dhulipala, L., Blelloch, G.E., Shun, J.: Theoretically efficient parallel graph algorithms can be fast and scalable. ACM Trans. Parallel Comput. 8(1), 1–70 (2021)
Thomas, L.T., Valluri, S.R., Karlapalem, K.: Margin: Maximal frequent subgraph mining. ACM Trans. Knowl. Discov. from Data 4(3), 1–42 (2010)
Farag, A., Abdelkader, H., Salem, R.: Parallel graph-based anomaly detection technique for sequential data. J. King Saud Univ. Inf. Sci. 34(1), 1446–1454 (2022)
Teixeira, C.H.C., Fonseca, A.J., Serafini, M., Siganos, G., Zaki, M.J., Aboulnaga, A.: Arabesque: a system for distributed graph mining. In: Proceedings of the 25th Symposium on Operating Systems Principles, pp. 425–440 (2015)
Qiao, F., Zhang, X., Li, P., Ding, Z., Jia, S., Wang, H.: A parallel approach for frequent subgraph mining in a single large graph using spark. Appl. Sci. 8(2), 230 (2018)
Kuramochi, M., Karypis, G.: Finding frequent patterns in a large sparse graph. Data Min. Knowl. Discov. 11(3), 243–271 (2005)
Kepner, J.: Keynote talk: large scale parallel sparse matrix streaming graph/network analysis. In: Proceedings of the 34th ACM Symposium on Parallelism in Algorithms and Architectures, p. 61 (2022)
Bouhenni, S., Yahiaoui, S., Nouali-Taboudjemat, N., Kheddouci, H.: A survey on distributed graph pattern matching in massive graphs. ACM Comput. Surv. 54(2), 1–35 (2021)
Güvenoglu, B., Bostanoglu, B.E.: A qualitative survey on frequent subgraph mining. Open Comput. Sci. 8(1), 194–209 (2018)
FournierViger, P., et al.: A survey of pattern mining in dynamic graphs. Wiley Interdiscip. Rev. Data Min. Knowl. Discov. 10(6), e1372 (2020)
Acknowledgement
This work was supported by Institute for Computational Science and Technology (ICST) – Ho Chi Minh City and the Department of Science and Technology (DOST) – Ho Chi Minh City under grant no. 23/2021/HĐ-QKHCN.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Nguyen, L.B.Q., Le, NT., Nguyen, H.S., Pham, T., Vo, B. (2022). Frequent Closed Subgraph Mining: A Multi-thread Approach. In: Nguyen, N.T., Tran, T.K., Tukayev, U., Hong, TP., Trawiński, B., Szczerbicki, E. (eds) Intelligent Information and Database Systems. ACIIDS 2022. Lecture Notes in Computer Science(), vol 13757. Springer, Cham. https://doi.org/10.1007/978-3-031-21743-2_6
Download citation
DOI: https://doi.org/10.1007/978-3-031-21743-2_6
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-21742-5
Online ISBN: 978-3-031-21743-2
eBook Packages: Computer ScienceComputer Science (R0)