Abstract
Recently, some algorithms have been reported for mining multi-graph Frequent Approximate Subgraphs (FASs). This kind of algorithm has a large applicability in social network analysis, image classification tasks and clustering, among others. However, all of them mine a large number of patterns. For this reason, some algorithms for mining a subset of FASs have been proposed. In this paper, we propose an efficient algorithm for mining a subset of FASs on multi-graph collections. Our proposed algorithm becomes an alternative for reducing the number of mined FASs by computing the clique FASs. It is important to highlight that, to the best of our knowledge; our proposal is the first algorithm for mining clique FASs on multi-graph collections. Our proposal is compared against other reported solutions and evaluated over several synthetic and real-world multi-graphs. In addition, we show the usefulness of the patterns mined by our proposal for image classification.
Similar content being viewed by others
Notes
A maximal FAS is a FAS that is not sub-isomorphic to another FAS [16].
A closed FAS is a FAS that is not sub-isomorphic to another FAS with the same frequency [38].
A clique FAS is a FAS such that every vertex is connected to every other by an edge [26].
Multi-edges [2] are different edges connecting the same pair of vertices (i.e., e and \(e^{\prime } \) are multi-edges if \(e \neq e^{\prime }\) and \(\phi _{G}(e) = \phi _{G}(e^{\prime }) = \{u,v\}\) such that u, v ∈ VG, u≠v).
PyGen is available in http://pywebgraph.sourceforge.net.
References
Acosta-Mendoza N, Carrasco-Ochoa J, Martínez-Trinidad J, Gago-Alonso A, Medina-pagola J (2015) A new method based on graph transformation for FAS mining in multi-graph collections. In: The 7th Mexican conference on pattern recognition (MCPR’2015), Pattern Recognition, vol LNCS 9116, pp 13–22. Springer
Acosta-Mendoza N, Gago-Alonso A, Carrasco-Ochoa J, Martínez-Trinidad J, Medina-Pagola J (2016) A new algorithm for approximate pattern mining in multi-graph collections. Knowl-Based Syst 109:198–207
Acosta-Mendoza N, Gago-Alonso A, Carrasco-Ochoa J, Martínez-Trinidad J, Medina-Pagola J (2017) Extension of canonical adjacency matrices for frequent approximate subgraph mining on multi-graph collections. Int J Pattern Recognit Artif Intell 31(7):25
Acosta-Mendoza N, Gago-Alonso A, Carrasco-Ochoa J, Martínez-Trinidad J, Medina-Pagola J (2018) A new multi-graph transformation method for frequent approximate subgraph mining. Cuban Journal of Informatics Sciences (RCCI) 12(3):1–16
Acosta-Mendoza N, Gago-Alonso A, Carrasco-Ochoa J, Martínez-Trinidad J, Medina-Pagola J (2018) Mining generalized closed frequent approximate subgraphs from multi-graph collections. Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications. (22nd Iberoamerican Congress, CIARP 2017), LNCS 10657, pp 10–18
Acosta-Mendoza N, Gago-Alonso A, Medina-Pagola J (2012) Frequent approximate subgraphs as features for graph-based image classification. Knowl-Based Syst 27:381–392
Anwar T, Liu C, Vu HL, Leckie C (2017) Partitioning road networks using density peak graphs: efficiency vs. accuracy. Inf Syst 64:22–40
Ayush A, Chhavi S, Minni J, Amita J (2018) Semi supervised graph based keyword extraction using lexical chains and centrality measures. Computación y Sistemas 22(4):1307–1315
Bhatia V, Rani R (2018) Ap-fsm: a parallel algorithm for approximate frequent subgraph mining using pregel. Expert Syst Appl 106:217–232
Brun L, Foggia P, Vento M (2018) Trends in graph-based representations for pattern recognition. Pattern Recogn Lett. https://doi.org/10.1016/j.patrec.2018.03.016
Chen Y, Zhao X, Lin X, Wang Y, Guo D (2019) Efficient mining of frequent patterns on uncertain graphs. IEEE Trans Knowl Data Eng 31(2):287–300
Conte D, Foggia P, Sansone C, Vento M (2004) Thirty years of graph matching in pattern recognition. IJPRAI 18(3):265–298
Dhifli W, Aridhi S, Nguifo EM (2017) Mr-simlab: scalable subgraph selection with label similarity for big data. Inf Syst 69:155–163
Dwivedi SP, Singh RS (2018) Error-tolerant graph matching using node contraction. Pattern Recogn Lett 116:58–64
Emmert-Streib F, Dehmer M, Shi Y (2016) Fifty years of graph matching, network alignment and network comparison. Inf Sci, 346–347:1–22
Flores-Garrido M, Carrasco-Ochoa J, Martínez-Trinidad J (2014) Mining maximal frequent patterns in a single graph using inexact matching. Knowl-Based Syst 66:166–177
Flores-Garrido M, Carrasco-Ochoa J, Martínez-Trinidad J (2018) Extensions to AGRAP algorithm for findidng a reduced set of inexact graph patterns. Int J Pattern Recognit Artif Intell, 32(01):1–28
Gao X, Xiao B, Tao D, Li X (2010) A survey of graph edit distance. Pattern Anal Applic 13 (1):113–129
Huan J, Wang W, Prins J (2003) Efficient mining of frequent subgraphs in the presence of isomorphism. In: The 3rd IEEE international conference on data mining, pp 549–552. Melbourne, FL
Hulianytsky L, Pavlenko A (2015) Ant colony optimization for time dependent shortest path problem in directed multigraph. International Journal Information Content and Processing 2(1):50–61
Ingalalli V, Ienco D, Poncelet P (2018) Mining frequent subgraphs in multigraphs. Inf Sci 451:50–66
Kropatsch W, Haxhimusa Y, Pizlo Z, Langs G (2005) Vision pyramids that do not grow too high. Pattern Recogn Lett 26:319–337
Morales-González A, García-Reyes EB (2013) Simple object recognition based on spatial relations and visual features represented using irregular pyramids. Multimed Tools Appl 63(3):875–897
Moussaoui M, Zaghdoud M, Akaichi J (2018) A new framework of frequent uncertain subgraph mining. Prog Comput Sci 126:413–422
Papalexakis E, Akoglu L, Ienco D (2013) Do more views of a graph help? community detection and clustering in multi-graphs. In: 16th international conference on information fusion, IEEE, Istanbul, Turkey, pp 899–905
Rahman M (2017) Basic graph theory. Undergraduate topics in computer science springer international publishing
Riesen K, Bunke H (2008) IAM graph database repository for graph based pattern recognition and machine learning. Structural, Syntactic, and Statistical Pattern Recognition, Joint IAPR International Workshop, SSPR and SPR 2008, pp 208–297
Santhi S, Padmaja P (2015) A survey of frequent subgraph mining algorithms for uncertain graph data. International Research Journal of Engineering and Technology (IRJET) 2(2):688–696
Setak M, Habibi M, Karimi H, Abedzadeh M (2015) A time-dependent vehicle routing problem in multigraph with FIFO property. J Manuf Syst 35:37–45
Shakibian H, Charkari NM, Jalili S (2018) Multi-kernel one class link prediction in heterogeneous complex networks. Appl Intell 48(10):3411–3428
Terroso-Saez F, Valdés-Vela M., Skarmeta-gómez A (2015) Online urban mobility detection based on velocity features. In: 17th international conference of big data analytics and knowledge discovery, Valencia, Spain, LNCS, vol 9263, pp 351–362
Verma A, Bharadwaj K (2017) Identifying community structure in a multi-relational network employing non-negative tensor factorization and GA k-means clustering. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery 7(1):1–22
Wang H, Ma W, Shi H, Xia C (2015) An interval algebra-based modeling and routing method in bus delay tolerant network. KSII Trans Internet Inf Syst 9(4):1376–1391
Wei D, Liu H, Qin Y (2015) Modeling cascade dynamics of railway networks under inclement weather. Transp Res E 80:95–122
Wu D, Ren J, Sheng L (2018) Uncertain maximal frequent subgraph mining algorithm based on adjacency matrix and weight. Int J Mach Learn Cybern 9(9):1445–1455
Wu X, Liu X, Chen Y, Shen J, Zhao W (2018) A graph based superpixel generation algorithm. Appl Intell 48(11):4485–4496
Yan X, Han J (2002) gSpan: graph-based substructure pattern mining. In: International conference on data mining. Maebashi, Japan
Yan X, Han J (2003) Closedgraph: mining closed frequent graph patterns. In: Proceedings of the 9th ACM SIGKDD of international conference on knowledge discovery and data mining (KDD), pp 286–295, Washington, DC
Yan X, Han J, Afshar R (2003) CloSpan: mining closed sequential patterns in large datasets. In: 3rd SIAM international conference on data mining, pp 166–177, SIAM, San Franciso, USA
Youssef R, Kacem A, Sevestre-Ghalila S, Chappard C (2015) Graph structuring of skeleton object for its high-level exploitation. In: Image analysis and recognition - 12th international conference, ICIAR 2015, Niagara Falls, ON, Canada, July 22-24, 2015, Proceedings, LNCS, vol 9164, pp 419–426. Springer
Acknowledgements
This work was partly supported by the National Council of Science and Technology of Mexico (CONACyT) through the scholarship grant 287045.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Acosta-Mendoza, N., Carrasco-Ochoa, J.A., Martínez-Trinidad, J.F. et al. Mining clique frequent approximate subgraphs from multi-graph collections. Appl Intell 50, 878–892 (2020). https://doi.org/10.1007/s10489-019-01564-8
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-019-01564-8