Skip to main content
Log in

Mining clique frequent approximate subgraphs from multi-graph collections

  • Published:
Applied Intelligence Aims and scope Submit manuscript

Abstract

Recently, some algorithms have been reported for mining multi-graph Frequent Approximate Subgraphs (FASs). This kind of algorithm has a large applicability in social network analysis, image classification tasks and clustering, among others. However, all of them mine a large number of patterns. For this reason, some algorithms for mining a subset of FASs have been proposed. In this paper, we propose an efficient algorithm for mining a subset of FASs on multi-graph collections. Our proposed algorithm becomes an alternative for reducing the number of mined FASs by computing the clique FASs. It is important to highlight that, to the best of our knowledge; our proposal is the first algorithm for mining clique FASs on multi-graph collections. Our proposal is compared against other reported solutions and evaluated over several synthetic and real-world multi-graphs. In addition, we show the usefulness of the patterns mined by our proposal for image classification.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

Notes

  1. A graph G is a multi-graph if it can have more than one edge connecting the same pair of vertices (multi-edges) and edges connecting a vertex with itself (loops) [2, 21].

  2. A maximal FAS is a FAS that is not sub-isomorphic to another FAS [16].

  3. A closed FAS is a FAS that is not sub-isomorphic to another FAS with the same frequency [38].

  4. A clique FAS is a FAS such that every vertex is connected to every other by an edge [26].

  5. Multi-edges [2] are different edges connecting the same pair of vertices (i.e., e and \(e^{\prime } \) are multi-edges if \(e \neq e^{\prime }\) and \(\phi _{G}(e) = \phi _{G}(e^{\prime }) = \{u,v\}\) such that u, vVG, uv).

  6. A canonical form is a unique representation for isomorphic graphs [19, 39]

  7. PyGen is available in http://pywebgraph.sourceforge.net.

  8. http://www.csc.liv.ac.uk/∼frans/KDD/Software/ImageGenerator/imageGenerator.html

References

  1. Acosta-Mendoza N, Carrasco-Ochoa J, Martínez-Trinidad J, Gago-Alonso A, Medina-pagola J (2015) A new method based on graph transformation for FAS mining in multi-graph collections. In: The 7th Mexican conference on pattern recognition (MCPR’2015), Pattern Recognition, vol LNCS 9116, pp 13–22. Springer

  2. Acosta-Mendoza N, Gago-Alonso A, Carrasco-Ochoa J, Martínez-Trinidad J, Medina-Pagola J (2016) A new algorithm for approximate pattern mining in multi-graph collections. Knowl-Based Syst 109:198–207

    Article  Google Scholar 

  3. Acosta-Mendoza N, Gago-Alonso A, Carrasco-Ochoa J, Martínez-Trinidad J, Medina-Pagola J (2017) Extension of canonical adjacency matrices for frequent approximate subgraph mining on multi-graph collections. Int J Pattern Recognit Artif Intell 31(7):25

    MathSciNet  Google Scholar 

  4. Acosta-Mendoza N, Gago-Alonso A, Carrasco-Ochoa J, Martínez-Trinidad J, Medina-Pagola J (2018) A new multi-graph transformation method for frequent approximate subgraph mining. Cuban Journal of Informatics Sciences (RCCI) 12(3):1–16

    Google Scholar 

  5. Acosta-Mendoza N, Gago-Alonso A, Carrasco-Ochoa J, Martínez-Trinidad J, Medina-Pagola J (2018) Mining generalized closed frequent approximate subgraphs from multi-graph collections. Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications. (22nd Iberoamerican Congress, CIARP 2017), LNCS 10657, pp 10–18

    Chapter  Google Scholar 

  6. Acosta-Mendoza N, Gago-Alonso A, Medina-Pagola J (2012) Frequent approximate subgraphs as features for graph-based image classification. Knowl-Based Syst 27:381–392

    Article  Google Scholar 

  7. Anwar T, Liu C, Vu HL, Leckie C (2017) Partitioning road networks using density peak graphs: efficiency vs. accuracy. Inf Syst 64:22–40

    Article  Google Scholar 

  8. Ayush A, Chhavi S, Minni J, Amita J (2018) Semi supervised graph based keyword extraction using lexical chains and centrality measures. Computación y Sistemas 22(4):1307–1315

    Google Scholar 

  9. Bhatia V, Rani R (2018) Ap-fsm: a parallel algorithm for approximate frequent subgraph mining using pregel. Expert Syst Appl 106:217–232

    Article  Google Scholar 

  10. Brun L, Foggia P, Vento M (2018) Trends in graph-based representations for pattern recognition. Pattern Recogn Lett. https://doi.org/10.1016/j.patrec.2018.03.016

  11. Chen Y, Zhao X, Lin X, Wang Y, Guo D (2019) Efficient mining of frequent patterns on uncertain graphs. IEEE Trans Knowl Data Eng 31(2):287–300

    Article  Google Scholar 

  12. Conte D, Foggia P, Sansone C, Vento M (2004) Thirty years of graph matching in pattern recognition. IJPRAI 18(3):265–298

    Google Scholar 

  13. Dhifli W, Aridhi S, Nguifo EM (2017) Mr-simlab: scalable subgraph selection with label similarity for big data. Inf Syst 69:155–163

    Article  Google Scholar 

  14. Dwivedi SP, Singh RS (2018) Error-tolerant graph matching using node contraction. Pattern Recogn Lett 116:58–64

    Article  Google Scholar 

  15. Emmert-Streib F, Dehmer M, Shi Y (2016) Fifty years of graph matching, network alignment and network comparison. Inf Sci, 346–347:1–22

    Article  MathSciNet  Google Scholar 

  16. Flores-Garrido M, Carrasco-Ochoa J, Martínez-Trinidad J (2014) Mining maximal frequent patterns in a single graph using inexact matching. Knowl-Based Syst 66:166–177

    Article  Google Scholar 

  17. Flores-Garrido M, Carrasco-Ochoa J, Martínez-Trinidad J (2018) Extensions to AGRAP algorithm for findidng a reduced set of inexact graph patterns. Int J Pattern Recognit Artif Intell, 32(01):1–28

    Article  Google Scholar 

  18. Gao X, Xiao B, Tao D, Li X (2010) A survey of graph edit distance. Pattern Anal Applic 13 (1):113–129

    Article  MathSciNet  Google Scholar 

  19. Huan J, Wang W, Prins J (2003) Efficient mining of frequent subgraphs in the presence of isomorphism. In: The 3rd IEEE international conference on data mining, pp 549–552. Melbourne, FL

  20. Hulianytsky L, Pavlenko A (2015) Ant colony optimization for time dependent shortest path problem in directed multigraph. International Journal Information Content and Processing 2(1):50–61

    Google Scholar 

  21. Ingalalli V, Ienco D, Poncelet P (2018) Mining frequent subgraphs in multigraphs. Inf Sci 451:50–66

    Article  MathSciNet  Google Scholar 

  22. Kropatsch W, Haxhimusa Y, Pizlo Z, Langs G (2005) Vision pyramids that do not grow too high. Pattern Recogn Lett 26:319–337

    Article  Google Scholar 

  23. Morales-González A, García-Reyes EB (2013) Simple object recognition based on spatial relations and visual features represented using irregular pyramids. Multimed Tools Appl 63(3):875–897

    Article  Google Scholar 

  24. Moussaoui M, Zaghdoud M, Akaichi J (2018) A new framework of frequent uncertain subgraph mining. Prog Comput Sci 126:413–422

    Article  Google Scholar 

  25. Papalexakis E, Akoglu L, Ienco D (2013) Do more views of a graph help? community detection and clustering in multi-graphs. In: 16th international conference on information fusion, IEEE, Istanbul, Turkey, pp 899–905

  26. Rahman M (2017) Basic graph theory. Undergraduate topics in computer science springer international publishing

  27. Riesen K, Bunke H (2008) IAM graph database repository for graph based pattern recognition and machine learning. Structural, Syntactic, and Statistical Pattern Recognition, Joint IAPR International Workshop, SSPR and SPR 2008, pp 208–297

    Google Scholar 

  28. Santhi S, Padmaja P (2015) A survey of frequent subgraph mining algorithms for uncertain graph data. International Research Journal of Engineering and Technology (IRJET) 2(2):688–696

    Google Scholar 

  29. Setak M, Habibi M, Karimi H, Abedzadeh M (2015) A time-dependent vehicle routing problem in multigraph with FIFO property. J Manuf Syst 35:37–45

    Article  Google Scholar 

  30. Shakibian H, Charkari NM, Jalili S (2018) Multi-kernel one class link prediction in heterogeneous complex networks. Appl Intell 48(10):3411–3428

    Article  Google Scholar 

  31. Terroso-Saez F, Valdés-Vela M., Skarmeta-gómez A (2015) Online urban mobility detection based on velocity features. In: 17th international conference of big data analytics and knowledge discovery, Valencia, Spain, LNCS, vol 9263, pp 351–362

  32. Verma A, Bharadwaj K (2017) Identifying community structure in a multi-relational network employing non-negative tensor factorization and GA k-means clustering. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery 7(1):1–22

    Google Scholar 

  33. Wang H, Ma W, Shi H, Xia C (2015) An interval algebra-based modeling and routing method in bus delay tolerant network. KSII Trans Internet Inf Syst 9(4):1376–1391

    Google Scholar 

  34. Wei D, Liu H, Qin Y (2015) Modeling cascade dynamics of railway networks under inclement weather. Transp Res E 80:95–122

    Article  Google Scholar 

  35. Wu D, Ren J, Sheng L (2018) Uncertain maximal frequent subgraph mining algorithm based on adjacency matrix and weight. Int J Mach Learn Cybern 9(9):1445–1455

    Article  Google Scholar 

  36. Wu X, Liu X, Chen Y, Shen J, Zhao W (2018) A graph based superpixel generation algorithm. Appl Intell 48(11):4485–4496

    Article  Google Scholar 

  37. Yan X, Han J (2002) gSpan: graph-based substructure pattern mining. In: International conference on data mining. Maebashi, Japan

  38. Yan X, Han J (2003) Closedgraph: mining closed frequent graph patterns. In: Proceedings of the 9th ACM SIGKDD of international conference on knowledge discovery and data mining (KDD), pp 286–295, Washington, DC

  39. Yan X, Han J, Afshar R (2003) CloSpan: mining closed sequential patterns in large datasets. In: 3rd SIAM international conference on data mining, pp 166–177, SIAM, San Franciso, USA

  40. Youssef R, Kacem A, Sevestre-Ghalila S, Chappard C (2015) Graph structuring of skeleton object for its high-level exploitation. In: Image analysis and recognition - 12th international conference, ICIAR 2015, Niagara Falls, ON, Canada, July 22-24, 2015, Proceedings, LNCS, vol 9164, pp 419–426. Springer

Download references

Acknowledgements

This work was partly supported by the National Council of Science and Technology of Mexico (CONACyT) through the scholarship grant 287045.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Niusvel Acosta-Mendoza.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Acosta-Mendoza, N., Carrasco-Ochoa, J.A., Martínez-Trinidad, J.F. et al. Mining clique frequent approximate subgraphs from multi-graph collections. Appl Intell 50, 878–892 (2020). https://doi.org/10.1007/s10489-019-01564-8

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10489-019-01564-8

Keywords

Navigation