FSMS: A Frequent Subgraph Mining Algorithm Using Mapping Sets

Abedijaberi, Armita; Leopold, Jennifer

doi:10.1007/978-3-319-41920-6_58

Armita Abedijaberi¹⁴ &
Jennifer Leopold¹⁴

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9729))

Included in the following conference series:

International Conference on Machine Learning and Data Mining in Pattern Recognition

3154 Accesses
1 Citations

Abstract

With the increasing prevalence of data that model relationships between various entities, the use of a graph-based representation for real-world problems offers a logical strategy for organizing information and making knowledge-based decisions. In particular, often it is useful to identify the most frequent patterns or relationships amongst the data in a graph, which requires finding frequent subgraphs. Algorithms for addressing that problem have been proposed for over 15 years. In the worst case, all subgraphs in the graph must be examined, which is exponential in complexity, and subgraph isomorphisms must be computed, which is an NP-complete problem. Frequent subgraph algorithms may attempt to improve the actual runtime performance by reducing the size of the search space, avoiding duplicate comparisons, and/or minimizing the amount of memory required for compiling intermediate results. Herein we present a frequent subgraph mining algorithm that leverages mapping sets in order to eliminate the isomorphism computation during the search for non-edge-disjoint frequent subgraphs. Experimental results show that absence of isomorphism computation leads to much faster frequent subgraph detection when there is a need to identify all occurrences of those subgraphs.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Optimized Candidate Generation for Frequent Subgraph Mining in a Single Graph

Frequent Generalized Subgraph Mining via Graph Edit Distances

An efficient and scalable approach for mining subgraphs in a single large graph

Article 06 April 2022

References

Google. Inside Search: Algorithms [Online] Written (2012) (accessed: 04–30-2015)
Google Scholar
Clement, A.: NSA surveillance: exploring the geographies of internet interception. In: iConference 2014 Proceedings, pp. 412–425 (2014). doi:10.9776/14119
Rhodes, D.R., Yu, J., Shanker, K., Deshpande, N., Varambally, R., Ghosh, D., Barrette, T., Pander, A., Chinnaiyan, A.M.: ONCOMINE: A cancer microarray database and integrated datamining platform. Neoplasia 6(1), 1–6 (2004). ISSN: 1476-5586 (accessed 04-30-2015). http://dx.doi.org/10.1016/S1476-5586(04)80047-2
Article Google Scholar
Broder, A., Kumar, R., Maghoul, F., Raghavan, P., Rajagopalan, S., Stata, R., Tomkins, A., Wiener, J.: Graph structure in the web. Computer Networks 33 (2000)
Google Scholar
Bader, D.A., Madduri, K.: A graph-theoretic analysis of the human proteininteraction network using multicore parallel algorithms. Parallel Comput. (2008)
Google Scholar
Faloutsos, M., Faloutsos, P., Faloutsos, C.: On powerlaw relationships of the internet topology. In: SIGCOMM, pp. 251–262, August-September (1999)
Google Scholar
Kuramochi, M., Karypis, G.: Finding frequent patterns in a large sparse graph*. Data Mining and Knowledge Discovery 11(3), 243–271 (2005)
Article MathSciNet Google Scholar
Gholami, M., Salajegheh, A.: A survey on algorithms of mining frequent subgraphs. International Journal of Engineering Inventions 1(5), 60–63 (2012)
Google Scholar
Inokuchi, A., Washio, T., Motoda, H.: An apriori-based algorithm for mining frequent substructures from graph data. In: Zighed, D.A., Komorowski, J., Żytkow, J.M. (eds.) PKDD 2000. LNCS (LNAI), vol. 1910, pp. 13–23. Springer, Heidelberg (2000)
Chapter Google Scholar
Kuramochi, M., Karypis, G.: Frequent subgraph discovery. In: Proceedings of the 2001 IEEE International Conference on Data Mining. IEEE Computer Society (2001)
Google Scholar
Yan, X., Han, J.W.: gSpan: graph-based substructure pattern mining. In: Proceedings of the 2002 IEEE International Conference on Data Mining. IEEE Computer Society (2002)
Google Scholar
Elseidy, M., Abdelhamid, E., Skiadopoulos, S., Kalnis, P.: GRAMI: frequent subgraph and pattern mining in a single large graph. In: Proceedings of the VLDB Endowment, pp. 517–528 (2014)
Google Scholar
Kuramochi, M., Karypis, G.: GREW - a scalable frequent subgraph discovery algorithm. In: Proceedings of ICDM, pp. 439–442 (2004)
Google Scholar
Lu, W., et al.: Efficiently extracting frequent subgraphs using mapreduce. In: 2013 IEEE International Conference on Big Data. IEEE (2013)
Google Scholar
National Center for Biotechnology Information. PubChem BioAssay Database; AID=2299, Source=Scripps Research Institute Molecular Screening Center (accessed February 22, 2011). http://pubchem.ncbi.nlm.nih.gov/assay/assay.cgi?aid=2299
Cisco. Cisco global cloud index: forecast and methodology 2013–2018 White Paper. [Online]. Written (2014) (accessed 04/27/2015). http://www.cisco.com/c/en/ussolutions/collateral/serviceprovider/global-cloud-index-gci/CloudIndexWhitePaper.html
Kang, U., Faloutsos, C.: Big graph mining: algorithms and discoveries. SIGKDDD Explorations 14(2), 29–36 (2013)
Article Google Scholar
Bhuiyan, M., Al Hasan, M.: MiRage: An iterative MapReduce based subgraph mining algorithm, July 22, 2013 (accessed 05/31/2015). arXiv:1307.5894
Puolamiki, K., Papapetrou, P., Lijffitj, J.: Visually controllable data mining methods. In: Proceedings of the 2010 IEEE International Conference on Data Mining Workshops, pp. 409–417, December 2010
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, Missouri University of Science and Technology, Rolla, MO, 65401, USA
Armita Abedijaberi & Jennifer Leopold

Authors

Armita Abedijaberi
View author publications
You can also search for this author in PubMed Google Scholar
Jennifer Leopold
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Armita Abedijaberi .

Editor information

Editors and Affiliations

IBaI, Inst of Comp Vision and applied Comp Sci, Leipzig, Sachsen, Germany
Petra Perner

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Abedijaberi, A., Leopold, J. (2016). FSMS: A Frequent Subgraph Mining Algorithm Using Mapping Sets. In: Perner, P. (eds) Machine Learning and Data Mining in Pattern Recognition. MLDM 2016. Lecture Notes in Computer Science(), vol 9729. Springer, Cham. https://doi.org/10.1007/978-3-319-41920-6_58

Download citation

DOI: https://doi.org/10.1007/978-3-319-41920-6_58
Published: 28 June 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-41919-0
Online ISBN: 978-3-319-41920-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

FSMS: A Frequent Subgraph Mining Algorithm Using Mapping Sets

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Optimized Candidate Generation for Frequent Subgraph Mining in a Single Graph

Frequent Generalized Subgraph Mining via Graph Edit Distances

An efficient and scalable approach for mining subgraphs in a single large graph

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

FSMS: A Frequent Subgraph Mining Algorithm Using Mapping Sets

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Optimized Candidate Generation for Frequent Subgraph Mining in a Single Graph

Frequent Generalized Subgraph Mining via Graph Edit Distances

An efficient and scalable approach for mining subgraphs in a single large graph

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation