Mine ’Em All: A Note on Mining All Graphs

Kuželka, Ondřej; Ramon, Jan

doi:10.1007/978-3-319-40566-7_8

Ondřej Kuželka¹⁶ &
Jan Ramon^17,18

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9575))

Included in the following conference series:

International Conference on Inductive Logic Programming

539 Accesses

Abstract

We study the complexity of the problem of enumerating all graphs with frequency at least 1 and computing their support. We show that there are hereditary classes of graphs for which the complexity of this problem depends on the order in which the graphs should be enumerated (e.g. from frequent to infrequent or from small to large). For instance, the problem can be solved with polynomial delay for databases of planar graphs when the enumerated graphs should be output from large to small but it cannot be solved even in incremental-polynomial time when the enumerated graphs should be output from most frequent to least frequent (unless P=NP).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Graph support measures and flows

Article 25 August 2022

Counting frequent patterns in large labeled graphs: a hypergraph-based approach

Article 05 May 2020

Diversified Pattern Mining on Large Graphs

Notes

1.
$\mathsf {ALL}_{F \rightarrow I}$ stands for ’frequent to infrequent’, $\mathsf {ALL}_{I \rightarrow F}$ stands for ’infrequent to frequent’, $\mathsf {ALL}_{S \rightarrow L}$ stands for ’small to large’ and $\mathsf {ALL}_{L \rightarrow S}$ stands for ’large to small’.
2.
The hardness for the $\mathsf {ALL}_{F \rightarrow I}$ problem follows from Theorem 1 together with Remark 1, whereas hardness of the $\mathsf {ALL}_{S \rightarrow L}$ problem follows from a simple modification of the proof of Theorem 1 where we use a hypothetic incr.-poly.-time algorithm for solving the $\mathsf {ALL}_{S \rightarrow L}$ problem and stop it after printing the first graph with more edges than H (or more vertices than H in the case of induced subgraph mining).
3.
The complexity of the $\mathsf {ALL}_{S \rightarrow L}$ problem in these cases remains an interesting open problem.
4.
Here, IDs are just some identifiers given to the database graphs.
5.
The reason why the function $\mathsf {KEYS}$ does not just return all the key graphs but rather a pointer to the linked list is that if it did otherwise, Algorithm 1 could never run with polynomial delay.
6.
Note that we just remove the pointer and do not actually “free” the memory occupied by the graphs. For the practical implementation, we used a programming language with a garbage collector.
7.
For homeomorphism, for instance, we could allow the subdivided edges to have different labels or, to the contrary, we could require them all to have the same label etc. Then another question could be how we should treat labels of vertices etc. While these considerations are interesting even for practice, they are out of the scope of this paper.
8.
The latter constraint on degrees would not probably be very relevant for bounded-treewidth graphs or planar graphs. This is because any graph of treewidth k must always have at least one vertex of degree at most k and because any planar graph must always have at least one vertex of degree at most 5. However, isomorphism algorithms are extremely fast in practice, despite not being polynomial in the worst case, so the algorithm that we would obtain for general graphs could still be practical.

References

Flum, J., Grohe, M.: Parameterized Complexity Theory. Springer, Berlin (2006)
MATH Google Scholar
Garey, M.R., Johnson, D.S., Tarjan, R.E.: The planar hamiltonian circuit problem is np-complete. SIAM J. Comput. 5(4), 704–714 (1976)
Article MathSciNet MATH Google Scholar
Grohe, M., Marx, D.: Structure theorem and isomorphism test for graphs with excluded topological subgraphs. CoRR, abs/1111.1109 (2011)
Google Scholar
Horváth, T., Otaki, K., Ramon, J.: Efficient frequent connected induced subgraph mining in graphs of bounded tree-width. In: Blockeel, H., Kersting, K., Nijssen, S., Železný, F. (eds.) ECML PKDD 2013, Part I. LNCS, vol. 8188, pp. 622–637. Springer, Heidelberg (2013)
Chapter Google Scholar
Horváth, T., Ramon, J.: Efficient frequent connected subgraph mining in graphs of bounded tree-width. Theoret. Comput. Sci. 411(31–33), 2784–2797 (2010)
Article MathSciNet MATH Google Scholar
Johnson, D.S., Yannakakis, M., Papadimitriou, C.H.: On generating all maximal independent sets. Inf. Process. Lett. 27(3), 119–123 (1988)
Article MathSciNet MATH Google Scholar
Lokshtanov, D., Pilipczuk, M., Pilipczuk, M., Saurabh, S.: Fixed-parameter tractable canonization and isomorphism test for graphs of bounded treewidth. In: 55th IEEE Annual Symposium on Foundations of Computer Science, FOCS 2014, pp. 186–195 (2014)
Google Scholar
Luks, E.M.: Isomorphism of graphs of bounded valence can be tested in polynomial time. J. Comput. Syst. Sci. 25(1), 42–65 (1982)
Article MathSciNet MATH Google Scholar
Marx, D., Pilipczuk, M.: Everything you always wanted to know about the parameterized complexity of subgraph isomorphism (but were afraid to ask). In: 31st International Symposium on Theoretical Aspects of Computer Science, STACS 2014, pp. 542–553 (2014)
Google Scholar
Matoušek, J., Thomas, R.: On the complexity of finding iso- and other morphisms for partial k-trees. Discrete Math. 108(1–3), 343–364 (1992)
Article MathSciNet MATH Google Scholar
Robertson, N., Seymour, P.D.: Graph minors. XIII. the disjoint paths problem. J. Comb. Theory Ser. B 63(1), 65–110 (1995)
Article MathSciNet MATH Google Scholar
Torán, J., Wagner, F.: The complexity of planar graph isomorphism. Bull. EATCS 97, 60–82 (2009)
MathSciNet Google Scholar

Download references

Acknowledgement

This work has been supported by ERC Starting Grant 240186 “MiGraNT: Mining Graphs and Networks, a Theory-based approach”. The first author is supported by a grant from the Leverhulme Trust (RPG-2014-164).

Author information

Authors and Affiliations

School of Computer Science & Informatics, Cardiff University, Cardiff, UK
Ondřej Kuželka
Department of Computer Science, KU Leuven, Leuven, Belgium
Jan Ramon
INRIA, Lille, France
Jan Ramon

Authors

Ondřej Kuželka
View author publications
You can also search for this author in PubMed Google Scholar
Jan Ramon
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ondřej Kuželka .

Editor information

Editors and Affiliations

National Institute of Informatics, Tokyo, Japan
Katsumi Inoue
Tokyo University of Science, Noda, Japan
Hayato Ohwada
Kyoto University, Kyoto, Japan
Akihiro Yamamoto

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kuželka, O., Ramon, J. (2016). Mine ’Em All: A Note on Mining All Graphs. In: Inoue, K., Ohwada, H., Yamamoto, A. (eds) Inductive Logic Programming. ILP 2015. Lecture Notes in Computer Science(), vol 9575. Springer, Cham. https://doi.org/10.1007/978-3-319-40566-7_8

Download citation

DOI: https://doi.org/10.1007/978-3-319-40566-7_8
Published: 10 June 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-40565-0
Online ISBN: 978-3-319-40566-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Mine ’Em All: A Note on Mining All Graphs

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Graph support measures and flows

Counting frequent patterns in large labeled graphs: a hypergraph-based approach

Diversified Pattern Mining on Large Graphs

Notes

References

Acknowledgement

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Mine ’Em All: A Note on Mining All Graphs

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Graph support measures and flows

Counting frequent patterns in large labeled graphs: a hypergraph-based approach

Diversified Pattern Mining on Large Graphs

Notes

References

Acknowledgement

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation