Skip to main content

Discovery of Frequent Graph Patterns that Consist of the Vertices with the Complex Structures

  • Conference paper
Mining Complex Data (MCD 2007)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4944))

Included in the following conference series:

Abstract

In some real world applications, the data can be represented naturally in a special kind of graphs in which each vertex consists of a set of (structured) data such as item sets, sequences and so on. One of the typical examples is metabolic pathways in bioinformatics. Metabolic pathway is represented in a graph structured data in which each vertex corresponds to an enzyme described by a set of various kinds of properties such as amino acid sequence, enzyme number and so on. We call this kind of complex graphs multi-structured graphs. In this paper, we propose an algorithm named FMG for mining frequent patterns in multi-structured graphs. In FMG, while the external structure will be expanded by the same mechanism of conventional graph miners, the internal structure will be enumerated by the algorithms suitable for its structure. In addition, FMG employs novel pruning techniques to exclude uninteresting patterns. The preliminary experimental results with real datasets show the effectiveness of the proposed algorithm.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Borgelt, C.: On Canonical Forms for Frequent Graph Mining. In: Proc. of the 3rd International Workshop on Mining Graphs, pp. 1–12 (2005)

    Google Scholar 

  2. Borgelt, C., Berthold, M.R.: Mining molecular fragments: Finding relevant substructures of molecules. In: Proc. of the 2nd IEEE International Conference on Data Mining, pp. 51–58 (2002)

    Google Scholar 

  3. Chen, Y.L., Kao, H., Ko, M.: Mining DAG Patterns from DAG Databases. In: Li, Q., Wang, G., Feng, L. (eds.) WAIM 2004. LNCS, vol. 3129, pp. 579–588. Springer, Heidelberg (2004)

    Google Scholar 

  4. De Raedt, L., Washio, T., Kok, J.N. (eds.): Advances in Mining Graphs, Trees and Sequences, Frontiers in Artificial Intelligence and Applications, vol. 124. IOS Press, Amsterdam (2005)

    Google Scholar 

  5. Huan, J., Wang, W., Prins, J.: Efficient mining of frequent subgraphs in the presence of isomorphism. In: Proc. of the 3rd IEEE International Conference on Data Mining, pp. 549–552 (2003)

    Google Scholar 

  6. Inokuchi, A., Washio, T., Motoda, H.: An Apriori-Based Algorithm for Mining Frequent Substructures from Graph Data. In: Proc. of the 4th European Conference on Principles and Practice of Knowledge Discovery in Databases, pp. 13–23 (2000)

    Google Scholar 

  7. Inokuchi, A., Washio, T., Motoda, H.: Complete mining of frequent patterns from graphs: Mining graph data. Machine Learning 50, 321–354 (2003)

    Article  MATH  Google Scholar 

  8. Knijf, J.D.: FAT-miner: Mining Frequent Attribute Trees. In: Proc. of the 2007 ACM symposium on Applied computing, pp. 417–422 (2007)

    Google Scholar 

  9. Koyuturk, M., Grama, A., Szpankowski, W.: An efficient algorithm for detecting frequent subgraphs in biological networks. Bioinformatics 20, 200–207 (2004)

    Article  Google Scholar 

  10. Kuramochi, M., Karypis, G.: Frequent subgraph discovery. In: Proc. of the 1st IEEE International Conference on Data Mining, pp. 313–320 (2001)

    Google Scholar 

  11. Nijssen, S., Kok, J.N.: A Quickstart in Frequent Structure Mining can make a Difference. In: Proc. of 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 647–652 (2004)

    Google Scholar 

  12. Pei, J., Han, J., Mortazavi-Asl, B., Pnto, H., Chen, Q., Dayal, U., Hsu, M.: PrefixSpan: Mining Sequential Patterns Efficiently by Prefix-Projected Pattern Growth. In: Proc. of International Conference on Data Engineering, pp. 215–224 (2001)

    Google Scholar 

  13. Sato, I., Nakagawa, H.: Semi-structure Mining Method for Text Mining with a Chunk-Based Dependency Structure. In: Zhou, Z.-H., Li, H., Yang, Q. (eds.) PAKDD 2007. LNCS (LNAI), vol. 4426, pp. 777–784. Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  14. Uno, T., Asai, T., Uchida, Y., Arimura, H.: An Efficient Algorithm for Enumerating Closed Patterns in Transaction Databases. In: Proc. of Discovery Science (2004)

    Google Scholar 

  15. Washio, T., Motoda, H.: State of the art of graph-based data mining. SIGKDD Explorations 5(1), 59–68 (2003)

    Article  Google Scholar 

  16. Yan, X., Han, J.: CloseGraph: Mining Closed Frequent Graph Patterns. In: Proc. of the 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 286–295 (2003)

    Google Scholar 

  17. Yan, X., Han, J.: gSpan: Graph-Based Substructure Pattern Mining. In: Proc. of the 2nd IEEE International Conference on Data Mining, pp. 721–724 (2002)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Zbigniew W. Raś Shusaku Tsumoto Djamel Zighed

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Yamamoto, T., Ozaki, T., Ohkawa, T. (2008). Discovery of Frequent Graph Patterns that Consist of the Vertices with the Complex Structures. In: Raś, Z.W., Tsumoto, S., Zighed, D. (eds) Mining Complex Data. MCD 2007. Lecture Notes in Computer Science(), vol 4944. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-68416-9_12

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-68416-9_12

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-68415-2

  • Online ISBN: 978-3-540-68416-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics