Knowledge Discovery from Structured Data by Beam-Wise Graph-Based Induction

Matsuda, Takashi; Motoda, Hiroshi; Yoshida, Tetsuya; Washio, Takashi

doi:10.1007/3-540-45683-X_29

Knowledge Discovery from Structured Data by Beam-Wise Graph-Based Induction

Takashi Matsuda³,
Hiroshi Motoda³,
Tetsuya Yoshida³ &
…
Takashi Washio³

Conference paper
First Online: 01 January 2002

857 Accesses
9 Citations

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2417))

Abstract

A machine learning technique called Graph-Based Induction (GBI) extracts typical patterns from graph data by stepwise pair expansion (pairwise chunking). Because of its greedy search strategy, it is very efficient but suffers from incompleteness of search. We improved its search capability without imposing much computational complexity by incorporating the idea of beam search. Additional improvement is made to extract patterns that are more discriminative than those simply occurring frequently, and to enumerate identical patterns accurately based on the notion of canonical labeling. This new algorithm was implemented (now called Beam-wise GBI, B-GBI for short) and tested against a DNA data set from UCI repository. Since DNA data is a sequence of symbols, representing each sequence by attribute-value pairs by simply assigning these symbols to the values of ordered attributes does not make sense. By transforming the sequence into a graph structure and running B-GBI it is possible to extract discriminative substructures. These can be new attributes for a classification problem. Effect of beam width on the number of discovered attributes and predictive accuracy was evaluated, together with extracted characteristic subsequences, and the results indicate the effectiveness of B-GBI.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

C. L. Blake, E. Keogh, and C.J. Merz. Uci repository of machine leaning database, 1998. http://www.ics.uci.edu/~mlearn/MLRepository.html.
L. Breiman, J. H. Friedman, R. A. Olshen, and C. J. Stone. Classification and Regression Trees. Wadsworth & Brooks/Cole Advanced Books & Software, 1984.
Google Scholar
L. Breiman, J. H. Friedman, R. A. Olshen, and C. J. Stone. The cn2 induction algorithm. Machine Learning, 3:261–283, 1989.
Google Scholar
D. J. Cook and L. B. Holder. Graph-based data mining. IEEE Intelligent Systems, 15(2):32–41, 2000.
Article Google Scholar
S. Fortin. The graph isomorphism problem, 1996.
Google Scholar
A. Inokuchi, T. Washio, and H. Motoda. An apriori-based algorithm for mining frequent substructures from graph data. In Proc. of the 4th European Conference on Principles of Data Mining and Knowledge Discovery, pages 13–23, 2000.
Google Scholar
T. Matsuda, T. Horiuchi, H. Motoda, and T. Washio. Extension of graph-based induction for general graph structured data. In Knowledge Discovery and Data Mining: Current Issues and New Applications, Springer Verlag, LNAI 1805, pages 420–431, 2000.
Google Scholar
R. S. Michalski. Learning flexible concepts: Fundamental ideas and a method based on two-tiered representaion. In Machine Learning, An Artificial Intelligence Approiach, 3:63–102, 1990.
Google Scholar
S. Muggleton and L. de Raedt. Inductive logic programming: Theory and methods. Journal of Logic Programming, 19(20):629–679, 1994.
Article MathSciNet Google Scholar
J. R. Quinlan. Induction of decision trees. Machine Learning, 1:81–106, 1986.
Google Scholar
J. R. Quinlan. C4.5:Programs For Machine Learning. Morgan Kaufmann Publishers, 1993.
Google Scholar
R. C. Read and D. G. Corneil. The graph isomorphism disease. Journal of Graph Theory, 1:339–363, 1977.
Article MATH MathSciNet Google Scholar
K. Yoshida and H. Motoda. Clip: Concept learning from inference pattern. Journal of Artificial Intelligence, 75(1):63–92, 1995.
Article Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Scientific and Industrial Research, Osaka University, 8-1, Mihogaoka, Ibaraki, Osaka, 567-0047, Japan
Takashi Matsuda, Hiroshi Motoda, Tetsuya Yoshida & Takashi Washio

Authors

Takashi Matsuda
View author publications
You can also search for this author in PubMed Google Scholar
Hiroshi Motoda
View author publications
You can also search for this author in PubMed Google Scholar
Tetsuya Yoshida
View author publications
You can also search for this author in PubMed Google Scholar
Takashi Washio
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Information Science and Technology Department of Information and Communication Engineering, University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo, 113-8656, Japan
Mitsuru Ishizuka
School of Information Technology Knowledge Representation and Reasoning Unit (KRRU) Faculty of Engineering and Information Technology, Griffith University, PMB 50 Gold Coast Mail Centre, Queensland, 9726, Australia
Abdul Sattar

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Matsuda, T., Motoda, H., Yoshida, T., Washio, T. (2002). Knowledge Discovery from Structured Data by Beam-Wise Graph-Based Induction. In: Ishizuka, M., Sattar, A. (eds) PRICAI 2002: Trends in Artificial Intelligence. PRICAI 2002. Lecture Notes in Computer Science(), vol 2417. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45683-X_29

Download citation

DOI: https://doi.org/10.1007/3-540-45683-X_29
Published: 21 August 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44038-3
Online ISBN: 978-3-540-45683-4
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics