Abstract
We have proposed a method called Decision Tree Graph-Based Induction (DT-GBI), which constructs a classifier (decision tree) for graph-structured data while simultaneously constructing attributes for classification. Graph-Based Induction (GBI) is utilized in DT-GBI for efficiently extracting typical patterns from graph-structured data by stepwise pair expansion (pairwise chunking). Attributes, i.e., substructures useful for classification task, are constructed by GBI on the fly while constructing a decision tree in DT-GBI. We applied DT-GBI to four classification tasks of hepatitis data using only the time-series data of blood inspection and urinalysis, which was provided by Chiba University Hospital. In the first and second experiments, the stages of fibrosis were used as classes and a decision tree was constructed for discriminating patients with F4 (cirrhosis) from patients with the other stages. In the third experiment, the types of hepatitis (B and C) were used as classes, and in the fourth experiment the effectiveness of interferon therapy was used as class label. The preliminary results of experiments, both constructed decision trees and their predictive accuracies, are reported in this paper. The validity of extracted patterns is now being evaluated by the domain experts (medical doctors). Some of the patterns match experts’ experience and the overall results are encouraging.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Breiman, L., Friedman, J.H., Olshen, R.A., Stone, C.J.: Classification and Regression Trees. Wadsworth & Brooks/Cole Advanced Books & Software (1984)
Fortin, S.: The graph isomorphism problem (1996)
Ho, T.B., Nguyen, T.D., Kawasaki, S., Le, S.Q., Nguyen, D.D., Yokoi, H., Takabayashi, K.: Mining hepatitis data with temporal abstraction. In: Proc. of the 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, August 2003, pp. 369–377 (2003)
Matsuda, T., Yoshida, T., Motoda, H., Washio, T.: Knowledge discovery from structured data by beam-wise graph-based induction. In: Ishizuka, M., Sattar, A. (eds.) PRICAI 2002. LNCS (LNAI), vol. 2417, pp. 255–264. Springer, Heidelberg (2002)
Ohsaki, M., Sato, Y., Yokoi, H., Yamaguchi, T.: A rule discovery support system for sequential medical data - in the case study of a chronic hepatitis dataset -. In: Working note of International Workshop on Active Mining (AM2002), pp. 97–102 (2002)
Quinlan, J.R.: Induction of decision trees. Machine Learning 1, 81–106 (1986)
Quinlan, J.R.: C4.5:Programs For Machine Learning. Morgan Kaufmann Publishers, San Francisco (1993)
Read, R.C., Corneil, D.G.: The graph isomorphism disease. Journal of Graph Theory 1, 339–363 (1977)
Tsumoto, S., Takabayashi, K., Nagira, M., Hirano, S.: Trend-evaluation multiscale analysis of the hepatitis dataset. In: Project “Realization of Active Mining in the Era of Information Flood” Report, pp. 191–197 (March 2003)
Warodom, G., Matsuda, T., Yoshida, T., Motoda, H., Washio, T.: Classifier construction by graph-based induction for graph-structured data. In: Whang, K.-Y., Jeon, J., Shim, K., Srivastava, J. (eds.) PAKDD 2003. LNCS (LNAI), vol. 2637, pp. 52–62. Springer, Heidelberg (2003)
Warodom, G., Matsuda, T., Yoshida, T., Motoda, H., Washio, T.: Performance evaluation of decision tree graph-based induction. In: Grieser, G., Tanaka, Y., Yamamoto, A. (eds.) DS 2003. LNCS (LNAI), vol. 2843, pp. 128–140. Springer, Heidelberg (2003)
Yamada, Y., Suzuki, E., Yokoi, H., Takabayashi, K.: Decision-tree induction from time-series data based on a standard-example split test. In: Proc. of the 12th International Conference on Machine Learning, August 2003, pp. 840–847 (2003)
Yoshida, K., Motoda, H.: Clip: Concept learning from inference pattern. Journal of Artificial Intelligence 75(1), 63–92 (1995)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Geamsakul, W. et al. (2005). Extracting Diagnostic Knowledge from Hepatitis Dataset by Decision Tree Graph-Based Induction. In: Tsumoto, S., Yamaguchi, T., Numao, M., Motoda, H. (eds) Active Mining. Lecture Notes in Computer Science(), vol 3430. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11423270_8
Download citation
DOI: https://doi.org/10.1007/11423270_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-26157-5
Online ISBN: 978-3-540-31933-7
eBook Packages: Computer ScienceComputer Science (R0)