Algebraic specification of empirical inductive learning methods based on rough sets and matroid theory

Tsumoto, Shusaku; Tanaka, Hiroshi

doi:10.1007/3-540-60156-2_16


Part of the book series: Lecture Notes in Computer Science (LNCS, volume 958)

Included in the following conference series:

International Conference on Artificial Intelligence and Symbolic Mathematical Computing


Abstract

Several inductive learning methods, such as the ID3 family and the AQ family, have been proposed for acquiring knowledge from databases. These methods have been applied to discover meaningful knowledge in large databases, and their usefulness has been demonstrated. However, because no formal approach has been proposed for treating these methods, their efficiency has only been compared empirically. In this paper, we introduce matroid theory and rough sets to construct a common framework for empirical machine learning methods that induce combinations of attribute-value pairs from databases. Combining the concepts of rough sets and matroid theory yields a framework in which the differences and similarities between these methods can be understood clearly. We compare three classical methods: AQ, Pawlak's Consistent Rules, and ID3. The results show that the algebraic structure of the former two differs from that of the latter, and that this causes the differences between AQ and ID3.
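As a rough illustration of the rough-set side of such a framework, the sketch below computes the lower and upper approximations of a target concept from attribute-value pairs. It is not the authors' formalization: the toy table, the attribute names (`fever`, `cough`, `flu`), and the helper functions `partition` and `approximations` are all hypothetical and chosen only for illustration.

```python
from collections import defaultdict

# Toy decision table (hypothetical data, not taken from the paper).
ROWS = [
    {"fever": "yes", "cough": "yes", "flu": "yes"},
    {"fever": "yes", "cough": "no",  "flu": "yes"},
    {"fever": "yes", "cough": "no",  "flu": "no"},
    {"fever": "no",  "cough": "yes", "flu": "no"},
    {"fever": "no",  "cough": "no",  "flu": "no"},
]


def partition(rows, attrs):
    """Group row indices into indiscernibility classes over attrs."""
    blocks = defaultdict(set)
    for i, row in enumerate(rows):
        blocks[tuple(row[a] for a in attrs)].add(i)
    return list(blocks.values())


def approximations(rows, attrs, target):
    """Return the (lower, upper) rough-set approximations of target."""
    lower, upper = set(), set()
    for block in partition(rows, attrs):
        if block <= target:   # block lies entirely inside the concept
            lower |= block
        if block & target:    # block overlaps the concept
            upper |= block
    return lower, upper


# Target concept: indices of the rows where flu == "yes".
flu = {i for i, row in enumerate(ROWS) if row["flu"] == "yes"}
lower, upper = approximations(ROWS, ["fever", "cough"], flu)
print(lower, upper)
```

In this toy table, only the combination fever=yes and cough=yes falls entirely inside the target concept, so it is the only combination of attribute-value pairs that yields a consistent rule; the combination fever=yes and cough=no covers both a positive and a negative case and therefore appears only in the upper approximation.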



Author information

Authors and Affiliations

Department of Information Medicine, Medical Research Institute, Tokyo Medical and Dental University, 1-5-45 Yushima, Bunkyo-ku, 113, Tokyo, Japan
Shusaku Tsumoto & Hiroshi Tanaka


Editor information

Jacques Calmet, John A. Campbell


Cite this paper

Tsumoto, S., Tanaka, H. (1995). Algebraic specification of empirical inductive learning methods based on rough sets and matroid theory. In: Calmet, J., Campbell, J.A. (eds) Integrating Symbolic Mathematical Computation and Artificial Intelligence. AISMC 1994. Lecture Notes in Computer Science, vol 958. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-60156-2_16


DOI: https://doi.org/10.1007/3-540-60156-2_16
Published: 02 June 2005
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-60156-2
Online ISBN: 978-3-540-49533-8
eBook Packages: Springer Book Archive
