Abstract
A cascade model is a rule induction methodology using levelwise expansion of an itemset lattice, where the explanatory power of a rule set and its constituent rules are quantitatively expressed. The sum of squares for a categorical variable has been decomposed to within-group and between-group sum of squares, where the latter provides a good representation of the power concept in a cascade model. Using the model, we can readily derive discrimination and characteristic rules that explain as much of the sum of squares as possible. Plural rule sets are derived from the core to the outskirts of knowledge. The sum of squares criterion can be applied in any rule induction system. The cascade model was implemented as DISCAS. Its algorithms are shown and an applied example is provided for illustration purposes.
Chapter PDF
Similar content being viewed by others
References
Breiman, L., Friedman, J.H., Olshen, R.A., Stone, C.J.: Classification and Regression Trees. Wadsworth, Belmont CA (1984)
Agrawal, R., Imielinski, T., Swami, A.: Mining Association Rules between Sets of Items in Large Databases. In: Proc. ACM SIGMOD, pp. 207–216 (1993)
Okada, T.: Finding Discrimination Rules Based on Cascade Model. submitted to J. Jpn. Soc. Artificial Intelligence (1999)
Agarawal, R., Srikant, R.: Fast Algorithms for Mining Association Rules. In: Proc. VLDB, pp. 487–499 (1994)
Takeuchi, K., et al. (eds.): Encyclopedia of Statistics (in Japanese). Toyo Keizai Shinpou, pp. 388-390 (1990)
Light, R.J., Margolin, B.H.: An Analysis of Variance for Categorical Data. J. Amer. Stat. Assoc. 66, 534–544 (1971)
Okada, T.: Sum of Squares Decomposition for Categorical Data. to appear in Kwansei Gakuin Studies in Computer Science 14 (1999), http://www.media.kwansei.ac.jp/home/kiyou/kiyou99/kiyou99-e.html
Okada, T.: Efficient Detection of Local Interactions in Cascade Model. submitted to Discovery Science (1999)
Ziarko, W.: The Discovery, Analysis, and Representation of Data Dependencies in Databases. In: Piatetsky-Shapiro, G., Frawley, W. (eds.) Knowledge Discovery in Databases, pp. 195–209. AAAI Press, Menlo Park (1991)
Hawkins, D.M., Kass, G.V.: Automatic Interaction Detection. In: Hawkins, D.M. (ed.) Topics in Applied Multivariate Analysis, pp. 269–302. Cambridge Univ. Press, Cambridge (1982)
Ali, K., Manganaris, S., Srikant, R.: Partial Classification using Association Rules. In: Proc. KDD 1997, pp. 115–118 (1997)
Bayardo, R.J.: Brute-Force Mining of High-Confidence Classification Rules. In: Proc. KDD 1997, pp. 123–126 (1997)
Liu, B., Hsu, W., Ma, Y.: Integrating Classification and Association Rule Mining. In: Proc. KDD 1998, pp. 80–86 (1998)
Silberschatz, A., Tuzhilin, A.: What Makes Patterns Interesting in KnowledgeDiscovery Systems. IEEE Trans. on Know. and Data Eng. 8(6), 970–974 (1996)
Washio, T., Matsuura, H., Motoda, H.: Mining Association Rules for Estimation and Prediction. In: Wu, X., Kotagiri, R., Korb, K.B. (eds.) PAKDD 1998. LNCS, vol. 1394, pp. 417–419. Springer, Heidelberg (1998)
Suzuki, E., Shimura, M.: Exceptional Knowledge Discovery in Databases based on Information Theory. In: Proc. KDD 1996, pp. 275–278 (1996)
Silverstein, C., Brin, S., Motwani, R.: Beyond Market Baskets: Generalizing Association Rules to Dependence Rules. Data Mining and Knowledge Discovery 2, 39–68 (1998)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1999 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Okada, T. (1999). Rule Induction in Cascade Model Based on Sum of Squares Decomposition. In: Żytkow, J.M., Rauch, J. (eds) Principles of Data Mining and Knowledge Discovery. PKDD 1999. Lecture Notes in Computer Science(), vol 1704. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-48247-5_60
Download citation
DOI: https://doi.org/10.1007/978-3-540-48247-5_60
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-66490-1
Online ISBN: 978-3-540-48247-5
eBook Packages: Springer Book Archive