Abstract
Classification has been one of the focal points of data mining research and applications. Building on GARC (Gain-based Association Rule Classification; Chen, Liu, Yu, Wei, & Zhang, 2006), an effective approach to constructing compact and accurate associative classifiers in the form of association rules, this chapter explores a fuzzy extension to GARC that addresses the problem caused by crisp partitions of continuous attribute domains. Concretely, the sharp boundaries of the partitioned intervals are smoothed using fuzzy sets (often conveniently labeled with linguistic terms) so as to reflect varying degrees of fuzziness on the domains (parameterized in f2), giving rise to a fuzzy associative classifier (i.e., GARC f2). Furthermore, because of the fuzziness involved, the notions of information gain, rule redundancy, and rule conflict are extended so that the desirable features of GARC, accuracy and compactness, carry over to the fuzzy setting. Finally, experiments on benchmark datasets and a real-world application illustrate the effectiveness of the proposed fuzzy associative classifier.
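To make the idea of smoothing crisp interval boundaries concrete, the following is a minimal Python sketch, not the chapter's actual definitions. It assumes linear ramps around each cut point and a membership-weighted entropy as one plausible fuzzy analogue of information gain. The names `fuzzy_partition`, `fuzzy_info_gain`, and the `overlap` parameter are hypothetical and are not claimed to correspond to the chapter's fuzziness parameter.

```python
import math


def fuzzy_partition(cuts, overlap):
    """Return one membership function per interval induced by sorted cut points.

    Assumption (illustrative only): each crisp boundary c is replaced by a
    linear ramp over [c - overlap, c + overlap]; overlap = 0 gives the crisp
    partition. Ramps of neighbouring cuts are assumed not to overlap each other.
    """
    def ramp_up(x, c):
        # Degree to which x lies to the right of cut point c.
        if overlap == 0:
            return 1.0 if x >= c else 0.0
        return min(1.0, max(0.0, (x - (c - overlap)) / (2 * overlap)))

    def make(i):
        def mu(x):
            left = ramp_up(x, cuts[i - 1]) if i > 0 else 1.0
            right = 1.0 - ramp_up(x, cuts[i]) if i < len(cuts) else 1.0
            return min(left, right)
        return mu

    return [make(i) for i in range(len(cuts) + 1)]


def entropy(weight_by_class):
    """Shannon entropy (base 2) of a class distribution given as weights."""
    total = sum(weight_by_class.values())
    if total == 0:
        return 0.0
    return -sum((w / total) * math.log2(w / total)
                for w in weight_by_class.values() if w > 0)


def fuzzy_info_gain(values, labels, memberships):
    """Information gain where each record counts by its membership degree."""
    base = {}
    for y in labels:
        base[y] = base.get(y, 0.0) + 1.0
    gain = entropy(base)
    n = len(values)
    for mu in memberships:
        counts = {}
        for x, y in zip(values, labels):
            counts[y] = counts.get(y, 0.0) + mu(x)
        # Weight of this fuzzy interval = its share of total membership.
        gain -= (sum(counts.values()) / n) * entropy(counts)
    return gain


# Hypothetical toy data: one continuous attribute, two classes, one cut at 5.
values = [1, 2, 3, 7, 8, 9]
labels = ["a", "a", "a", "b", "b", "b"]
crisp_gain = fuzzy_info_gain(values, labels, fuzzy_partition([5], 0))
fuzzy_gain = fuzzy_info_gain(values, labels, fuzzy_partition([5], 3))
print(crisp_gain, fuzzy_gain)
```

In this toy setting, records near the cut point belong partially to both intervals, so the fuzzy gain is lower than the crisp gain. This is one way to see why gain-based rule selection needs to be redefined once the boundaries are fuzzy.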
References
Agrawal, R., Imielinski, T., Swami, A.: Mining association rules between sets of items in large databases. In: Proceedings of the ACM-SIGMOD International Conference on Management of Data, Washington, DC, pp. 207–216 (1993)
Agrawal, R., Srikant, R.: Fast algorithms for mining association rules. In: Proceedings of the 20th VLDB Conference, pp. 487–499. Morgan Kaufmann, San Francisco (1994)
Ali, K., Manganaris, K., Srikant, R.: Partial Classification using Association Rules. In: Proceedings of the Third International Conference on Knowledge Discovery and Data Mining, Newport Beach, California, pp. 115–118 (1997)
Aumann, Y., Lindell, Y.: A statistical theory for quantitative association rules. Knowledge Discovery and Data Mining, 261–270 (1999)
Berzal, F., Cubero, J., Sanchez, D., et al.: ART: A Hybrid Classification Model. Machine Learning 54, 67–92 (2004)
Bezdek, J.C.: Pattern Recognition with Fuzzy Objective Function Algorithms. Plenum Press, New York (1981)
Bosc, P., Pivert, O.: On some Fuzzy Extensions of Association Rules. In: Proceedings of IFSA/NAFIPS 2001 Congress, Vancouver, Canada, pp. 1104–1109 (2001)
Breiman, L.: Classification and Regression trees. Wadsworth, Belmont (1984)
Chen, G.Q.: Fuzzy Logic in Data Modeling: semantics, constraints and database design. Kluwer Academic Publishers, Boston (1998)
Chen, G.Q., Liu, H.Y., Yu, L., Wei, Q., Zhang, X.: A New Approach to Classification Based on Association Rule Mining. Decision Support Systems 42, 674–689 (2006)
Chen, G.Q., Wei, Q.: Fuzzy Association Rules and the Extended Mining Algorithms. Information Sciences 147, 201–228 (2002)
Chen, G.Q., Wei, Q., Kerre, E.E.: Fuzzy logic in discovering association rules: an overview. In: Data Mining and Knowledge Discovery Approaches Based on Rule Induction Techniques. Massive Computing Series, pp. 474–480. Springer, Heidelberg (2006)
Chen, G.Q., Wei, Q., Liu, D., Wets, G.: Simple Association Rules (SAR) and the SAR-Based Rule Discovery. Computers & Industrial Engineering 43, 721–733 (2002)
Chen, G.Q., Yan, P., Kerre, E.E.: Computationally Efficient Mining for Fuzzy Implication-Based Association Rules in Quantitative Databases. International Journal of General Systems 33, 163–182 (2004)
Chen, G.Q., Yan, P., Wei, Q.: Discovering Associations with Uncertainty from Large Databases. In: Rakus-Andersson, E., Yager, R., Ichalkaranje, N., Jain, L. (eds.) Recent Advances in Decision Making, book series of Studies in Computational Intelligence, vol. 222, pp. 45–66 (2009)
Chen, G.Q., Zhang, X., Yu, L.: An improved GARC algorithm for building accurate and more understandable associative classifiers. Dynamics of Continuous, Discrete and Impulsive Systems – Series B – applications & algorithms 13, 161–165 (2006)
Chen, Z.L., Chen, G.Q.: Building an Associative Classifier Based on Fuzzy Association Rules. International Journal of Computational Intelligence Systems 1(3), 262–273 (2008)
Conover, W.J.: Practical Nonparametric Statistics, 3rd edn. Wiley, New York (1999)
De Luca, A., Termini, S.: A definition of Non-Probabilistic Entropy in the Setting of Fuzzy Sets. Information and Control 20, 301–312 (1972)
Dong, G., Zhang, X., Wong, L., Li, J.: CAEP: Classification by aggregating emerging patterns. In: Proceedings of the Second International Conference on Discovery Science, Tokyo, Japan, pp. 30–42 (1999)
Dougherty, J., Kohavi, R., Sahami, M.: Supervised and unsupervised discretization of continuous features. In: The Twelfth International Conference on Machine Learning, pp. 194–202 (1995)
Dunn, J.C.: A Fuzzy Relative of the ISODATA Process and Its Use in Detecting Compact Well Separated Cluster. Cybernetics and Systems 3(3), 32–57 (1974)
Fayyad, U., Uthurusamy, R. (eds.): AAAI Workshop on Knowledge Discovery in Databases, Seattle, Washington, USA (1994)
Fayyad, U.M., Irani, K.B.: Multi-interval discretization of continuous-valued attributes for classification learning. In: Proceedings of the 13th International Joint Conference on Artificial Intelligence, pp. 1022–1027 (1993)
Friedman, N., Geiger, D., Goldszmidt, M.: Bayesian network classifiers. Machine Learning 29, 131–163 (1997)
Han, J.W., Kamber, M., Pei, J.: Data Mining: Concepts and Techniques, 2nd edn. Morgan Kaufmann Publishers, San Francisco (2005)
Klir, G.J., Yuan, B.: Fuzzy Sets and Fuzzy Logic: Theory and Applications. Prentice Hall PTR, Englewood Cliffs (1995)
Li, W.M., Han, J., Pei, J.: CMAR: Accurate and efficient classification based on multiple class-association rules. In: IEEE International Conference on Data Mining, San Jose, CA, pp. 369–376 (2001)
Liu, B., Hsu, W., Ma, Y.: Integrating Classification and Association Rule Mining. In: The 4th International Conference on Knowledge Discovery and Data Mining, New York, U.S. pp. 80–86 (1998)
Liu, B., Ma, Y.M., Wong, C.K.: Improving an association rule based classifier. Principles of Data Mining and Knowledge Discovery, 504–509 (2000)
Mannila, H., Toivonen, H., Verkamo, A.I.: Efficient Algorithms for Discovering Association Rules. In: AAAI Workshop on Knowledge Discovery in Databases, Seattle, Washington, pp. 181–192 (1994)
Merz, C.J., Murphy, P.: UCI repository of machine learning databases (1996)
Quinlan, J.R.: C4.5: Programs for Machine Learning, pp. 23–24. Morgan Kaufmann, San Francisco (1993)
Shafer, J., Agrawal, R., Mehta, M.: SPRINT: a scalable parallel classifier for data mining. In: Proceedings of the 22nd VLDB conference, India, pp. 544–555 (1996)
Sharpe, P.K., Glover, R.P.: Efficient GA based techniques for classification. Applied Intelligence 11, 277–284 (1999)
Shie, J.D., Chen, S.M.: Feature subset selection based on fuzzy entropy measures for handling classification problems. Applied Intelligence 28, 69–82 (2008)
Srikant, R., Agrawal, R.: Mining Quantitative Association Rules in Large Relational Tables. In: Proceedings of the ACM-SIGMOD Conference on Management of Data, Montreal, Canada, pp. 1–12 (1996)
Srikant, R., Vu, Q., Agrawal, R.: Mining Association Rules with Item Constraints. In: Proceedings of the 3rd Int’l Conference on Knowledge Discovery in Databases and Data Mining, Newport Beach, California, USA (1997)
Vapnik, V.N.: Statistical Learning Theory. John Wiley, Chichester (1998)
Wang, K., Zhou, S.Q., He, Y.: Growing decision tree on supportless association rules. In: KDD 2000, Boston, MA, USA (2000)
Weiss, S., Kulikowski, C.: Computer Systems that Learn: Classification and Prediction Methods from Statistics, Neural Nets, Machine Learning, and Expert Systems. Morgan Kaufmann, San Francisco (1991)
Xiong, Y.X.: Fuzzy extensions to GARC associative classification (Master Thesis). Tsinghua University (2010)
Yin, X.X., Han, J.W.: CPAR: Classification based on predictive association rules. In: Proceedings of the third SIAM International Conference on Data Mining, pp. 331–334. SIAM, Philadelphia (2003)
Zadeh, L.A.: Fuzzy sets. Information and Control 8, 338–353 (1965)
Zhang, X., Chen, G.Q., Wei, Q.: Building a Highly-Compact and Accurate Associative Classifier. Applied Intelligence (2009), doi:10.1007/s10489-009-0181-z
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Chen, G., Xiong, Y., Wei, Q. (2010). A Fuzzy Extension to Compact and Accurate Associative Classification. In: Cornelis, C., Deschrijver, G., Nachtegael, M., Schockaert, S., Shi, Y. (eds) 35 Years of Fuzzy Set Theory. Studies in Fuzziness and Soft Computing, vol 261. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16629-7_9
DOI: https://doi.org/10.1007/978-3-642-16629-7_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-16628-0
Online ISBN: 978-3-642-16629-7
eBook Packages: Engineering, Engineering (R0)