Abstract
Decision tree algorithms deal with continuous variables by finding split points which provide best separation of objects belonging to different classes. Such criteria can also be used to augment methods which require or prefer symbolic data. A tool for continuous data discretization based on the SSV criterion (designed for decision trees) has been constructed. It significantly improves the performance of Naive Bayes Classifier. The combination of the two methods has been tested on 15 datasets from UCI repository and compared with similar approaches. The comparison confirms the robustness of the system.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Kerber, R.: Chimerge: Discretization for numeric attributes. In: National Conference on Artificial Intelligence, pp. 123–128. AAAI Press, Menlo Park (1992)
Liu, H., Setiono, R.: Chi2: Feature selection and discretization of numeric attributes. In: Proceedings of 7th IEEE Int’l Conference on Tools with Artificial Intelligence (1995)
Fayyad, U.M., Irani, K.B.: Multi-interval discretization of continuous-valued attributes for classification learning. In: Proceedings of the 13th International Joint Conference on Artifficial Intelligence, pp. 1022–1027. Morgan Kaufmann Publishers, San Francisco (1993)
Merz, C.J., Murphy, P.M.: UCI repository of machine learning databases (1998), http://www.ics.uci.edu/~mlearn/MLRepository.html
Dougherty, J., Kohavi, R., Sahami, M.: Supervised and unsupervised discretization of continuous features. In: Proceedings of the ICML, pp. 194–202 (1995)
Yang, Y., Webb, G.I.: Acomparative study of discretization methods for Naive-Bayes classifiers. In: Proceedings of PKAW 2002: The 2002 Pacific Rim Knowledge Acquisition Workshop, pp. 159–173 (2002)
Gra̧bczewski, K., Duch, W.: The Separability of Split Value criterion. In: Proceedings of the 5th Conference on Neural Networks and Their Applications, Zakopane, Poland (2000)
Duch, W., Winiarski, T., Biesiada, J., Kachel, A.: Feature ranking, selection and discretization. In: Proceedings of the Internatijonal Conference on Artificial Neural Networks 2003 (2003)
Gr1bczewski, K., Jankowski, N.: Transformations of symbolic data for continuous data oriented models. In: Proceedings of the ICANN 2003, Springer, Heidelberg (2003)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Grąbczewski, K. (2004). SSV Criterion Based Discretization for Naive Bayes Classifiers. In: Rutkowski, L., Siekmann, J.H., Tadeusiewicz, R., Zadeh, L.A. (eds) Artificial Intelligence and Soft Computing - ICAISC 2004. ICAISC 2004. Lecture Notes in Computer Science(), vol 3070. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24844-6_86
Download citation
DOI: https://doi.org/10.1007/978-3-540-24844-6_86
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22123-4
Online ISBN: 978-3-540-24844-6
eBook Packages: Springer Book Archive