Abstract
Data classification is one of the basic tasks in data mining. In this paper, we propose a new classifier based on relative entropy, in which data are assigned to a particular class by the majority good guess criterion. The approach is intended for cases where the relations between datasets and assignment classes are complex, nonlinear, or logically inconsistent, because such datasets can be too complex to be classified by ordinary decision-tree methods or by the tools of logical analysis. The relative entropy evaluation of associative rules can be simple to interpret and offers better comprehensibility than decision trees and artificial neural networks.
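For orientation, the relative entropy (Kullback-Leibler divergence) of a discrete distribution P from a distribution Q is D(P || Q) = sum_i p_i log2(p_i / q_i). The minimal sketch below computes only this quantity. It is not the classifier proposed in the paper, and the function name `relative_entropy` and the example distributions are illustrative assumptions.

```python
import math


def relative_entropy(p, q):
    """Relative entropy D(P || Q) in bits for two discrete distributions.

    p and q are sequences of probabilities over the same outcomes.
    Terms with p_i == 0 contribute nothing; if some p_i > 0 while
    q_i == 0, the divergence is infinite.
    """
    if len(p) != len(q):
        raise ValueError("distributions must have the same length")
    total = 0.0
    for pi, qi in zip(p, q):
        if pi == 0:
            continue  # by convention, 0 * log(0 / q) = 0
        if qi == 0:
            return math.inf
        total += pi * math.log2(pi / qi)
    return total


# Hypothetical example: how far a skewed class distribution
# lies from a uniform one over two classes.
uniform = [0.5, 0.5]
skewed = [0.75, 0.25]
print(relative_entropy(skewed, uniform))
```

The value is zero only when the two distributions coincide, and it is not symmetric in P and Q, so the order of the arguments matters.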
Acknowledgments
This work was supported by the SGS in VSB—Technical University of Ostrava, Czech Republic, under the grant No. SP2015/146.
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Vašinek, M., Platoš, J. (2016). Experiments on Data Classification Using Relative Entropy. In: Burduk, R., Jackowski, K., Kurzyński, M., Woźniak, M., Żołnierek, A. (eds) Proceedings of the 9th International Conference on Computer Recognition Systems CORES 2015. Advances in Intelligent Systems and Computing, vol 403. Springer, Cham. https://doi.org/10.1007/978-3-319-26227-7_22
DOI: https://doi.org/10.1007/978-3-319-26227-7_22
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-26225-3
Online ISBN: 978-3-319-26227-7
eBook Packages: Engineering, Engineering (R0)