Abstract
The expressivity of machine learning algorithms is considered to be critical in intelligent data analysis tasks for practical application. As an alternative set of classification rule learning algorithms to conventional decision tree, Prism family of algorithms induce modular rules concisely, thus exhibiting good expressiveness for human users. However, existing Prism rule induction techniques are limited by the assumption of Gaussian distribution for quantitative attributes, and may not be available for real life data analyzing, in which skewness is commonly observed. For this reason, we investigate a skew-adaptive mechanism for rule term boundary delimitation in Prism inductive learning. The propose algorithm, called P2-Prism, could learn expressive classification rules directly from quantitative data beyond Gaussian distribution. By employing statistical inference characteristics of Poisson process, our mechanism provides a significant contribution to classification rule inductive learning with adaption of skewed data distribution. The experimental evaluation of our algorithm demonstrates its skew-adaptive superiority on benchmark datasets, comparing with state-of-the-art algorithms. Furthermore, it is shown that P2-Prism is a robust classifier in the presence of various levels of noise, which further reveals its adaptability to the skewness of data distribution.
Similar content being viewed by others
References
Fong R, Vedaldi A (2017) Interpretable explanations of black boxes by meaningful perturbation. In: Proceedings of the IEEE international conference on computer vision, pp 3449–3457
Cohen N, Sharir O, Shashua A (2016) On the expressive power of deep learning: a tensor analysis. In: Conference on learning theory, pp 698–728
Barrientos F, Sainz G (2012) Interpretable knowledge extraction from emergency call data based on fuzzy unsupervised decision tree. Knowl-Based Syst 25(1):77–87
Wu C-C, Chen Y-L, Liu Y-H, Yang X-Y (2016) Decision tree induction with a constrained number of leaf nodes. Appl Intell 45(3):673–685
Andrzejak A, Langner F, Zabala S (2013) Interpretable models from distributed data via merging of decision trees. In: 2013 IEEE symposium on computational intelligence and data mining (CIDM). IEEE, pp 1–9
Witten IH, Frank E, Hall MA (2011) Data mining: practical machine learning tools and techniques. Elsevier, Burlington
Le T, Stahl F, Gaber MM, Gomes JB, Di Fatta G (2017) On expressiveness and uncertainty awareness in rule-based classification for data streams. Neurocomputing 265:127–141
Hao Z, Yao L, Liu B, Wang Y (2014) Arguing prism: an argumentation based approach for collaborative classification in distributed environments. In: International conference on database and expert systems applications. Springer, Cham, pp 34–41
Stahl F, Bramer M (2014) Random prism: a noise-tolerant alternative to random forests. Expert Syst 31(5):411–420
Almutairi M, Stahl F, Jennings M, Le T, Bramer M (2016) Towards expressive modular rule induction for numerical attributes. In: International conference on innovative techniques and applications of artificial intelligence. Springer, Cham, pp 229–235
Le T, Stahl F, Gomes JB, Gaber MM, Di Fatta G (2014) Computationally efficient rule-based classification for continuous streaming data. In: Research and development in intelligent systems XXXI. Springer, pp 21–34
Almutairi M, Stahl F, Bramer M (2017) Improving modular classification rule induction with G-prism using dynamic rule term boundaries. In: International conference on innovative techniques and applications of artificial intelligence. Springer, Cham, pp 115–128
Drosou M, Jagadish H, Pitoura E, Stoyanovich J (2017) Diversity in big data: a review. Big Data 5(2):73–84
Fürnkranz J, Gamberger D, Lavrač N (2012) Foundations of rule learning. Springer Science & Business Media, Heidelberg
Han J, Pei J, Kamber M (2011) Data mining: concepts and techniques. Elsevier, Amsterdam
Cendrowska J (1987) Prism: an algorithm for inducing modular rules. IJMMS 27(4):349–370
Bramer M (2007) Principles of data mining, vol 180. Springer, London
Došilović FK, Brčić M, Hlupić N (2018) Explainable artificial intelligence: a survey. In: 2018 41st international convention on information and communication technology, electronics and microelectronics (MIPRO). IEEE, pp 0210–0215
Zliobaite I, Bifet A, Gaber M, Gabrys B, Gama J, Minku L, Musial K (2012) Next challenges for adaptive learning systems. ACM SIGKDD Explorations Newsletter 14(1):48–55
Ribeiro MT, Singh S, Guestrin C (2016) Why should i trust you?: explaining the predictions of any classifier. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining. ACM, pp 1135–1144
Nguyen A, Yosinski J, Clune J (2015) Deep neural networks are easily fooled: high confidence predictions for unrecognizable images. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 427–436
Tinelli E, Colucci S, Donini FM, Di Sciascio E, Giannini S (2017) Embedding semantics in human resources management automation via SQL. Appl Intell 46(4):952–982
Liu B, Yao L, Ding Z, Xu J, Wu J (2018) Combining ontology and reinforcement learning for zero-shot classification. Knowl-Based Syst 144:42–50
Moore A, Murdock V, Cai Y, Jones K (2018) Transparent tree ensembles. In: The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval. ACM, pp 1241–1244
Si Z, Zhu S-C (2013) Learning and-or templates for object recognition and detection. IEEE Trans Pattern Anal Mach Intell 35(9):2189–2205
Hao Z, Liu B, Wu J, Yao J (2015) Exploiting ontological reasoning in argumentation based multi-agent collaborative classification. In: Asian conference on intelligent information and database systems. Springer, Cham, pp 23–33
Johnson NL, Kemp AW, Kotz S (2005) Univariate discrete distributions, vol 444. Wiley, Hoboken
Frank A, Asuncion A (2010) UCI machine learning repository. Available from: http://archive.ics.uci.edu/ml. Accessed May 29 2018
Acknowledgments
This work was supported in part by the China Postdoctoral Science Foundation (Grant No. 2018M643187), the National Natural Science Foundation of China (Grant No. 71701134, 71571120, 71271140), the Humanity and Social Science Youth Foundation of the Ministry of Education of China (Grant No. 16YJC630153), and the Natural Science Foundation of Guangdong Province of China (Grant No. 2017A030310427).
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Hao, Zy., Yang, C., Liu, L. et al. Exploiting skew-adaptive delimitation mechanism for learning expressive classification rules. Appl Intell 50, 746–758 (2020). https://doi.org/10.1007/s10489-019-01533-1
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-019-01533-1