Novel Interestingness Measures for Mining Significant Association Rules from Imbalanced Data

Abdellatif, Safa; Ben Hassine, Mohamed Ali; Ben Yahia, Sadok

doi:10.1007/978-3-030-15035-8_16

Novel Interestingness Measures for Mining Significant Association Rules from Imbalanced Data

Safa Abdellatif¹⁸,
Mohamed Ali Ben Hassine¹⁸ &
Sadok Ben Yahia^18,19

Conference paper
First Online: 15 March 2019

2674 Accesses
3 Citations

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 927))

Abstract

Associative classification is a rule-based approach that joins Association Rule Mining and Classification to build classifiers that predict class labels for new data. Associative classifiers may generate an overwhelming number of rules which are hard to handle. Delving through these rules to identify the most interesting ones is a challenging task. To overcome this problem, several measures have been proposed. However, for imbalanced datasets, existing measures are no more reliable. In fact, they tend either to favour rules of major classes and consider others as uninteresting or only emphasize on the rules of minor classes and omit other ones. In this respect, we propose five new measures which tend to be fair for both types of classes regardless of their imbalanced distribution. Extensive carried out experiments on real-world datasets show that the new measures are able to efficiently extract significant knowledge from minor classes without decreasing the global predictive accuracy.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Abdellatif, S., Ben Hassine, M.A., Ben Yahia, S., Bouzeghoub, A.: ARCID: a new approach to deal with imbalanced datasets classification. In: International Conference on Current Trends in Theory and Practice of Informatics. Springer (2018)
Google Scholar
Abdellatif, S., Ben Yahia, S., Ben Hassine, M.A., Bouzeghoub, A.: Fuzzy aggregation for rule selection in imbalanced datasets classification using choquet integral. In: 2018 IEEE International Conference on Fuzzy Systems, FUZZ-IEEE 2018, Rio de Janeiro, Brazil, 8–13 July 2018 (2018)
Google Scholar
Cohen, W.W.: Fast effective rule induction. In: Proceedings of the Twelfth International Conference on Machine Learning, pp. 115–123 (1995)
Google Scholar
Hu, B.G., Dong, W.M.: A study on cost behaviors of binary classification measures in class-imbalanced problems. arXiv preprint arXiv:1403.7100 (2014)
Lenca, P., Vaillant, B., Meyer, P., Lallich, S.: Association rule interestingness measures: experimental and theoretical studies. In: Quality Measures in Data Mining, pp. 51–76. Springer (2007)
Google Scholar
Ma, Y., Hsu, W., Liu, B.: Integrating classification and association rule mining. In: Proceedings of the Fourth International Conference on Knowledge Discovery and Data Mining (1998)
Google Scholar
Major, J.A., Mangano, J.J.: Selecting among rules induced from a hurricane database. J. Intell. Inf. Syst. 4(1), 39–52 (1995)
Article Google Scholar
Merz, C.: UCI repository of machine learning databases (1996). http://www.ics.uci.edu/~mlearn/MLRepository.html
Piatetsky-Shapiro, G.: Discovery, analysis, and presentation of strong rules. In: Knowledge Discovery in Databases, pp. 229–238 (1991)
Google Scholar
Quinlan, J.R.: C4.5: Programs for Machine Learning. Morgan Kaufmann, San Francisco (1993)
Google Scholar

Download references

Author information

Authors and Affiliations

Faculty of Sciences of Tunis, University of Tunis El Manar, LIPAH-LR11ES14, El Manar, 2092, Tunis, Tunisia
Safa Abdellatif, Mohamed Ali Ben Hassine & Sadok Ben Yahia
Department of Software Science, Tallinn University of Technology, Akadeemia tee 15a, 12618, Tallinn, Estonia
Sadok Ben Yahia

Authors

Safa Abdellatif
View author publications
You can also search for this author in PubMed Google Scholar
Mohamed Ali Ben Hassine
View author publications
You can also search for this author in PubMed Google Scholar
Sadok Ben Yahia
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Safa Abdellatif .

Editor information

Editors and Affiliations

Department of Information and Communication Engineering, Fukuoka Institute of Technology, Fukuoka, Japan
Leonard Barolli
Department of Advanced Sciences, Hosei University, Koganei-Shi, Tokyo, Japan
Makoto Takizawa
Department of Computer Science, Technical University of Catalonia, Barcelona, Barcelona, Spain
Fatos Xhafa
Faculty of Business Administration, Rissho University, Tokyo, Japan
Tomoya Enokido

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Abdellatif, S., Ben Hassine, M.A., Ben Yahia, S. (2019). Novel Interestingness Measures for Mining Significant Association Rules from Imbalanced Data. In: Barolli, L., Takizawa, M., Xhafa, F., Enokido, T. (eds) Web, Artificial Intelligence and Network Applications. WAINA 2019. Advances in Intelligent Systems and Computing, vol 927. Springer, Cham. https://doi.org/10.1007/978-3-030-15035-8_16

Download citation

DOI: https://doi.org/10.1007/978-3-030-15035-8_16
Published: 15 March 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-15034-1
Online ISBN: 978-3-030-15035-8
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics