Abstract
This paper presents a subgroup discovery algorithm APRIORI-SD, developed by adapting association rule learning to subgroup discovery. This was achieved by building a classification rule learner APRIORI-C, enhanced with a novel post-processing mechanism, a new quality measure for induced rules (weighted relative accuracy) and using probabilistic classification of instances. Results of APRIORI-SD are similar to the subgroup discovery algorithm CN2-SD while experimental comparisons with CN2, RIPPER and APRIORI-C demonstrate that the subgroup discovery algorithm APRIORI-SD produces substantially smaller rule sets, where individual rules have higher coverage and significance.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Agrawal, R., Imielinski, T., Shrikant, R.: Mining Association Rules between Sets of Items in Large Databases. In: Proceedings of ACM SIGMOD Conference on Management of Data, Washington, D.C., pp. 207–216 (1993)
Agrawal, R., Srikant, R.: Fast Algorithms for Mining Association Rules. In: Proceedings of the 20th International Conference on Very Large Databases, Santiago, Chile, pp. 207–216 (1994)
Clark, P., Niblett, T.: The CN2 induction algorithm. Machine Learning 3(4), 261–283 (1989)
Cohen, W.W.: Fast effective rule induction. In: Proceedings of the 12th International Conference on Machine Learning, pp. 115–123. Morgan Kaufmann, San Francisco (1995)
Džeroski, S., Lavrač, N. (eds.): Relational Data Mining, pp. 74–99. Springer, Heidelberg (2001)
Ferri-Ramírez, C., Flach, P.A., Hernandez-Orallo, J.: Learning decision trees using the area under the ROC curve. In: Proceedings of the 19th International Conference on Machine Learning, pp. 139–146. Morgan Kaufmann, San Francisco (2002)
Jovanoski, V., Lavrač, N.: Classification Rule Learning with APRIORI-C. In: Brazdil, P., Jorge, A. (eds.) EPIA 2001. LNCS (LNAI), vol. 2258, pp. 44–51. Springer, Heidelberg (2001)
Kononenko, I.: On Biases in Estimating Multi-Valued Attributes. In: Proceedings of the 14th Int. Joint Conf. on Artificial Intelligence, pp. 1034–1040 (1995)
Lavrač, N., Flach, P., Zupan, B.: Rule evaluation measures: A unifying view. In: Proceedings of the 9th International Workshop on Inductive Logic Programming, pp. 74–185. Springer, Heidelberg (1999)
Lavrač, N., Flach, P., Kavšek, B., Todorovski, L.: Adapting classification rule induction to subgroup discovery. In: Proceedings of the IEEE International Conference on Data Mining (ICDM 2002), Maebashi City, Japan, pp. 266–273 (2002)
Liu, B., Hsu, W., Ma, Y.: Integrating Classification and Association Rule Mining. In: Proceedings of the Fourth International Conference on Knowledge Discovery and Data Mining KDD 1998, New York, USA, pp. 80–86 (1998)
Michalski, R.S., Mozetič, I., Hong, J., Lavrač, N.: The multi-purpose incremental learning system AQ15 and its testing application on three medical domains. In: Proc. 5th National Conference on Artificial Intelligence, pp. 1041–1045. Morgan Kaufmann, San Francisco (1986)
Murphy, P.M., Aha, D.W.: UCI repository of machine learning databases. University of California, Department of Information and Computer Science, Irvine, CA (1994), http://www.ics.uci.edu/~mlearn/MLRepository.html
Provost, F., Fawcett, T.: Robust classification for imprecise environments. Machine Learning 42(3), 203–231 (2001)
Rivest, R.L.: Learning decision lists. Machine Learning 2(3), 229–246 (1987)
Todorovski, L., Flach, P., Lavrač, N.: Predictive performance of weighted relative accuracy. In: Zighed, D.A., Komorowski, J., Zytkow, J. (eds.) Proceedings of the 4th European Conference on Principles of Data Mining and Knowledge Discovery, pp. 255–264. Springer, Heidelberg (2000)
Witten, I.H., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations. Morgan Kaufmann, San Francisco (1999)
Wrobel, S.: An algorithm for multi-relational discovery of subgroups. In: Proceedings of the 1st European Symposium on Principles of Data Mining and Knowledge Discovery, pp. 78–87. Springer, Heidelberg (1997)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kavšek, B., Lavrač, N., Jovanoski, V. (2003). APRIORI-SD: Adapting Association Rule Learning to Subgroup Discovery. In: R. Berthold, M., Lenz, HJ., Bradley, E., Kruse, R., Borgelt, C. (eds) Advances in Intelligent Data Analysis V. IDA 2003. Lecture Notes in Computer Science, vol 2810. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-45231-7_22
Download citation
DOI: https://doi.org/10.1007/978-3-540-45231-7_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40813-0
Online ISBN: 978-3-540-45231-7
eBook Packages: Springer Book Archive