Abstract
Contrast set learning is important to discover control variables that can distinguish different groups in a dataset. Association rule mining has an inherent connection to the contrast set learning problem and has also been used to address it. All of the association rule based contrast set learning techniques use support-confidence based methods and inherit their limitations. In recent years statistically significant rule mining has become a viable alternative to address those limitations. We propose a novel contrast set learning approach based on statistically significant rule mining that eliminates the limitations in using traditional rule mining approaches and identifies statistically significant contrast sets. We evaluated our method by building a classifier using the discovered contrast sets. The performance of our classifier, while our method is not for classification per se, reveals the effectiveness of our approach in distinguishing the groups.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
References
Bay, S.D., Pazzani, M.J.: Detecting group differences: mining contrast sets. Data Min. Knowl. Disc. 5(3), 213–246. Springer (2001)
Dong, G., Li, J.: Efficient mining of emerging patterns: discovering trends and differences. In: 5th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 43–52. ACM (1999)
Hilderman, R.J., Peckham, T.: A statistically sound alternative approach to mining contrast sets. In: 4th Australia Data Mining Conference (AusDM 2005), pp. 157–172 (2005)
Webb, G.I., Butler, S., Newlands, D.: On detecting differences between groups. In: 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 256–265. ACM (2003)
Satsangi, A., Zaïane, O.R.: Contrasting the contrast sets: an alternative approach. In: 11th International Database Engineering and Applications Symposium (IDEAS 2007), pp. 114–119. IEEE (2007)
Li, J., Zaïane, O.R.: Associative classification with statistically significant positive and negative rules. In: 24th ACM International Conference on Information and Knowledge Management, pp. 633–642. ACM (2015)
Agrawal, R., Srikant, R.: Fast Algorithms for mining association rules. In: 20th International Conference in Very Large Data Bases (VLDB), pp. 487–499 (1994)
Hämäläinen, W.: Kingfisher: an efficient algorithm for searching for both positive and negative dependency rules with statistical significance measures. Knowl. Inf. Syst. 32(2), 383–414 (2012)
Quinlan, J.R.: C4. 5: Programs for Machine Learning. Morgan Kaufmann Publishers Inc., (1993)
Liu, B., Hsu, W., Ma, Y.: Integrating classification and association rule mining. In: 4th International Conference on Knowledge Discovery and Data Mining, pp. 80–86 (1998)
Yin, X., Han, J.: CPAR: classification based on predictive association rules. In: 3rd SIAM International Conference on Data Mining, pp. 369–376 (2003)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Mohomed Jabbar, M.S., Zaïane, O.R. (2016). Learning Statistically Significant Contrast Sets. In: Khoury, R., Drummond, C. (eds) Advances in Artificial Intelligence. Canadian AI 2016. Lecture Notes in Computer Science(), vol 9673. Springer, Cham. https://doi.org/10.1007/978-3-319-34111-8_29
Download citation
DOI: https://doi.org/10.1007/978-3-319-34111-8_29
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-34110-1
Online ISBN: 978-3-319-34111-8
eBook Packages: Computer ScienceComputer Science (R0)