Skip to main content

Learning Statistically Significant Contrast Sets

  • Conference paper
  • First Online:
Advances in Artificial Intelligence (Canadian AI 2016)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9673))

Included in the following conference series:

Abstract

Contrast set learning is important to discover control variables that can distinguish different groups in a dataset. Association rule mining has an inherent connection to the contrast set learning problem and has also been used to address it. All of the association rule based contrast set learning techniques use support-confidence based methods and inherit their limitations. In recent years statistically significant rule mining has become a viable alternative to address those limitations. We propose a novel contrast set learning approach based on statistically significant rule mining that eliminates the limitations in using traditional rule mining approaches and identifies statistically significant contrast sets. We evaluated our method by building a classifier using the discovered contrast sets. The performance of our classifier, while our method is not for classification per se, reveals the effectiveness of our approach in distinguishing the groups.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    http://archive.ics.uci.edu/ml/.

References

  1. Bay, S.D., Pazzani, M.J.: Detecting group differences: mining contrast sets. Data Min. Knowl. Disc. 5(3), 213–246. Springer (2001)

    Google Scholar 

  2. Dong, G., Li, J.: Efficient mining of emerging patterns: discovering trends and differences. In: 5th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 43–52. ACM (1999)

    Google Scholar 

  3. Hilderman, R.J., Peckham, T.: A statistically sound alternative approach to mining contrast sets. In: 4th Australia Data Mining Conference (AusDM 2005), pp. 157–172 (2005)

    Google Scholar 

  4. Webb, G.I., Butler, S., Newlands, D.: On detecting differences between groups. In: 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 256–265. ACM (2003)

    Google Scholar 

  5. Satsangi, A., Zaïane, O.R.: Contrasting the contrast sets: an alternative approach. In: 11th International Database Engineering and Applications Symposium (IDEAS 2007), pp. 114–119. IEEE (2007)

    Google Scholar 

  6. Li, J., Zaïane, O.R.: Associative classification with statistically significant positive and negative rules. In: 24th ACM International Conference on Information and Knowledge Management, pp. 633–642. ACM (2015)

    Google Scholar 

  7. Agrawal, R., Srikant, R.: Fast Algorithms for mining association rules. In: 20th International Conference in Very Large Data Bases (VLDB), pp. 487–499 (1994)

    Google Scholar 

  8. Hämäläinen, W.: Kingfisher: an efficient algorithm for searching for both positive and negative dependency rules with statistical significance measures. Knowl. Inf. Syst. 32(2), 383–414 (2012)

    Article  Google Scholar 

  9. Quinlan, J.R.: C4. 5: Programs for Machine Learning. Morgan Kaufmann Publishers Inc., (1993)

    Google Scholar 

  10. Liu, B., Hsu, W., Ma, Y.: Integrating classification and association rule mining. In: 4th International Conference on Knowledge Discovery and Data Mining, pp. 80–86 (1998)

    Google Scholar 

  11. Yin, X., Han, J.: CPAR: classification based on predictive association rules. In: 3rd SIAM International Conference on Data Mining, pp. 369–376 (2003)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mohomed Shazan Mohomed Jabbar .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Mohomed Jabbar, M.S., Zaïane, O.R. (2016). Learning Statistically Significant Contrast Sets. In: Khoury, R., Drummond, C. (eds) Advances in Artificial Intelligence. Canadian AI 2016. Lecture Notes in Computer Science(), vol 9673. Springer, Cham. https://doi.org/10.1007/978-3-319-34111-8_29

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-34111-8_29

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-34110-1

  • Online ISBN: 978-3-319-34111-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics