A Clustering-Inspired Quality Measure for Exceptional Preferences Mining—Design Choices and Consequences

Verhaegh, Ruben Franciscus Adrianus; Kiezebrink, Jacco Johannes Egbert; Nusteling, Frank; Rio, Arnaud Wander André; Bendicsek, Márton Bendegúz; Duivesteijn, Wouter; Schouten, Rianne Margaretha

doi:10.1007/978-3-031-18840-4_31

Ruben Franciscus Adrianus Verhaegh⁹,
Jacco Johannes Egbert Kiezebrink⁹,
Frank Nusteling⁹,
Arnaud Wander André Rio⁹,
Márton Bendegúz Bendicsek⁹,
Wouter Duivesteijn⁹ &
…
Rianne Margaretha Schouten⁹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 13601))

Included in the following conference series:

International Conference on Discovery Science

1087 Accesses

Abstract

Exceptional Preferences Mining (EPM) combines the research fields of Preference Learning and Exceptional Model Mining. It is a local pattern mining task, where we try to find coherent subgroups of the dataset featuring unusual preferences between a fixed set of labels. We introduce a new quality measure for Exceptional Preferences Mining, inspired by concepts from Clustering. On top of that, we draw conclusions on two design choices that must necessarily be made whenever one defines a quality measure for any version of Exceptional Model Mining: on the one hand, exceptional behavior is easily (spuriously) found in tiny subgroups, so what is the best way to compensate for that; on the other hand, when gauging exceptionality of a subgroup’s behavior, what does one use as reference for the normal behavior? We find that the choice of correction factor not only influences the subgroup size but it also effects the presumed exceptionality of found subgroups. The entropy function allows for detecting exceptional subgroups of a meaningful size, both when a candidate subgroup is evaluated against its complement and against the entire dataset.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 79.99; Price excludes VAT (USA)

Softcover Book: USD 99.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Discovering a taste for the unusual: exceptional models for preference mining

Article Open access 09 July 2018

Exceptional Preferences Mining

Exceptionally monotone models—the rank correlation model class for Exceptional Model Mining

Article 18 August 2016

References

Boley, M., Goldsmith, B.R., Ghiringhelli, L.M., Vreeken, J.: Identifying consistent statements about numerical data with dispersion-corrected subgroup discovery. Data Min. Knowl. Discov. 31(5), 1391–1418 (2017)
Article MathSciNet MATH Google Scholar
Cheng, W., Henzgen, S., Hüllermeier, E.: Labelwise versus pairwise decomposition in label ranking. In: Proceedings of the 15th LWA Workshops: KDML, IR and FGWM, pp. 129–136 (2013)
Google Scholar
Duivesteijn, W., Feelders, A., Knobbe, A.J.: Different slopes for different folks: mining for exceptional regression models with Cook’s distance. In: Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2012), pp. 868–876 (2012)
Google Scholar
Duivesteijn, W., Feelders, A.J., Knobbe, A.: Exceptional model mining – supervised descriptive local pattern mining with complex target concepts. Data Min. Knowl. Disc. 30(1), 47–98 (2016)
Article MATH Google Scholar
Fürnkranz, J., Hüllermeier, E.: Preference learning: an introduction. In: Fürnkranz, J., Hüllermeier, E. (eds.) Preference Learning, pp. 1–17. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-14125-6_1
Chapter MATH Google Scholar
Grosskreutz, H., Boley, M., Krause-Traudes, M.: Subgroup discovery for election analysis: a case study in descriptive data mining. In: Proceedings of the 13th International Conference on Discovery Science (DS 2010), pp. 57–71 (2010)
Google Scholar
Hand, D.J., Adams, N.M., Bolton, R.J. (eds.): Pattern Detection and Discovery. LNCS (LNAI), vol. 2447. Springer, Heidelberg (2002). https://doi.org/10.1007/3-540-45728-3
Book MATH Google Scholar
Herrera, F., Carmona, C.J., González, P., Del Jesus, M.J.: An overview on subgroup discovery: foundations and applications. Knowl. Inf. Syst. 29(3), 495–525 (2011)
Article Google Scholar
Hüllermeier, E., Fürnkranz, J., Cheng, W., Brinker, K.: Label ranking by learning pairwise preferences. Artif. Intell. 172(16–17), 1897–1916 (2008)
Article MathSciNet MATH Google Scholar
Klösgen, W.: Explora: a multipattern and multistrategy discovery assistant. In: Advances in Knowledge Discovery and Data Mining, pp. 249–271 (1996)
Google Scholar
Lavrač, N., Kavšek, B., Flach, P., Todorovski, L.: Subgroup discovery with CN2-SD. J. Mach. Learn. Res. 5, 153–188 (2004)
MathSciNet Google Scholar
Leman, D., Feelders, A., Knobbe, A.: Exceptional model mining. In: Proceedings of the Joint European Conference on Machine Learning and Knowledge Discovery in Databases (ECMLPKDD 2008), pp. 1–16 (2008)
Google Scholar
Morik, K., Boulicaut, J.-F., Siebes, A. (eds.): Local Pattern Detection. LNCS (LNAI), vol. 3539. Springer, Heidelberg (2005). https://doi.org/10.1007/b137601
Book Google Scholar
Pieters, B.F., Knobbe, A., Džeroski, S.: Subgroup discovery in ranked data, with an application to gene set enrichment. In: Proceedings of the Preference Learning Workshop at Joint European Conference on Machine Learning and Knowledge Discovery in Databases (ECMLPKDD 2010), pp. 1–18 (2010)
Google Scholar
de Sá, C.R., Duivesteijn, W., Azevedo, P.J., Jorge, A.M., Soares, C., Knobbe, A.J.: Discovering a taste for the unusual: exceptional models for preference mining. Mach. Learn. 107(11), 1775–1807 (2018)
Article MathSciNet MATH Google Scholar
de Sá, C.R., Duivesteijn, W., Soares, C., Knobbe, A.: Exceptional preferences mining. In: Proceedings of the 19th International Conference on Discovery Science (DS 2016), pp. 3–18 (2016)
Google Scholar
de Sá, C.R., Soares, C., Knobbe, A.: Entropy-based discretization methods for ranking data. Inf. Sci. 329, 921–936 (2016)
Article Google Scholar
Schouten, R.M., Bueno, M.L., Duivesteijn, W., Pechenizkiy, M.: Mining sequences with exceptional transition behaviour of varying order using quality measures based on information-theoretic scoring functions. Data Min. Knowl. Disc. 36, 379–413 (2022)
Article MathSciNet MATH Google Scholar
Umek, L., Zupan, B.: Subgroup discovery in data sets with multi-dimensional responses. Intell. Data Anal. 15(4), 533–549 (2011)
Article Google Scholar
Wrobel, S.: An algorithm for multi-relational discovery of subgroups. In: Proceedings of PKDD, pp. 78–87 (1997)
Google Scholar
Ženko, B., Džeroski, S., Struyf, J.: Learning predictive clustering rules. In: Proceedings of the International Workshop on Knowledge Discovery in Inductive Databases, pp. 234–250 (2005)
Google Scholar
Zimmermann, A., De Raedt, L.: Cluster-grouping: from subgroup discovery to clustering. Mach. Learn. 77(1), 125–159 (2009)
Article MATH Google Scholar

Download references

Author information

Authors and Affiliations

Eindhoven University of Technology, Eindhoven, the Netherlands
Ruben Franciscus Adrianus Verhaegh, Jacco Johannes Egbert Kiezebrink, Frank Nusteling, Arnaud Wander André Rio, Márton Bendegúz Bendicsek, Wouter Duivesteijn & Rianne Margaretha Schouten

Authors

Ruben Franciscus Adrianus Verhaegh
View author publications
You can also search for this author in PubMed Google Scholar
Jacco Johannes Egbert Kiezebrink
View author publications
You can also search for this author in PubMed Google Scholar
Frank Nusteling
View author publications
You can also search for this author in PubMed Google Scholar
Arnaud Wander André Rio
View author publications
You can also search for this author in PubMed Google Scholar
Márton Bendegúz Bendicsek
View author publications
You can also search for this author in PubMed Google Scholar
Wouter Duivesteijn
View author publications
You can also search for this author in PubMed Google Scholar
Rianne Margaretha Schouten
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Wouter Duivesteijn or Rianne Margaretha Schouten .

Editor information

Editors and Affiliations

University of Montpellier, Montpellier, France
Poncelet Pascal
INRAE, Montpellier, France
Dino Ienco

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Verhaegh, R.F.A. et al. (2022). A Clustering-Inspired Quality Measure for Exceptional Preferences Mining—Design Choices and Consequences. In: Pascal, P., Ienco, D. (eds) Discovery Science. DS 2022. Lecture Notes in Computer Science(), vol 13601. Springer, Cham. https://doi.org/10.1007/978-3-031-18840-4_31

Download citation

DOI: https://doi.org/10.1007/978-3-031-18840-4_31
Published: 06 November 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-18839-8
Online ISBN: 978-3-031-18840-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

A Clustering-Inspired Quality Measure for Exceptional Preferences Mining—Design Choices and Consequences

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Discovering a taste for the unusual: exceptional models for preference mining

Exceptional Preferences Mining

Exceptionally monotone models—the rank correlation model class for Exceptional Model Mining

References

Author information

Authors and Affiliations

Corresponding authors

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

A Clustering-Inspired Quality Measure for Exceptional Preferences Mining—Design Choices and Consequences

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Discovering a taste for the unusual: exceptional models for preference mining

Exceptional Preferences Mining

Exceptionally monotone models—the rank correlation model class for Exceptional Model Mining

References

Author information

Authors and Affiliations

Corresponding authors

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation