Skip to main content
Log in

An overview on subgroup discovery: foundations and applications

  • Regular Paper
  • Published:
Knowledge and Information Systems Aims and scope Submit manuscript

Abstract

Subgroup discovery is a data mining technique which extracts interesting rules with respect to a target variable. An important characteristic of this task is the combination of predictive and descriptive induction. An overview related to the task of subgroup discovery is presented. This review focuses on the foundations, algorithms, and advanced studies together with the applications of subgroup discovery presented throughout the specialised bibliography.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Similar content being viewed by others

References

  1. Abudawood T, Flach P (2009) Evaluation measures for multi-class subgroup discovery. In: Proceedings of the European conference on machine learning and principles and practice of knowledge discovery in databases, vol 5781. Springer, LNAI, pp 35–50

  2. Agrawal R, Imieliski T, Swami A (1993) Mining association rules between sets of items in large databases. In: Proceedings of the 1993 ACM SIGMOD international conference on management of data. ACM Press, pp 207–216

  3. Agrawal R, Mannila H, Srikant R, Toivonen H, Verkamo AI (1996) Fast discovery of association rules. In: Fayyad U, Piatetsky-Shapiro G, Smyth P, Uthurusamy R (eds) Advances in knowledge discovery and data mining. AAAI Press, Cambridge, pp 307–328

    Google Scholar 

  4. Alcalá-Fdez J, Sánchez L, García S, del Jesus M, Ventura S, Garrell J, Otero J, Romero C, Bacardit J, Rivas V, Fernández J, Herrera F (2009) KEEL: a software tool to assess evolutionary algorithms for data mining problems. Soft Comput 13(3): 307–318

    Article  Google Scholar 

  5. Alcalá-Fdez J, Fernández A, Luengo J, Derrac J, García S, Sánchez L, Herrera F (2010) KEEL data-mining software tool: data set repository, integration of algorithms and experimental analysis framework. J Multiple Valued Logic Soft Comput (in press)

  6. Andrienko N, Andrienko G, Savinov A, Voss H, Wettschereck D (2001) Exploratory analysis of spatial data using interactive maps and data mining. Cartogr Geogr Inf Sci 28(3): 151–165

    Article  Google Scholar 

  7. Atmueller M, Seipel D (2009) Using declarative specifications of domain knowledge for descriptive data mining. In: Proceedings of the international conference on applications of declarative programming and knowledge management and the workshop on logic programming, vol 5437. Springer, LNAI, pp 149–164

  8. Atzmueller M, Lemmerich F (2009) Fast subgroup discovery for continuous target concepts. In: Proceedings of the 18th international symposium on methodologies for intelligent systems, vol 5722. Springer, LNAI, pp 35–44

  9. Atzmueller M, Puppe F (2005) Semi-automatic visual subgroup mining using VIKAMINE. J Univers Comput Sci 11(11): 1752–1765

    Google Scholar 

  10. Atzmueller M, Puppe F (2006) SD-Map—a fast algorithm for exhaustive subgroup discovery. In: Proceedings of the 17th European conference on machine learning and 10th European conference on principles and practice of knowledge discovery in databases, vol 4213. Springer, LNCS, pp 6–17

  11. Atzmueller M, Puppe F (2008) A case-based approach for characterization and analysis of subgroup patterns. Appl Intell 28(3): 210–221

    Article  Google Scholar 

  12. Atzmueller M, Puppe F (2009) Knowledge discovery enhanced with semantic and social information, Springer, chap A Knowledge-Intensive Approach for Semi-Automatic Causal Subgroup Discovery, pp 19–36

  13. Atzmueller M, Puppe F, Buscher HP (2004) Towards knowledge-intensive subgroup discovery. In: Proceedings of the Lernen-Wissensentdeckung-Adaptivität-Fachgruppe Maschinelles Lernen, pp 111–117

  14. Atzmueller M, Baumeister J, Puppe F (2006) Introspective subgroup analysis for interactive knowledge refinement. In: Proceedings of the 9th international Florida artificial intelligence research society conference. AAAI Press, pp 402–407

  15. Atzmueller M, Puppe F, Buscher HP (2009) A semi-automatic approach for confounding-aware subgroup discovery. Int J Artif Intell Tools 18(1): 81–98

    Article  Google Scholar 

  16. Barrera V, López B, Meléndez J, Sánchez J (2008) Voltage sag source location from extracted rules using subgroup discovery. Front Artif Intell Appl 184: 225–235

    Google Scholar 

  17. Bay S, Pazzani M (2001) Detecting group differences: mining contrast sets. Data Mining Knowl Discov 5: 213–246

    Article  MATH  Google Scholar 

  18. Berlanga FJ, del Jesus MJ, González P, Herrera F, Mesonero M (2006) Multiobjective evolutionary induction of subgroup discovery fuzzy rules: a case study in marketing. In: Proceedings of the 6th industrial conference on data mining, vol 4065. Springer, LNCS, pp 337–349

  19. Blumenstock A, Schweiggert F, Mueller M, Lanquillon C (2009) Rule cubes for casual investigations. Knowl Inf Syst 18(1): 109–132

    Article  Google Scholar 

  20. Boley M, Grosskreutz H (2009) Non-redundant subgroup discovery using a closure system. In: Proceedings of the European conference on machine learning and principles and practice of knowledge discovery in databases, vol 5781. Springer, LNAI, pp 179–194

  21. Box G, Jenkins G, Reinsel G (2008) Time series analysis: forecasting and control, 4th edn. Wiley, New York

    MATH  Google Scholar 

  22. Brin S, Motwani R, Ullman JD, Tsur S (1997) Dynamic itemset counting and implication rules for market basket data. In: Proceedings of the 1997 ACM SIGMOD international conference on management of data. ACM Press, pp 255–264

  23. Bringmann B, Zimmermann A (2009) One in a million: picking the right patterns. Knowl Inf Syst 18(1): 61–81

    Article  Google Scholar 

  24. Cano JR, García S, Herrera F (2008) Subgroup discover in large size data sets preprocessed using stratified instance selection for increasing the presence of minority classes. Patt Recognit Lett 29: 2156–2164

    Article  Google Scholar 

  25. Cano JR, Herrera F, Lozano M, García S (2008) Making CN2-SD subgroup discovery algorithm scalable to large size data sets using instance selection. Expert Syst Appl 35: 1949–1965

    Article  Google Scholar 

  26. Carmona CJ, González P, del Jesus MJ, Herrera F (2009a) An analysis of evolutionary algorithms with different types of fuzzy rules in subgroup discovery. In: Proceedings of the IEEE international conference on fuzzy systems, pp 1706–1711

  27. Carmona CJ, González P, del Jesus MJ, Herrera F (2009b) Non-dominated multi-objective evolutionary algorithm based on fuzzy rules extraction for subgroup discovery. In: Proceedings of the 4th international conference on hybrid artificial intelligence systems, vol 5572. Springer, LNAI, pp 573–580

  28. Carmona CJ, González P, del Jesus MJ, Herrera F (2010a) NMEEF-SD: Non-dominated multi-objective evolutionary algorithm for extracting fuzzy rules in subgroup discovery. IEEE Trans Fuzzy Syst 18(5): 958–970

    Article  Google Scholar 

  29. Carmona CJ, González P, del Jesus MJ, Navío M, Jiménez L (2010b) Evolutionary fuzzy rule extraction for subgroup discovery in a psychiatric emergency department. Soft Comput Special Issue on “Genetic Fuzzy Systems” (in press)

  30. Carmona CJ, González P, del Jesus MJ, Romero C, Ventura S (2010c) Evolutionary algorithms for subgroup discovery applied to e-learning data. In: Proceedings of the IEEE international education engineering, pp 983–990

  31. Cherkassky V, Mulier FM (2007) Learning from data: concepts, theory and methods, 2nd edn. IEEE Press, New York

    Book  MATH  Google Scholar 

  32. Clark P, Boswell R (1991) Rule Induction with CN2: some recent improvements. In: Proceedings of the 5th European conference on machine learning, vol 482. Springer, LNCS, pp 151–163

  33. Clark P, Niblett T (1989) The CN2 induction algorithm. Mach Learn 3: 261–283

    Google Scholar 

  34. Cordón O, Herrera F, Hoffmann F, Magdalena L (2001) Genetic fuzzy systems: evolutionary tuning and learning of fuzzy knowledge bases. World Scientific, Singapore

    MATH  Google Scholar 

  35. Deb K, Pratap A, Agrawal S, Meyarivan T (2002) A fast and elitist multiobjective genetic algorithm: NSGA-II. IEEE Trans Evol Comput 6(2): 182–197

    Article  Google Scholar 

  36. Demsar J, Zupan B, Leban G (2004) White Paper (http://www.ailabsi/orange)

  37. Domingo C, Gavaldá R, Watanabe O (2002) Adaptive sampling methods for scaling up knowledge discovery algorithms. Data Mining Knowl Discov 6(2): 131–152

    Article  MATH  Google Scholar 

  38. Dong G, Li J (1999) Efficient mining of emerging patterns: discovering trends and differences. In: Proceedings of the 5th ACM SIGKDD international conference on knowledge discovery and data mining. ACM Press, pp 43–52

  39. Dubois D, Prade H, Sudkamp T (2005) On the representation, measurement, and discovery of fuzzy associations. IEEE Trans Fuzzy Syst 13: 250–262

    Article  Google Scholar 

  40. Fayyad UM, Irani KB (1993) Multi-interval discretization of continuous-valued attributes for classification learning. In: 13th International joint conference on artificial intelligence, pp 1022–1029

  41. Fayyad UM, Piatetsky-Shapiro G, Smyth P (1996) From data mining to knowledge discovery: an overview. In: Advances in knowledge discovery and data mining. AAAI/MIT Press, pp 1–34

  42. Flach PA, Gamberger D (2001) Subgroup evaluation and decision support for a direct mailing marketing problem. In: Proceedings of the 12th European conference on machine learning and 5th European conference on principles and practice of knowledge discovery in databases, pp 45–56

  43. Gamberger D, Lavrac N (2002) Expert-guided subgroup discovery: methodology and application. J Artif Intell Res 17: 501–527

    MATH  Google Scholar 

  44. Gamberger D, Lavrac N (2002) Generating actionable knowledge by expert-guided subgroup discovery. In: Proceedings of the 6th European conference on principles and practice of knowledge discovery in databases, vol 2431. Springer, LNCS, pp 163–174

  45. Gamberger D, Lavrac N (2003) Active subgroup mining: a case study in coronary heart disease risk group detection. Artif Intell Med 28(1): 27–57

    Article  Google Scholar 

  46. Gamberger D, Lavrac N (2004) Avoiding data overfitting in scientific discovery: experiments in functional genomics. In: Proceedings of the 16th European conference on artificial intelligence. IOS Press, pp 470–474

  47. Gamberger D, Lavravc N (2007) Supporting factors in descriptive analysis of brain ischaemia. In: Proceedings of the 11th conference on artificial intelligence in medicine, vol 4594. Springer, LNCS, pp 155–159

  48. Gamberger D, Lavrac N, Wettschereck D (2002) Subgroup visualization: a method and application to population screening. In: Proceedings of the 2nd international workshop on integration and collaboration aspects of data mining, decision support and meta-learning, pp 35–40

  49. Gamberger D, Smuc T, Lavrac N (2003) Subgroup discovery: on-line data minig server and its application. In: Proceedings of the 5th international conference on simulations in biomedicine, pp 433–442

  50. Gamberger D, Lavrac N, Zelezny F, Tolar J (2004) Induction of comprehensible models for gene expression datasets by subgroup discovery methodology. J Biomed Inform 37(4): 269–284

    Article  Google Scholar 

  51. Gamberger D, Krstacic A, Krstatic G, Lavrac N, Sebag M (2005) Data analysis based on subgroup discovery: experiments in brain ischaemia domain. In: Proceedings of the 10th international workshop on intelligent data analysis in medicine and pharmacology, pp 52–56

  52. Gamberger D, Lavrac N, Krstaic A, Krstaic G (2007) Clinical data analysis based on iterative subgroup discovery: experiments in brain ischaemia data analysis. Appl Intell 27(3): 205–217

    Article  Google Scholar 

  53. Goldberg DE (1989) Genetic algorithms in search, optimization and machine learning. Addison-Wesley Longman Publishing Co, Reading

    MATH  Google Scholar 

  54. Grosskreutz H, Rueping S (2009) On subgroup discovery in numerical domains. Data Mining Knowl Discov 19(2): 210–216

    Article  Google Scholar 

  55. Grosskreutz H, Rueping S, Wrobel S (2008) Tight optimistic estimates for fast subgroup discovery. In: European conference on machine learning and principles and practice of knowledge discovery in databases, pp 440–456

  56. Han J, Pei J, Yin Y (2000) Mining frequent patterns without candidate generation. In: Proceedings of the 2000 ACM SIGMOD international conference on management of data. ACM Press, pp 1–12

  57. Herrera F (2008) Genetic fuzzy systems: taxomony, current research trends and prospects. Evol Intell 1: 27–46

    Article  Google Scholar 

  58. Holland JH (1975) Adaptation in natural and artificial systems. University of Michigan Press, Ann Arbor

    Google Scholar 

  59. del Jesus MJ, González P, Herrera F (2007) Fuzzy sets and their extensions: representation, aggregation and models, vol 220, Springer, chap Subgroup Discovery with Linguistic Rules, pp 411–430

  60. del Jesus MJ, González P, Herrera F (2007) Multiobjective genetic algorithm for extracting subgroup discovery fuzzy rules. In: Proceedings of the IEEE symposium on computational intelligence in multicriteria decision making. IEEE Press, pp 50–57

  61. del Jesus MJ, González P, Herrera F, Mesonero M (2007) Evolutionary fuzzy rule induction process for subgroup discovery: a case study in marketing. IEEE Trans Fuzzy Syst 15(4): 578–592

    Article  Google Scholar 

  62. Jorge AM, Pereira F, Azevedo PJ (2006) Visual interactive subgroup discovery with numerical properties of interest. In: Proceedings of the 9th international conference on discovery science, vol 4265. Springer, LNAI, pp 301–305

  63. Jovanoski V, Lavrac N (2001) Classification rule learning with APRIORI-C. In: 10th Portuguese conference on artificial intelligence on progress in artificial intelligence, knowledge extraction, multi-agent systems, logic programming and constraint solving, vol 2258. Springer, LNCS, pp 44–51

  64. Kavsek B, Lavrac N (2004) Analysis of example weighting in subgroup discovery by comparison of three algorithms on a real-life data set. In: Proceedings of the 15th European conference on machine learning and 8th European conference on principles and practice of knowledge discovery in databases, pp 64–76

  65. Kavsek B, Lavrac N (2004) Using subgroup discovery to analyze the UK traffic data. Metodoloski Zvezki 1(1): 249–264

    Google Scholar 

  66. Kavsek B, Lavrac N (2006) APRIORI-SD: adapting association rule learning to subgroup discovery. Appl Artif Intell 20: 543–583

    Article  Google Scholar 

  67. Kavsek B, Lavrac N, Bullas JC (2002) Rule induction for subgroup discovery: a case study in mining UK traffic accident data. In: International multi-conference on information society, pp 127–130

  68. Kavsek B, Lavrac N, Jovanoski V (2003) APRIORI-SD: adapting association rule learning to subgroup discovery. In: Proceedings of the 5th international symposium on intelligent data analysis, vol 2810. Springer, LNCS, pp 230–241

  69. Kavsek B, Lavrac N, Todorovski L (2004) ROC analysis of example weighting in subgroup discovery. In: Proceedings of the 1st workshop on international workshop ROC analysis in artificial intelligence, pp 55–60

  70. Kloesgen W (1996) Explora: a multipattern and multistrategy discovery assistant. In: Advances in Knowledge discovery and data mining. American Association for Artificial Intelligence, pp 249–271

  71. Kloesgen W (1999) Applications and research problems of subgroup mining. In: Proceedings of the 11th international symposium on foundations of intelligent systems. Springer, pp 1–15

  72. Kloesgen W, May M (2002) Census data mining—an application. In: Proceedings of the 6th European conference on principles of data mining and knowledge discovery, pp 65–79

  73. Kloesgen W, May M (2002) Spatial subgroup mining integrated in an object-relational spatial database. In: Proceedings of the 6th European conference on principles of data mining and knowledge discovery, pp 275–286

  74. Kloesgen W, Zytkow J (2002) Handbook of data mining and knowledge discovery, Oxford

  75. Kloesgen W, May M, Petch J (2003) Mining census data for spatial effects on mortality. Intell Data Anal 7: 521–540

    Google Scholar 

  76. Kralj-Novak P, Lavrac N, Zupan B, Gamberger D (2005) Experimental comparison of three subgroup discovery algorithms: analysing brain ischemia data. In: Proceedings of the 8th international multiconference information society, pp 220–223

  77. Kralj-Novak P, Lavrac N, Gamberger D, Krstacic A (2009) CSM-SD: methodology for contrast set mining through subgroup discovery. J Biomed Inform 42(1): 113–122

    Article  Google Scholar 

  78. Kralj-Novak P, Lavrac N, Webb GI (2009) Supervised descriptive rule discovery: a unifying survey of constrast set, emerging pateern and subgroup mining. J Mach Learn Res 10: 377–403

    Google Scholar 

  79. Lambach D, Gamberger D (2008) Temporal analysis of political instability through descriptive subgroup discovery. Confl Manag Peace Sci 25: 19–32

    Article  Google Scholar 

  80. Lavrac N (2005) Subgroup discovery techniques and applications. In: Proceedings of the 9th Pacific-Asia conference on knowledge discovery and data mining, vol 3518. Springer, LNCS, pp 2–14

  81. Lavrac N, Flach PA, Zupan B (1999) Rule evaluation measures: a unifying view. In: Proceedings of the 9th international workshop on inductive logic programming, vol 1634. Springer, LNCS, pp 174–185

  82. Lavrac N, Flach P, Kavsek B, Todorovski L (2002) Rule induction for subgroup discovery with CN2-SD. In: Proceedings of the 2nd international workshop on integration and collaboration aspects of data mining, decision support and meta-learning, pp 77–87

  83. Lavrac N, Zelezny F, Flach PA (2003) RSD: relational subgroup discovery through first-order feature construction. In: Proceedings of the 12th international conference inductive logic programming, vol 2583. Springer, LNCS, pp 149–165

  84. Lavrac N, Cestnik B, Gamberger D, Flach PA (2004) Decision support through subgroup discovery: three case studies and the lessons learned. Mach Learn 57(1–2): 115–143

    Article  MATH  Google Scholar 

  85. Lavrac N, Kavsek B, Flach PA, Todorovski L (2004) Subgroup discovery with CN2-SD. J Mach Learn Res 5: 153–188

    MathSciNet  Google Scholar 

  86. Lavrac N, Zelezny F, Dzeroski S (2005) Local patterns: theory and practice of constraint-based relational subgroup discovery. In: International seminar on local pattern detection, vol 3539. Springer, LNCS, pp 71–88

  87. Lavrac N, Kralj-Novak P, Mozetic I, Podpecan V, Motaln H, Petek M, Gruder K (2009) Semantic subgroup discovery: using ontologies in microarray data analysis. In: Proceedings of the 31st annual international conference of the IEEE engineering in medicine and biology society. IEEE Press, pp 5613–5616

  88. Liu H, Hussain F, Tan C, Dash M (2002) Discretization: an enabling technique. Data mining Knowl Discov 6: 393–423

    Article  MathSciNet  Google Scholar 

  89. Lucas JP, Jorge AP, Pereira F, Pernas AM, Machado AA (2007) A tool for interactive subgroup discovery using distribution rules. In: Proceedings of the 13th Portuguese conference on artificial intelligence, vol 4874. Springer, LNAI, pp 426–436

  90. May M, Ragia L (2002) Spatial subgroup discovery applied to the analysis of vegetation data. In: Proceedings of the 4th international conference on practical aspects of knowledge management, vol 2569. Springer, LNCS, pp 49–61

  91. Moreland K, Truemper K (2009) Discretization of target attributes for subgroup discovery. In: Proceedings of the 6th international conference machine learning and data mining in pattern recognition, vol 5632. Springer, LNAI, pp 44–52

  92. Mueller M, Rosales R, Steck H, Krishnan S, Rao B, Kramer S (2009) Subgroup discovery for test selection: a novel approach and its application to breast cancer diagnosis. In: Proceedings of the 8th international symposium on intelligent data analysis, vol 5772. Springer, LNCS, pp 119–130

  93. Noda E, Freitas AA, Lopes HS (1999) Discovering interesting prediction rules wih a genetic algorithm. IEEE Congr Evol Comput 2: 1322–1329

    Google Scholar 

  94. Richardson M, Domingos P (2003) Learning with knowledge from multiple experts. In: Proceedings of the 20th international conference on machine learning. AAAI Press, pp 624–631

  95. Romero C, Ventura S (2007) Educational data mining: a survey from 1995 to 2005. Expert Syst Appl 33(1): 135–146

    Article  Google Scholar 

  96. Romero C, González P, Ventura S, del Jesus MJ, Herrera F (2009) Evolutionary algorithm for subgroup discovery in e-learning: a practical application using Moodle data. Expert Syst Appl 36: 1632–1644

    Article  Google Scholar 

  97. Rueping S (2009) Ranking interesting subgroups. In: Proceedings of the 26th international conference on machine learning, pp 913–920

  98. Scheffer T, Wrobel S (2002) Finding the most interesting patterns in a database quickly by using sequential sampling. J Mach Learn Res 3: 833–862

    MathSciNet  Google Scholar 

  99. Schmidt J, Hapfelmeier A, Mueller M, Perneczky R, Kurz A, Drzezga A, Kramer S (2010) Interpreting PET scans by structured patient data: a data mining case study in dementia research. Knowl Inf Syst 24(1): 149–170

    Article  Google Scholar 

  100. Scholz M (2005) Knowledge-based sampling for subgroup discovery. In: International seminar on local pattern detection, vol 3539. Springer, LNAI, pp 171–189

  101. Siebes A (1995) Data Surveying: foundations of an inductive query language. In: Proceedings of the 1st international conference on knowledge discovery and data mining. AAAI Press, pp 269–274

  102. Bäck T, Fogel D, Michalewicz Z (1997) Handbook of evolutionary computation. Oxford University Press, New York

    Book  MATH  Google Scholar 

  103. Tan PN, Steinbach M, Kumar V (2006) Introduction to data mining. Pearson

  104. Trajkovski I, Zelezny F, Tolar J, Lavrac N (2006) Relational descriptive analysis of gene expression data. In: Proceedings of the 3rd starting artificial intelligence researchers. IOS Press, pp 184–195

  105. Trajkovski I, Zelezny F, Tolar J, Lavrac N (2006) Relational subgroup discovery for descriptive analysis of microarray data. In: Proceedings of the 2nd international symposium in computational life sciences, vol 4216. Springer, LNCS, pp 86–96

  106. Trajkovski I, Zelezny F, Lavrac N, Tolar J (2008) Learning relational descriptions of differentially expressed gene groups. IEEE Trans Syst Man Cybern C 38(1): 16–25

    Article  Google Scholar 

  107. Umek L, Zupan B, Toplak M, Morin A, Chauchat JH, Makovec G, Smrke D (2009) Subgroup discovery in data sets with multi-dimensional responses: a method and a case study in traumatology. In: Proceedings of the 12th conference on artificial intelligence in medicine, vol 5651. Springer, LNAI, pp 265–274

  108. Wrobel S (1997) An algorithm for multi-relational discovery of subgroups. In: Proceedings of the 1st European symposium on principles of data mining and knowledge discovery, vol 1263. Springer, LNAI, pp 78–87

  109. Wrobel S (2001) Inductive logic programming for knowledge discovery in databases. Springer, chap Relational Data Mining, pp 74–101

  110. Wu X, Kumar V, Ross-Quinlan J, Ghosh J, Yang Q, Motoda H, McLachlan GJ, Ng A, Liu B, Yu PS, Zhou ZH, Steinbach M, Hand DJ, Steinberg D (2009) Top 10 algorithms in data mining. Knowl Inf Syst 14(1): 1–37

    Article  Google Scholar 

  111. Zadeh LA (1975) The concept of a linguistic variable and its applications to approximate reasoning. Parts I, II, III. Inf Sci 8–9:199–249, 301–357, 43–80

  112. Zelezny F, Lavrac N (2006) Propositionalization-based relational subgroup discovery with RSD. Machine Learning 62: 33–63

    Article  Google Scholar 

  113. Zelezny F, Lavrac N, Dzeroski S (2003) Constraint-based relational subgroup discovery. In: Proceedings of the 2nd workshop on multi-relational data mining, pp 135–150

  114. Zelezny F, Lavrac N, Dzeroski S (2003) Using constraints in relational subgroup discovery. In: International conference on methodology and statistics, pp 78–81

  115. Zelezny F, Tolar J, Lavrac N, Stepankova O (2005) Relational subgroup discovery for gene expression data mining. In: Proceedings of the 3rd European medical and biological engineering conference

  116. Zembowicz R, Zytkow JM (1996) From contingency tables to various forms of knowledge in databases. In: Advances in knowledge discovery and data mining. AAAI/MIT Press, pp 329–349

  117. Zimmerman A, de Raedt L (2009) Cluster-grouping: from subgroup discovery to clustering. Mach Learn 77(1): 125–159

    Article  Google Scholar 

  118. Zitzler E, Laumanns M, Thiele L (2002) SPEA2: Improving the strength pareto evolutionary algorithm for multiobjective optimization. In: International congress on evolutionary methods for design optimization and control with applications to industrial problems, pp 95–100

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Cristóbal José Carmona.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Herrera, F., Carmona, C.J., González, P. et al. An overview on subgroup discovery: foundations and applications. Knowl Inf Syst 29, 495–525 (2011). https://doi.org/10.1007/s10115-010-0356-2

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10115-010-0356-2

Keywords

Navigation