Skip to main content

Deterministic Extraction of Compact Sets of Rules for Subgroup Discovery

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9375))

Abstract

This work presents a novel deterministic method to obtain rules for Subgroup Discovery tasks. It makes no previous discretization for the numeric attributes, but their conditions are obtained dynamically. To obtain the final rules, the AUC value of a rule has been used for selecting them. An experimental study supported by appropriate statistical tests was performed, showing good results in comparison with the classic deterministic algorithms CN2-SD and APRIORI-SD. The best results were obtained in the number of induced rules, where a significant reduction was achieved. Also, better coverage and less number of attributes were obtained in the comparison with CN2-SD.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Bay, S.D., Pazzani, M.J.: Detecting group differences. Mining contrast sets. Data Min. Knowl. Discov. 5(3), 213–246 (2001)

    Article  MATH  Google Scholar 

  2. Dong, G., Li, J.: Efficient mining of emerging patterns. Discovering trends and differences. In: Proceedings of the 5th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 43–52 (1999)

    Google Scholar 

  3. Klösgen, W.: Explora: A multipattern and multistrategy discovery assistant. Advances in Knowledge Discovery and Data Mining, pp. 249–271. American Association for Artificial Intelligence, Cambridge (1996)

    Google Scholar 

  4. Wrobel, S.: An algorithm for multi-relational discovery of subgroups. In: Proceedings of the 1st European Conference on Principles of Data Mining and Knowledge Discovery (PKDD-97), pp 78–87 (1997)

    Google Scholar 

  5. Novak, P.N., Lavrač, N., Webb, G.: Supervised descriptive rule discovery: a unifying survey of contrast set, emerging pattern and subgroup mining. J. Mach. Learn. Res. 10, 377–403 (2009)

    MATH  Google Scholar 

  6. Lavrač, N., Kavsek, B., Flach, P.A., Todorovski, L.: Subgroup discovery with CN2-SD. J. Mach. Learn. Res. 5, 153–188 (2004)

    MathSciNet  Google Scholar 

  7. Kavsek, B., Lavrač, N.: APRIORI-SD: adapting association rule learning to subgroup discovery. Appl. Artif. Intell. 20(7), 543–583 (2006)

    Article  Google Scholar 

  8. Atzmüller, M., Puppe, F.: SD-Map – a fast algorithm for exhaustive subgroup discovery. In: Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds.) PKDD 2006. LNCS (LNAI), vol. 4213, pp. 6–17. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  9. Carmona, C.J., González, P., del Jesus, M.J., Herrera, F.: NMEEF-SD: non-dominated multi-objective evolutionary algorithm for extracting fuzzy rules in subgroup discovery. IEEE Trans. Fuzzy Syst. 18(5), 958–970 (2010)

    Article  Google Scholar 

  10. Rodríguez, D., Ruiz, R., Riquelme, J.C., Aguilar-Ruiz, J.S.: Searching for rules to detect defective modules: a subgroup discovery approach. Inf. Sci. 191, 14–30 (2012)

    Article  Google Scholar 

  11. Carmona, C.J., Ruiz-Rodado, V., del Jesus, M.J., Weber, A., Grootveld, M., González, P., Elizondo, D.: A fuzzy genetic programming-based algorithm for subgroup discovery and the application to one problem of pathogenesis of acute sore throat conditions in humans. Inf. Sci. 298, 180–197 (2015)

    Article  Google Scholar 

  12. Grosskreutz, H., Rüping, S.: On subgroup discovery in numerical domains. Data Min. Knowl. Discov. 19(2), 210–226 (2009)

    Article  MathSciNet  Google Scholar 

  13. Fayyad, U., Irani, K.B.: Multi-interval discretization of continuous-valued attributes for classification learning. In: 13th International Joint Conference on Artificial Intelligence, pp. 1022–1029 (1999)

    Google Scholar 

  14. Domínguez-Olmedo, J.L., Mata, J., Pachón, V., Maña, M.J.: A deterministic approach to association rule mining without attribute discretization. In: Snasel, V., Platos, J., El-Qawasmeh, E. (eds.) ICDIPC 2011, Part I. CCIS, vol. 188, pp. 140–150. Springer, Heidelberg (2011)

    Chapter  Google Scholar 

  15. Lichman, M.: UCI Machine Learning Repository. School of Information and Computer Science, University of California, Irvine, CA (2013). http://archive.ics.uci.edu/ml

  16. Alcalá-Fdez, J., Fernandez, A., Luengo, J., Derrac, J., García, S., Sánchez, L., Herrera, F.: KEEL data-mining software tool. J. Multiple-Valued Logic Soft Comput. 17, 255–287 (2011)

    Google Scholar 

  17. Demšar, J.: Statistical comparisons of classifiers over multiple data sets. J. Mach. Learn. Res. 7, 1–30 (2006)

    MathSciNet  MATH  Google Scholar 

Download references

Acknowledgments

This work was partially funded by the Regional Government of Andalusia (Junta de Andalucía), grant number TIC-7629.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Juan L. Domínguez-Olmedo .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Domínguez-Olmedo, J.L., Vázquez, J.M., Pachón, V. (2015). Deterministic Extraction of Compact Sets of Rules for Subgroup Discovery. In: Jackowski, K., Burduk, R., Walkowiak, K., Wozniak, M., Yin, H. (eds) Intelligent Data Engineering and Automated Learning – IDEAL 2015. IDEAL 2015. Lecture Notes in Computer Science(), vol 9375. Springer, Cham. https://doi.org/10.1007/978-3-319-24834-9_17

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-24834-9_17

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-24833-2

  • Online ISBN: 978-3-319-24834-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics