Skip to main content

An Optimization Approach for Optimizing PRIM’s Randomly Generated Rules Using the Genetic Algorithm

  • Conference paper
  • First Online:
Optimization and Learning (OLA 2023)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1824))

Included in the following conference series:

  • 323 Accesses

Abstract

The Patient Rule Induction Method (PRIM) is a bump hunting algorithm that generates a big number of rules in high dimensional data. Despite the high accuracy it provides, in this case it lacks of interpretability when the set of rules is big. To address it, we aim, in this paper to optimize the number of rules using Genetic Algorithm (GA) by formulating a combinatorial optimization problem to minimize the ruleset and maximize the performance of the ruleset. We applied this approach on a real-life dataset involving slope stability, one of the most important subjects in civil engineering, by choosing random feature spaces to generate the rules. We also set a performance score that balances between the confidence and the support of the rules in a ruleset and that has to be maximized to select a ruleset as a potential candidate. The results obtained show that optimizing with GA gives a more powerful set of rules that eases the interpretation. However, if the goal of the study is to detect small groups, we should minimize the performance of the ruleset by looking at the weakest groups, hence with the lowest support.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 79.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 99.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Goldberg, D.E.: Genetic Algorithms in Search, Optimization and Machine Learning. Addison-Wesley Longman Publishing Co., Inc., Boston (1989)

    MATH  Google Scholar 

  2. Nassih, R., Berrado, A.: Potential for PRIM based classification: a literature review. In: The Third European International Conference on Industrial Engineering and Operations Managements in Pilsen, Czech Republic, p. 7 (2019)

    Google Scholar 

  3. Friedman, J.H., Fisher, N.I.: Bump hunting in high-dimensional data. Stat. Comput. 9(2), 123–143 (1999)

    Article  Google Scholar 

  4. Sastry, K., Goldberg, D., Kendall, G.: Genetic algorithms. In: Burke, E.K., Kendall, G. (eds.) Search Methodologies, pp. 97–125. Springer, Boston, MA (2005). https://doi.org/10.1007/0-387-28356-0_4

  5. Ruder, S.: An overview of gradient descent optimization algorithms. arXiv preprint arXiv:1609.04747 (2016)

  6. Sarath, K.N.V.D., Ravi, V.: Association rule mining using binary particle swarm optimization. Eng. Appl. Artif. Intell. 26(8), 1832–1840 (2013)

    Article  Google Scholar 

  7. Nedic, V., Cvetanovic, S., Despotovic, D., Despotovic, M., Babic, S.: Data mining with various optimization methods. Expert Syst. Appl. 41(8), 3993–3999 (2014)

    Article  Google Scholar 

  8. Minaei-Bidgoli, B., Punch, W.F.: Using genetic algorithms for data mining optimization in an educational web-based system. In: Cantú-Paz, E., et al. (eds.) GECCO 2003. LNCS, vol. 2724, pp. 2252–2263. Springer, Heidelberg (2003). https://doi.org/10.1007/3-540-45110-2_119

    Chapter  MATH  Google Scholar 

  9. Liu, Y., Chung, Y.Y.: Mining cancer data with discrete particle swarm optimization and rule pruning. In: 2011 IEEE International Symposium on IT in Medicine and Education, vol. 2, pp. 31–34. IEEE (2011)

    Google Scholar 

  10. Alatas, B., Akin, E.: Multi-objective rule mining using a chaotic particle swarm optimization algorithm. Knowl.-Based Syst. 22(6), 455–460 (2009)

    Google Scholar 

  11. Bottou, L., Curtis, F.E., Nocedal, J.: Optimization methods for large-scale machine learning. SIAM Rev. 60(2), 223–311 (2018)

    Article  MathSciNet  MATH  Google Scholar 

  12. Kausar, N., Palaniappan, S., Samir, B.B., Abdullah, A., Dey, N.: Systematic analysis of applied data mining based optimization algorithms in clinical attribute extraction and classification for diagnosis of cardiac patients. In: Hassanien, A.-E., Grosan, C., Fahmy Tolba, M. (eds.) Applications of Intelligent Optimization in Biology and Medicine. ISRL, vol. 96, pp. 217–231. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-21212-8_9

    Chapter  Google Scholar 

  13. Nassih, R., Berrado, A.: Towards a patient rule induction method-based classifier. In: 2019 1st International Conference on Smart Systems and Data Science (ICSSD), pp. 1–5. IEEE (2019)

    Google Scholar 

  14. Kaveh, A., Hamze-Ziabari, S.M., Bakhshpoori, T.: Soft computing-based slope stability assessment: a comparative study. Geomech. Eng. 14(3), 257–269 (2018)

    Google Scholar 

  15. Kennedy, J., Eberhart, R.: Particle swarm optimization. In: Proceedings of ICNN 1995-International Conference on Neural Networks, vol. 4, pp. 1942–1948. IEEE (1995). https://doi.org/10.1109/ICNN.1995.488968

  16. Herrera, F., Carmona, C.J., González, P., Del Jesus, M.J.: An overview on subgroup discovery: foundations and applications. Knowl. Inf. Syst. 29, 495–525 (2011)

    Article  Google Scholar 

  17. Atzmueller, M. Subgroup discovery. Wiley Interdiscip. Rev. Data Min. Knowl. Discov. 5(1), 35–49 (2015)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Rym Nassih .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Nassih, R., Berrado, A. (2023). An Optimization Approach for Optimizing PRIM’s Randomly Generated Rules Using the Genetic Algorithm. In: Dorronsoro, B., Chicano, F., Danoy, G., Talbi, EG. (eds) Optimization and Learning. OLA 2023. Communications in Computer and Information Science, vol 1824. Springer, Cham. https://doi.org/10.1007/978-3-031-34020-8_23

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-34020-8_23

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-34019-2

  • Online ISBN: 978-3-031-34020-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics