Abstract
The Patient Rule Induction Method (PRIM) is a bump hunting algorithm that generates a big number of rules in high dimensional data. Despite the high accuracy it provides, in this case it lacks of interpretability when the set of rules is big. To address it, we aim, in this paper to optimize the number of rules using Genetic Algorithm (GA) by formulating a combinatorial optimization problem to minimize the ruleset and maximize the performance of the ruleset. We applied this approach on a real-life dataset involving slope stability, one of the most important subjects in civil engineering, by choosing random feature spaces to generate the rules. We also set a performance score that balances between the confidence and the support of the rules in a ruleset and that has to be maximized to select a ruleset as a potential candidate. The results obtained show that optimizing with GA gives a more powerful set of rules that eases the interpretation. However, if the goal of the study is to detect small groups, we should minimize the performance of the ruleset by looking at the weakest groups, hence with the lowest support.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Goldberg, D.E.: Genetic Algorithms in Search, Optimization and Machine Learning. Addison-Wesley Longman Publishing Co., Inc., Boston (1989)
Nassih, R., Berrado, A.: Potential for PRIM based classification: a literature review. In: The Third European International Conference on Industrial Engineering and Operations Managements in Pilsen, Czech Republic, p. 7 (2019)
Friedman, J.H., Fisher, N.I.: Bump hunting in high-dimensional data. Stat. Comput. 9(2), 123–143 (1999)
Sastry, K., Goldberg, D., Kendall, G.: Genetic algorithms. In: Burke, E.K., Kendall, G. (eds.) Search Methodologies, pp. 97–125. Springer, Boston, MA (2005). https://doi.org/10.1007/0-387-28356-0_4
Ruder, S.: An overview of gradient descent optimization algorithms. arXiv preprint arXiv:1609.04747 (2016)
Sarath, K.N.V.D., Ravi, V.: Association rule mining using binary particle swarm optimization. Eng. Appl. Artif. Intell. 26(8), 1832–1840 (2013)
Nedic, V., Cvetanovic, S., Despotovic, D., Despotovic, M., Babic, S.: Data mining with various optimization methods. Expert Syst. Appl. 41(8), 3993–3999 (2014)
Minaei-Bidgoli, B., Punch, W.F.: Using genetic algorithms for data mining optimization in an educational web-based system. In: Cantú-Paz, E., et al. (eds.) GECCO 2003. LNCS, vol. 2724, pp. 2252–2263. Springer, Heidelberg (2003). https://doi.org/10.1007/3-540-45110-2_119
Liu, Y., Chung, Y.Y.: Mining cancer data with discrete particle swarm optimization and rule pruning. In: 2011 IEEE International Symposium on IT in Medicine and Education, vol. 2, pp. 31–34. IEEE (2011)
Alatas, B., Akin, E.: Multi-objective rule mining using a chaotic particle swarm optimization algorithm. Knowl.-Based Syst. 22(6), 455–460 (2009)
Bottou, L., Curtis, F.E., Nocedal, J.: Optimization methods for large-scale machine learning. SIAM Rev. 60(2), 223–311 (2018)
Kausar, N., Palaniappan, S., Samir, B.B., Abdullah, A., Dey, N.: Systematic analysis of applied data mining based optimization algorithms in clinical attribute extraction and classification for diagnosis of cardiac patients. In: Hassanien, A.-E., Grosan, C., Fahmy Tolba, M. (eds.) Applications of Intelligent Optimization in Biology and Medicine. ISRL, vol. 96, pp. 217–231. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-21212-8_9
Nassih, R., Berrado, A.: Towards a patient rule induction method-based classifier. In: 2019 1st International Conference on Smart Systems and Data Science (ICSSD), pp. 1–5. IEEE (2019)
Kaveh, A., Hamze-Ziabari, S.M., Bakhshpoori, T.: Soft computing-based slope stability assessment: a comparative study. Geomech. Eng. 14(3), 257–269 (2018)
Kennedy, J., Eberhart, R.: Particle swarm optimization. In: Proceedings of ICNN 1995-International Conference on Neural Networks, vol. 4, pp. 1942–1948. IEEE (1995). https://doi.org/10.1109/ICNN.1995.488968
Herrera, F., Carmona, C.J., González, P., Del Jesus, M.J.: An overview on subgroup discovery: foundations and applications. Knowl. Inf. Syst. 29, 495–525 (2011)
Atzmueller, M. Subgroup discovery. Wiley Interdiscip. Rev. Data Min. Knowl. Discov. 5(1), 35–49 (2015)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Nassih, R., Berrado, A. (2023). An Optimization Approach for Optimizing PRIM’s Randomly Generated Rules Using the Genetic Algorithm. In: Dorronsoro, B., Chicano, F., Danoy, G., Talbi, EG. (eds) Optimization and Learning. OLA 2023. Communications in Computer and Information Science, vol 1824. Springer, Cham. https://doi.org/10.1007/978-3-031-34020-8_23
Download citation
DOI: https://doi.org/10.1007/978-3-031-34020-8_23
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-34019-2
Online ISBN: 978-3-031-34020-8
eBook Packages: Computer ScienceComputer Science (R0)