Abstract
Constructing Bayesian network structures from data is an NP-hard problem. This paper presents a novel method for Bayesian network structure learning using a discrete Harris hawks optimization algorithm, named BNC-HHO. It uses the max-min parents and children algorithm, V-structure & log-likelihood function, and neighborhood structures to limit the search space during the initialization phase. Then, the Harris hawk optimization algorithm is extended from the continuous to the discrete domain by redefining the movement strategies of hawks using genetic operators in genetic algorithm. The crossover and mutation operations in the proposed method are controlled by an adaptive crossover and mutation rate based on the X-conditional cloud. To balance the exploration and exploitation phases, a nonlinear escaping energy curve is also designed. Finally, the quality of the solution is further improved using a local optimizer. Experiments on various standard networks demonstrate that the proposed algorithm can quickly get higher structure scores and better convergence accuracy in most cases compared to other state-of-the-art algorithms. It indicates that the proposed algorithm can be used as an effective and feasible method for learning Bayesian network structures.
Similar content being viewed by others
Data availability
The data generated during the current study are available from the corresponding author on reasonable request.
Notes
https://github.com/bayesnet/bnt.
https://github.com/mensxmachina/CausalExplorer 1.5.
References
Akaike H (1974) A new look at the statistical model identification. IEEE Trans Autom Control 19(6):716–723. https://doi.org/10.1109/TAC.1974.1100705
Asadayoobi N, Taghipour S, Jaber MY (2022) Predicting human reliability based on probabilistic mission completion time using bayesian network. Reliab Eng Syst Saf 221(108):324. https://doi.org/10.1016/j.ress.2022.108324
Askari MBA, Ahsaee MG (2018) Bayesian network structure learning based on cuckoo search algorithm. In: 2018 6th Iranian Joint Congress on Fuzzy and Intelligent Systems (CFIS), IEEE, pp 127–130, https://doi.org/10.1109/CFIS.2018.8336652
Cerf R (1998) Asymptotic convergence of genetic algorithms. Adv Appl Probab 30(2):521–550. https://doi.org/10.1002/qua.560510407
Chen S, Guo X, Han S et al (2021) A novel stochastic cloud model for statistical characterization of wind turbine output. IEEE ACCESS 9:7439–7446. https://doi.org/10.1109/ACCESS.2020.3038450
Chickering DM (1996) Learning bayesian networks is np-complete. Learning from data. Springer, Berlin, pp 121–130. https://doi.org/10.1007/978-1-4612-2404-4_12
Constantinou AC (2020) Learning bayesian networks that enable full propagation of evidence. IEEE Access 8:124845–124856. https://doi.org/10.1109/ACCESS.2020.3006472
Dai J, Ren J, Du W (2020) Decomposition-based bayesian network structure learning algorithm using local topology information. Knowl-Based Syst 195(105):602. https://doi.org/10.1016/j.knosys.2020.105602
Fang W, Zhang W, Ma L et al (2023) An efficient bayesian network structure learning algorithm based on structural information. Swarm Evol Comput 76(101):224. https://doi.org/10.1016/J.SWEVO.2022.101224
Fawcett C, Hoos HH (2016) Analysing differences between algorithm configurations through ablation. J Heurist 22(4):431–458. https://doi.org/10.1007/s10732-014-9275-9
Feng D (2013) Binary cuckoo search algorithm. J Comput Appl 33(06):1566. https://doi.org/10.3724/SP.J.1087.2013.01566
Gao F, Huang D (2020) A node sorting method for k2 algorithm in bayesian network structure learning. In: 2020 IEEE International Conference on Artificial Intelligence and Computer Applications (ICAICA), IEEE, pp 106–110, https://doi.org/10.1109/ICAICA50127.2020.9182465
Gheisari S, Meybodi MR (2016) Bnc-pso: structure learning of bayesian networks by particle swarm optimization. Inf Sci 348:272–289. https://doi.org/10.1016/j.ins.2016.01.090
Guo H, Li H (2020) An efficient bayesian network structure learning algorithm using the strategy of two-stage searches. Intell Data Anal 24(5):1087–1106. https://doi.org/10.3233/Ida-194844
Guo Q, Ren H, Yu J et al (2022) A method of predicting oil and gas resource spatial distribution based on bayesian network and its application. J Petrol Sci Eng 208(109):267. https://doi.org/10.1016/j.petrol.2021.109267
Gures SSMARDildarBureerat (2021) Comparison of the arithmetic optimization algorithm, the slime mold optimization algorithm, the marine predators algorithm, the salp swarm algorithm for real-world engineering applications. Materialpruefung: Werkstoffe und Bauteile, Forschung Prufung Anwendung 63(5):448–452. https://doi.org/10.1515/mt-2020-0076
Heckerman D, Geiger D, Chickering DM (1995) Learning bayesian networks: the combination of knowledge and statistical data. Mach Learn 20:197–243. https://doi.org/10.1007/BF00994016
Heidari AA, Mirjalili S, Faris H et al (2019) Harris hawks optimization: algorithm and applications. Futur Gener Comput Syst 97:849–872. https://doi.org/10.1016/j.future.2019.02.028
Houssein EH, Hosney ME, Oliva D et al (2020) A novel hybrid harris hawks optimization and support vector machines for drug design and discovery. Comput Chem Eng 133(106):656. https://doi.org/10.1016/j.compchemeng.2019.106656
Houssein EH, Abdelkareem DA, Emam MM et al (2022) An efficient image segmentation method for skin cancer imaging using improved golden jackal optimization algorithm. Comput Biol Med 149(106):075. https://doi.org/10.1016/j.compbiomed.2022.106075
Jose S, Louis SJ, Dascalu SM, et al. (2020) Bayesian network structure learning using case-injected genetic algorithms. In: 2020 IEEE 32nd International Conference on Tools with Artificial Intelligence (ICTAI), IEEE, pp 572–579, https://doi.org/10.1109/ictai50040.2020.00094
Kar AK (2016) Bio inspired computing-a review of algorithms and scope of applications. Expert Syst Appl 59:20–32. https://doi.org/10.1016/j.eswa.2016.04.018
Kareem SW, Okur MC (2020) Structure learning of bayesian networks using elephant swarm water search algorithm. Int J Swarm Intell Res (IJSIR) 11(2):19–30. https://doi.org/10.4018/ijsir.2020040102
Kareem SW, Okur MC (2021) Falcon optimization algorithm for bayesian networks structure learning. Comput Sci. https://doi.org/10.7494/csci.2021.22.4.3773
Khanteymoori AR, Olyaee MH, Abbaszadeh O et al (2018) A novel method for bayesian networks structure learning based on breeding swarm algorithm. Soft Comput 22(9):3049–3060. https://doi.org/10.1007/s00500-017-2557-z
Kumari P, Bhadriraju B, Wang Q et al (2022) A modified bayesian network to handle cyclic loops in root cause diagnosis of process faults in the chemical process industry. J Process Control 110:84–98. https://doi.org/10.1016/j.jprocont.2021.12.011
Li T, Zhou Y, Zhao Y et al (2022) A hierarchical object oriented bayesian network-based fault diagnosis method for building energy systems. Appl Energy 306(118):088. https://doi.org/10.1016/j.apenergy.2021.118088
Ling Z, Yu K, Zhang Y et al (2022) Causal learner: A toolbox for causal structure and markov blanket learning. Pattern Recogn Lett 163:92–95. https://doi.org/10.1016/j.patrec.2022.09.021
Liu H, Wang N, Wang Y et al (2021) Bayesian network structure algorithm based on v-structure & log-likelihood orientation and tabu hill climbing. J Electron Inf Technol 43(11):3272–3281. https://doi.org/10.11999/JEIT210032
McLachlan S, Dube K, Hitman GA et al (2020) Bayesian networks in healthcare: distribution by medical condition. Artif Intell Med 107(101):912. https://doi.org/10.1016/j.artmed.2020.101912
Michalewicz Z, Janikow CZ (1991) Genetic algorithms for numerical optimization. Stat Comput 1:75–91. https://doi.org/10.1007/BF01889983
Nadimi-Shahraki MH, Zamani H, Mirjalili S (2022) Enhanced whale optimization algorithm for medical feature selection: A covid-19 case study. Comput Biol Med 148(105):858. https://doi.org/10.1016/j.compbiomed.2022.105858
Rarità L (2020) Optimization approaches to manage congestions for the phenomenon “Luci D’Artista” in Salerno In: The 32nd European Modeling & Simulation Symposium, pp 319–324, https://doi.org/10.46354/i3m.2020.emss.046
Rarità L (2022) A genetic algorithm to optimize dynamics of supply chains. Optimization in artificial intelligence and data sciences. Springer, Berlin, pp 107–115. https://doi.org/10.1007/978-3-030-95380-5_10
Schwarz G (1987) Estimating the dimension of a model. Ann Stat 6(2):461–464. https://doi.org/10.1214/aos/1176344136
Sun B, Zhou Y, Wang J et al (2021) A new pc-pso algorithm for bayesian network structure learning with structure priors. Expert Syst Appl 184(115):237. https://doi.org/10.1016/j.eswa.2021.115237
Suzuki J (1993) A construction of bayesian networks from databases based on an mdl principle. Uncertainty in artificial intelligence. Elsevier, Amsterdam, pp 266–273. https://doi.org/10.1016/B978-1-4832-1451-1.50037-8
Tabar VR, Eskandari F, Salimi S et al (2018) Finding a set of candidate parents using dependency criterion for the k2 algorithm. Pattern Recogn Lett 111:23–29. https://doi.org/10.1016/j.patrec.2018.04.019
Tsamardinos I, Brown LE, Aliferis CF (2006) The max-min hill-climbing bayesian network structure learning algorithm. Mach Learn 65(1):31–78. https://doi.org/10.1007/s10994-006-6889-7
Wang J, Liu S (2018) Novel binary encoding water cycle algorithm for solving bayesian network structures learning problem. Knowl-Based Syst 150:95–110. https://doi.org/10.1016/j.knosys.2018.03.007
Wang J, Liu S (2019) A novel discrete particle swarm optimization algorithm for solving bayesian network structures learning problem. Int J Comput Math 96(12):2423–2440. https://doi.org/10.1080/00207160.2019.1566535
Wang W, He X, Li Y et al (2022) Risk analysis on corrosion of submarine oil and gas pipelines based on hybrid bayesian network. Ocean Eng 260(111):957. https://doi.org/10.1016/j.oceaneng.2022.111957
Wang X, Ren H, Guo X (2022) A novel discrete firefly algorithm for bayesian network structure learning. Knowl-Based Syst 242(108):426. https://doi.org/10.1016/j.knosys.2022.108426
Yıldız BS, Patel V, Pholdee N et al (2021) Conceptual comparison of the ecogeography-based algorithm, equilibrium algorithm, marine predators algorithm and slime mold algorithm for optimal product design. Mater Test 63(4):336–340. https://doi.org/10.1515/mt-2020-0049
Yıldız BS, Pholdee N, Bureerat S et al (2021) Comparision of the political optimization algorithm, the archimedes optimization algorithm and the levy flight algorithm for design optimization in industry. Mater Test 63(4):356–359. https://doi.org/10.1515/mt-2020-0053
Yuan Y, Lin L, Xie C et al (2022) Effectiveness comparisons of various psychosocial therapies for cancer-related fatigue: A bayesian network meta-analysis. J Affect Disord. https://doi.org/10.1016/j.jad.2022.04.152
Yücetürk H, Gülle H, Şakar CT et al (2022) Reducing the question burden of patient reported outcome measures using bayesian networks. J Biomed Inform. https://doi.org/10.1016/j.jbi.2022.104230
Zhou Z, Liu S, Qi H (2022) Mitigating subway construction collapse risk using bayesian network modeling. Autom Constr 143(104):541. https://doi.org/10.1016/j.autcon.2022.104541
Zhu M, Liu S, Jiang J (2016) Learning Bayesian networks in the space of structures by a hybrid optimization algorithm. Int J Comput Commun Control 11(6):889–901. https://doi.org/10.15837/ijccc.2016.6.2502
Funding
This study was funded by the National Key R &D Program of China (No.2019YFB1707301).
Author information
Authors and Affiliations
Contributions
Conceptualization, methodology, Writing-original draft preparation by YC; Experiment and data processing by QS; Supervision and resources by HL, NW, and LZ; Writing review and editing by SL, SC. All authors commented on previous versions of the manuscript. All authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Conflict of interest
All the authors of this research paper declare that there is no conflict of interest.
Ethical approval
This article does not contain any studies with human participants or animals performed by any of the authors.
Informed consent:
Informed consent was obtained from all individual participants included in the study.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Liu, H., Cai, Y., Shi, Q. et al. An improved Harris Hawks optimization for Bayesian network structure learning via genetic operators. Soft Comput 27, 14659–14672 (2023). https://doi.org/10.1007/s00500-023-09107-7
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00500-023-09107-7