Abstract
Bayesian networks (BNs) are one of the most compelling theoretical models in uncertain knowledge representation and inference. However, many domains are encountering the dilemma of insufficient data. Learning BN parameters using raw data may lead to low learning accuracy. Therefore, this paper seeks to solve the problem via two novel data extension methods. First, a constraint-based nonparametric bootstrap (CNB) method is proposed, which extends the raw data and guides the parameter distribution of the extended data through a constraint-based sample scoring function. The experimental results on 12 BNs show that the extended data can improve the parameter learning accuracy and enhance the existing parameter learning approaches. The CNB is still valid for medium and large networks with relatively large data. When the original data are of inferior quality, the CNB is unattainable to extend it. Then, a constraint-based parametric bootstrap (CPB) method is proposed, creating a new parameter distribution by constraints and the original samples. The experimental results for the missing data demonstrate that the extended data perform better. The CPB is insensitive to the proportion of missing data and remains superior in relatively large data.








Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Pearl J (1988) Probabilistic reasoning in intelligent systems: networks of plausible inference san mateo. Comput Sci Artif Intell 58(2):721–721
Zhang Y, Weng WG (2020) Bayesian network model for buried gas pipeline failure analysis caused by corrosion and external interference. Reliab Eng Syst Saf 107089:203
Fg A, Nm B (2020) A new scoring system for the rapid entire body assessment (reba) based on fuzzy sets and bayesian networks. Int J Ind Ergon 80:103058
Zhang GH, Chen W, Jiao YY, Wang H, Wang CT (2020) A failure probability evaluation method for collapse of drill-and-blast tunnels based on multistate fuzzy bayesian network. Eng Geol 276:105752
Zhang T, Zhang T, Li C, Zhai X, Huo Q (2021) Complementary and alternative therapies for precancerous lesions of gastric cancer: a protocol for a bayesian network meta analysis. Medicine 100 (2):24249
Iraji Z, Jafarabadi MA, Jafari-Koshki T, Dolatkhah R (2020) A conditional probability model to predict the mortality in patients with breast cancer: a bayesian network analysis. Am J Med Sci 360(5):575–580
Howey R, Shin SY, Relton C, Smith GD, Cordell HJ (2020) Bayesian network analysis incorporating genetic anchors complements conventional mendelian randomization approaches for exploratory analysis of causal relationships in complex data. PLOS Genetics 16(3):1–15
Song C, Xu Z, Zhang Y, Wang X (2020) Dynamic hesitant fuzzy bayesian network and its application in the optimal investment port decision making problem of “twenty-first century maritime silk road”. Appl Intell 50(6):1846–1858
Wang Z, Gao X, Yang Y, Tan X, Chen D (2021) Learning bayesian networks based on order graph with ancestral constraints. Knowl-Based Syst 211:106515
Tan X, Gao X, Wang Z, He C (2021) Bidirectional heuristic search to find the optimal bayesian network structure. Neurocomputing 426:35–46
Luo G, Zhao B, Du S (2019) Causal inference and bayesian network structure learning from nominal data. Appl Intell 49(1):253–264
Gao X, Guo Z, Ren H, Yang Y, Chen D, He C (2019) Learning bayesian network parameters via minimax algorithm. Int J Approx Reason 108:62–75
Kovacic J (2020) Learning parameters of bayesian networks from datasets with systematically missing data: a meta-analytic approach. Expert Syst Appl 141(112956):1–11
Gao X, Yang Y, Gao Z (2019) Learning bayesian networks by constrained bayesian estimation. J Syst Eng Electron 30(3):511–524
Feelders AJ, Gaag LC (2006) Learning bayesian network parameters under order constraints. Int J Approx Reason 42(1-2):37–53
Feelders A, Gaag L (2012) Learning bayesian network parameters with prior knowledge about context-specific qualitative influences. Computer Science, p 193–200
Chang R, Wang W (2010) Novel algorithm for bayesian network parameter learning with informative prior constraints. In: International joint conference on neural networks, IJCNN 2010, Barcelona, Spain, 18-23 july, 2010. IEEE, New York, pp 1–8
Yang Y, Gao X, Guo Z, Chen D (2019) Learning bayesian networks using the constrained maximum a posteriori probability method. Pattern Recogn 91:123–134
Campos CP, Tong Y, Ji Q (2008) Constrained maximum likelihood learning of bayesian networks for facial action recognition. In: Forsyth D.A., Torr P.H.S., Zisserman A. (eds.) Computer Vision - ECCV 2008, 10th European Conference on Computer Vision, Marseille, France, October 12-18, 2008, Proceedings, Part III Lecture Notes in Computer Science, vol. 5304. Springer, Berlin Heidelberg, pp 168–181
Redner RA (1984) Mixture densities, maximum likelihood and the em algorithm. Siam Review 26(2):195–239
Guo Z, Gao X, Ren H, Yang Y, Di R, Chen D (2017) Learning bayesian network parameters from small data sets: a further constrained qualitatively maximum a posteriori method. Int J Approx Reason 91:22–35
Hao J, Yue K, Zhang B, Duan L, Fu X (2021) Transfer learning of bayesian network for measuring qos of virtual machines. Appl Intell 51(12):8641–8660
Luis R, Sucar LE, Morales EF (2010) Inductive transfer for learning bayesian networks. Mach Learn 79(1-2):227–255
Yuan P, Sun Y, Li H, Wang F, Li H (2019) Abnormal condition identification modeling method based on bayesian network parameters transfer learning for the electro-fused magnesia smelting process. IEEE Access 7:149764–149775
Friedman N, Goldszmidt M (1999) Wynera.: Data analysis with bayesian networks: a bootstrapa pproach. In: Proceedings of the Fifteenth Conference on Uncertainty in Artificial Intelligence. Morgan Kaufmann Publishers Inc., San Francisco, CA, USA, pp 196–205
Chen G, Ge Z (2020) Robust bayesian networks for low-quality data modeling and process monitoring applications. Control Eng Pract 97(104344):1–14
Kullback S, Leibler RA (1951) On Information and Sufficiency vol 22, p 79–86
Liao W, Ji Q (2009) Learning bayesian network parameters under incomplete data with domain knowledge. Pattern Recogn 42(11):3046–3056
Jiang L, Zhang L, Yu L, Wang D (2019) Class-specific attribute weighted naive bayes. Pattern Recogn 88:321–330
Jiang L, Zhang L, Li C, Wu J (2019) A correlation-based feature weighting filter for naive bayes. IEEE Trans Knowl Data Eng 31(2):201–213
Funding
This work was supported by the National Natural Science Foundation of China (61573285).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of Interests
The authors declare no conflict of interest.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Ru, X., Gao, X., Wang, Y. et al. Bayesian network parameter learning using constraint-based data extension method. Appl Intell 53, 9958–9977 (2023). https://doi.org/10.1007/s10489-022-03941-2
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-022-03941-2