Skip to main content
Log in

Bayesian network parameter learning using constraint-based data extension method

  • Published:
Applied Intelligence Aims and scope Submit manuscript

Abstract

Bayesian networks (BNs) are one of the most compelling theoretical models in uncertain knowledge representation and inference. However, many domains are encountering the dilemma of insufficient data. Learning BN parameters using raw data may lead to low learning accuracy. Therefore, this paper seeks to solve the problem via two novel data extension methods. First, a constraint-based nonparametric bootstrap (CNB) method is proposed, which extends the raw data and guides the parameter distribution of the extended data through a constraint-based sample scoring function. The experimental results on 12 BNs show that the extended data can improve the parameter learning accuracy and enhance the existing parameter learning approaches. The CNB is still valid for medium and large networks with relatively large data. When the original data are of inferior quality, the CNB is unattainable to extend it. Then, a constraint-based parametric bootstrap (CPB) method is proposed, creating a new parameter distribution by constraints and the original samples. The experimental results for the missing data demonstrate that the extended data perform better. The CPB is insensitive to the proportion of missing data and remains superior in relatively large data.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

Notes

  1. https://github.com/bayesnet/bnt/tree/master/BNT

  2. http://www.bnlearn.com/bnrepository

References

  1. Pearl J (1988) Probabilistic reasoning in intelligent systems: networks of plausible inference san mateo. Comput Sci Artif Intell 58(2):721–721

    MathSciNet  Google Scholar 

  2. Zhang Y, Weng WG (2020) Bayesian network model for buried gas pipeline failure analysis caused by corrosion and external interference. Reliab Eng Syst Saf 107089:203

    Google Scholar 

  3. Fg A, Nm B (2020) A new scoring system for the rapid entire body assessment (reba) based on fuzzy sets and bayesian networks. Int J Ind Ergon 80:103058

    Article  Google Scholar 

  4. Zhang GH, Chen W, Jiao YY, Wang H, Wang CT (2020) A failure probability evaluation method for collapse of drill-and-blast tunnels based on multistate fuzzy bayesian network. Eng Geol 276:105752

    Article  Google Scholar 

  5. Zhang T, Zhang T, Li C, Zhai X, Huo Q (2021) Complementary and alternative therapies for precancerous lesions of gastric cancer: a protocol for a bayesian network meta analysis. Medicine 100 (2):24249

    Article  Google Scholar 

  6. Iraji Z, Jafarabadi MA, Jafari-Koshki T, Dolatkhah R (2020) A conditional probability model to predict the mortality in patients with breast cancer: a bayesian network analysis. Am J Med Sci 360(5):575–580

    Article  Google Scholar 

  7. Howey R, Shin SY, Relton C, Smith GD, Cordell HJ (2020) Bayesian network analysis incorporating genetic anchors complements conventional mendelian randomization approaches for exploratory analysis of causal relationships in complex data. PLOS Genetics 16(3):1–15

    Article  Google Scholar 

  8. Song C, Xu Z, Zhang Y, Wang X (2020) Dynamic hesitant fuzzy bayesian network and its application in the optimal investment port decision making problem of “twenty-first century maritime silk road”. Appl Intell 50(6):1846–1858

    Article  Google Scholar 

  9. Wang Z, Gao X, Yang Y, Tan X, Chen D (2021) Learning bayesian networks based on order graph with ancestral constraints. Knowl-Based Syst 211:106515

    Article  Google Scholar 

  10. Tan X, Gao X, Wang Z, He C (2021) Bidirectional heuristic search to find the optimal bayesian network structure. Neurocomputing 426:35–46

    Article  Google Scholar 

  11. Luo G, Zhao B, Du S (2019) Causal inference and bayesian network structure learning from nominal data. Appl Intell 49(1):253–264

    Article  Google Scholar 

  12. Gao X, Guo Z, Ren H, Yang Y, Chen D, He C (2019) Learning bayesian network parameters via minimax algorithm. Int J Approx Reason 108:62–75

    Article  MathSciNet  MATH  Google Scholar 

  13. Kovacic J (2020) Learning parameters of bayesian networks from datasets with systematically missing data: a meta-analytic approach. Expert Syst Appl 141(112956):1–11

    Google Scholar 

  14. Gao X, Yang Y, Gao Z (2019) Learning bayesian networks by constrained bayesian estimation. J Syst Eng Electron 30(3):511–524

    Article  MathSciNet  Google Scholar 

  15. Feelders AJ, Gaag LC (2006) Learning bayesian network parameters under order constraints. Int J Approx Reason 42(1-2):37–53

    Article  MathSciNet  MATH  Google Scholar 

  16. Feelders A, Gaag L (2012) Learning bayesian network parameters with prior knowledge about context-specific qualitative influences. Computer Science, p 193–200

  17. Chang R, Wang W (2010) Novel algorithm for bayesian network parameter learning with informative prior constraints. In: International joint conference on neural networks, IJCNN 2010, Barcelona, Spain, 18-23 july, 2010. IEEE, New York, pp 1–8

  18. Yang Y, Gao X, Guo Z, Chen D (2019) Learning bayesian networks using the constrained maximum a posteriori probability method. Pattern Recogn 91:123–134

    Article  Google Scholar 

  19. Campos CP, Tong Y, Ji Q (2008) Constrained maximum likelihood learning of bayesian networks for facial action recognition. In: Forsyth D.A., Torr P.H.S., Zisserman A. (eds.) Computer Vision - ECCV 2008, 10th European Conference on Computer Vision, Marseille, France, October 12-18, 2008, Proceedings, Part III Lecture Notes in Computer Science, vol. 5304. Springer, Berlin Heidelberg, pp 168–181

  20. Redner RA (1984) Mixture densities, maximum likelihood and the em algorithm. Siam Review 26(2):195–239

    Article  MathSciNet  MATH  Google Scholar 

  21. Guo Z, Gao X, Ren H, Yang Y, Di R, Chen D (2017) Learning bayesian network parameters from small data sets: a further constrained qualitatively maximum a posteriori method. Int J Approx Reason 91:22–35

    Article  MathSciNet  MATH  Google Scholar 

  22. Hao J, Yue K, Zhang B, Duan L, Fu X (2021) Transfer learning of bayesian network for measuring qos of virtual machines. Appl Intell 51(12):8641–8660

    Article  Google Scholar 

  23. Luis R, Sucar LE, Morales EF (2010) Inductive transfer for learning bayesian networks. Mach Learn 79(1-2):227–255

    Article  MathSciNet  MATH  Google Scholar 

  24. Yuan P, Sun Y, Li H, Wang F, Li H (2019) Abnormal condition identification modeling method based on bayesian network parameters transfer learning for the electro-fused magnesia smelting process. IEEE Access 7:149764–149775

    Article  Google Scholar 

  25. Friedman N, Goldszmidt M (1999) Wynera.: Data analysis with bayesian networks: a bootstrapa pproach. In: Proceedings of the Fifteenth Conference on Uncertainty in Artificial Intelligence. Morgan Kaufmann Publishers Inc., San Francisco, CA, USA, pp 196–205

  26. Chen G, Ge Z (2020) Robust bayesian networks for low-quality data modeling and process monitoring applications. Control Eng Pract 97(104344):1–14

    Google Scholar 

  27. Kullback S, Leibler RA (1951) On Information and Sufficiency vol 22, p 79–86

  28. Liao W, Ji Q (2009) Learning bayesian network parameters under incomplete data with domain knowledge. Pattern Recogn 42(11):3046–3056

    Article  MATH  Google Scholar 

  29. Jiang L, Zhang L, Yu L, Wang D (2019) Class-specific attribute weighted naive bayes. Pattern Recogn 88:321–330

    Article  Google Scholar 

  30. Jiang L, Zhang L, Li C, Wu J (2019) A correlation-based feature weighting filter for naive bayes. IEEE Trans Knowl Data Eng 31(2):201–213

    Article  Google Scholar 

Download references

Funding

This work was supported by the National Natural Science Foundation of China (61573285).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Xiaoguang Gao.

Ethics declarations

Conflict of Interests

The authors declare no conflict of interest.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Ru, X., Gao, X., Wang, Y. et al. Bayesian network parameter learning using constraint-based data extension method. Appl Intell 53, 9958–9977 (2023). https://doi.org/10.1007/s10489-022-03941-2

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10489-022-03941-2

Keywords

Navigation