Abstract
An experiment in Bayesian model building from a large medical dataset for Mental Retardation is discussed in this paper. We give a step by step description of the practical aspects of building a Bayesian Network from a dataset. We enumerate and briefly describe the tools required, address the problem of missing values in big datasets resulting from incomplete clinical findings and elaborate on our solution to the problem. We advance some reasons why imputation is a more desirable approach for model building than some other ad hoc methods suggested in literature. In our experiment, the initial Bayesian Network is learned from a dataset using a machine learning program called CB. The network structure and the conditional probabilities are then modified under the guidance of a domain expert. We present validation results for the unmodified and modified networks and give some suggestions for improvement of the model.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
J.M. Zytkow and J. Baker, “Interactive mining of regularities in databases,” in: Knowledge Discovery in Databases, edited by Gregory Piatetsky-Shapiro and William J. Frawley, The AAAI Press: 445 Burgess Drive, Menlo Park, CA 94025, 1991, pp. 31–54.
J. Pearl and T. Verma, “A theory of inferred causation,” in Principles of Knowledge Representation and Reasoning: Proceedings of the Second Conference, edited by J.A. Allen, R. Fikes, and E. Sandewall, Morgan-Kaufmann: San Mateo, CA, 1991, pp. 441–452.
J. Pearl, Causality: Models, Reasoning, and Inference, Cambridge University Press: Cambridge, UK, 2000.
M.L. Batshaw, “Mental retardation,” in Pediatric Clinics of North America—The Child With Developmental Disabilities, vol. 40(3), edited by M.L Batshaw, W.B Saunders Company: Philadelphia, PA, 1993, pp. 507–522.
Z.A. Stein and M.W. Susser, “Mental retardation,” in Public Health and Preventive Medicine, 13th edn., M.J. Last and R.B. Wallace, Appleton & Lange: San Mateo, CA, 1992.
American Association on Mental Retardation, Mental Retardation American Association on Mental Retardation, 1719 Kalorama Road, NW Washington DC, 1992.
S.W. McDermott, A.L. Coker, and R.E. McKeown, “Low birth-weight and risk of mild mental retardation by ages 5 and 9 to 11,” Pediatric and Perinatal Epidemiology, vol. 7, pp. 195–204, 1993.
S. Mani, S. McDermott, and M. Valtorta, “MENTOR: A Bayesian model for prediction of mental retardation in newborns,” Research in Developmental Disabilities, vol. 18, no. 5, pp. 303–318, 1997.
R. Neapolitan, Probabilistic Reasoning in Expert Systems: Theory and Algorithms. Wiley: New York, 1990.
J. Pearl, Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference, Morgan Kaufmann: San Mateo, California, 1988.
E. Charniak, “Bayesian networks without tears,” AI Magazine, vol. 12, no. 4, pp. 50–63, 1991.
D.J. Spiegelhalter, A.P. Dawid, S.L. Lauritzen, and R.G. Cowell, “Bayesian analysis in expert systems,” Statistical Science, vol. 8, no. 3, pp. 219–283, 1993.
R.G. Cowell, A.P. Dawid, S.L. Lauritzen, and D.J. Spiegelhalter, Probabilistic Networks and Expert Systems, Springer: New York, 1999.
F.V. Jensen, An Introduction to Bayesian Networks, Springer: New York, 1996.
F.V. Jensen, Bayesian Networks and Decision Graphs, Springer: New York, 2001.
S.L. Lauritzen, B. Thiesson, and D. Spiegelhalter, “Diagnostic systems created by model selection methods—A case study,” Preliminary Papers of the Fourth International Workshop on Artificial Intelligence and Statistics, Ft. Lauderdale, FL, January 3–6, pp. 93–105, 1993.
G.F. Cooper, and E. Herskovits, “A Bayesian method for the induction of probabilistic networks from data,” Machine Learning, vol. 9, pp. 309–347, 1992.
M. Singh and M. Valtorta, “A new algorithm for the construction of bayesian network structures from data,” in Proceedings of the 9th Conference on Uncertainty in Artificial Intelligence (UAI-93), Washington, DC, July 1993, pp. 259-264.
M. Singh and M. Valtorta, “Construction of bayesian belief networks from data: A brief survey and an efficient algorithm,” International Journal of Approximate Reasoning, vol. 12, pp. 111–131, 1995.
Data Archive and Users Manual of the Child Health and Development Studies, Vols. 1 & 2. School of Public Health, University of California at Berkeley, CA, March 1, 1987.
B. Xia, “An algorithm to learn bayesian probabilistic network structures from data,” M.S. Thesis, Department of Computer Science and Engineering, University of South Carolina, Columbia, SC, 2002.
S.L. Lauritzen and D.J. Spiegelhalter, “Local computation with probabilities on graphical structures and their applications to expert systems,” J.R. Statist. Soc., Series B, vol. 50, pp. 157–224, 1988.
S.K. Andersen, K.G. Olesen, F.V. Jensen, and F. Jensen. “HUGIN—A shell for building bayesian belief universes for expert systems,” in Proceedings of the Eleventh International Joint Conference on Artificial Intelligence, Detroit, 1989, pp. 1080–1085.
S. Andreassen, M. Woldbye, B. Falck, and S.K. Andersen, “MUNIN—A causal probabilistic network for interpretation of electromyographic findings,” in Proceedings of the Tenth International Joint Conference on Artificial Intelligence, Milan, 1987, pp. 366–372.
P.M. Murphy and D.W. Aha, UCI Repository of Machine Learning Databases. [http://www.ics.uci.edu/mlearn/ML.Repository.html]. University of California, Dept. of Information and Computer Science: Irvine, CA.
R. Neapolitan, Learning Bayesian Networks, Pearson Prentice Hall: Upper Saddle River, NJ, 2004.
N. Friedman, “The Bayesian structural EM algorithm,” in Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence (UAI-98), Madison, WI, July 1998, pp. 129–138.
J.S. Mausner and S. Kramer, “The concept of causality and steps in the establishment of causal relationships,” in Epidemiology—An introductory Text, Ch 7. Company: Philadelphia, PA, W.B Saunders, 1985.
S.M. Weiss and C.A. Kulikowski, Computer Systems that Learn. San Mateo, Morgan-Kaufmann: CA, 1991.
S. Russell and P. Norvig, Artificial Intelligence: A Modern Approach. Englewood Cliffs, Prentice Hall: New Jersey, 1995.
P. Spirtes, C. Glymour, and R. Scheines, Causation, Prediction, and Search, 2nd edn., MIT Press: Cambridge, MA, 2000.
S.L. Lauritzen and R. Thomas, “Chain graph models and their causal interpretation,” Research Report R-01-2003, Department of Mathematical Sciences, Aalborg University. (The revised version of August 2001 is available at http://www.math. auc.dk/steffen/papers/rss.ps. The paper is to appear in Journal of the Royal Statistical Society, Series B.)
S. Monti and G.F. Cooper, “A latent variable model for multivariate discretization,” in Proceedings of the Seventh International Workshop on Artificial Intelligence and Statistics, January 1999 (http://www-2.cs.cmu.edu/smonti/publications/ais99.html).
G. Provan and M. Singh, “Learning bayesian networks using feature selection,” in Learning from Data, Lecture Notes in Statistics edited by D. Fisher, and H. Lenz, Springer Verlag: New York, vol. 112, pp. 291–300.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Mani, S., Valtorta, M. & McDermott, S. Building Bayesian Network Models in Medicine: The MENTOR Experience. Appl Intell 22, 93–108 (2005). https://doi.org/10.1007/s10489-005-5599-3
Issue Date:
DOI: https://doi.org/10.1007/s10489-005-5599-3