Causal Discovery from Medical Data: Dealing with Missing Values and a Mixture of Discrete and Continuous Data

Sokolova, Elena; Groot, Perry; Claassen, Tom; von Rhein, Daniel; Buitelaar, Jan; Heskes, Tom

doi:10.1007/978-3-319-19551-3_23

Conference paper

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 9105)

Abstract

Causal discovery is an increasingly popular method for data analysis in medical research. In this paper we consider two challenges in causal discovery that occur very often when working with medical data: a mixture of discrete and continuous variables, and a substantial number of missing values. To the best of our knowledge, no existing method handles both challenges at the same time. We develop a new method that handles both, based on the assumptions that data are missing completely at random and that the variables obey a nonparanormal distribution. We demonstrate the validity of our approach to causal discovery on empirical data from a monetary incentive delay task. Our results may help to better understand the etiology of attention-deficit/hyperactivity disorder (ADHD).
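
To make the two assumptions in the abstract concrete, the sketch below shows one common way to estimate a correlation matrix for mixed data with missing values: rank correlations computed on pairwise-complete observations, followed by the 2 sin(πρ/6) transformation used for nonparanormal models. This is an illustration under stated assumptions, not necessarily the authors' procedure. The function names `average_ranks` and `pairwise_rank_correlation` and the toy data are hypothetical, and pairwise deletion is only unbiased if values are missing completely at random.

```python
import numpy as np


def average_ranks(values):
    """Return 1-based ranks, giving tied values their average rank."""
    _, inverse, counts = np.unique(values, return_inverse=True, return_counts=True)
    last_rank = np.cumsum(counts)               # highest rank in each tie group
    mean_rank = last_rank - (counts - 1) / 2.0  # midpoint of the tie group
    return mean_rank[inverse]


def pairwise_rank_correlation(data):
    """Estimate a correlation matrix from a 2-D array that uses NaN for missing values.

    For each pair of variables, only rows where both are observed are used
    (pairwise deletion). The Spearman rank correlation rho is then mapped to
    2 * sin(pi * rho / 6), the usual nonparanormal correction.
    """
    n_vars = data.shape[1]
    corr = np.eye(n_vars)
    for i in range(n_vars):
        for j in range(i + 1, n_vars):
            both = ~np.isnan(data[:, i]) & ~np.isnan(data[:, j])
            rho = np.corrcoef(average_ranks(data[both, i]),
                              average_ranks(data[both, j]))[0, 1]
            corr[i, j] = corr[j, i] = 2.0 * np.sin(np.pi * rho / 6.0)
    return corr


# Toy data (hypothetical): a continuous variable, a monotone transform of it,
# and a binary variable, with one missing value in each of the first two columns.
x = np.arange(10.0)
data = np.column_stack([x, x ** 3, (x > 4).astype(float)])
data[7, 0] = np.nan
data[2, 1] = np.nan

corr = pairwise_rank_correlation(data)
print(np.round(corr, 3))
```

A matrix assembled pair by pair in this way is not guaranteed to be positive semidefinite, so a downstream conditional independence test or scoring step may need a projection onto the nearest valid correlation matrix first.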

Author information

Authors and Affiliations

Faculty of Science, Radboud University, Nijmegen, The Netherlands
Elena Sokolova, Perry Groot, Tom Claassen & Tom Heskes
Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, The Netherlands
Daniel von Rhein & Jan Buitelaar

Corresponding author

Correspondence to Elena Sokolova.

Editor information

Editors and Affiliations

University of Pennsylvania, Philadelphia, Pennsylvania, USA
John H. Holmes
University of Pavia, Pavia, Italy
Riccardo Bellazzi
University of Pavia, Pavia, Italy
Lucia Sacchi
University of Manchester, Manchester, United Kingdom
Niels Peek

About this paper

Cite this paper

Sokolova, E., Groot, P., Claassen, T., von Rhein, D., Buitelaar, J., Heskes, T. (2015). Causal Discovery from Medical Data: Dealing with Missing Values and a Mixture of Discrete and Continuous Data. In: Holmes, J., Bellazzi, R., Sacchi, L., Peek, N. (eds) Artificial Intelligence in Medicine. AIME 2015. Lecture Notes in Computer Science, vol 9105. Springer, Cham. https://doi.org/10.1007/978-3-319-19551-3_23

DOI: https://doi.org/10.1007/978-3-319-19551-3_23
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-19550-6
Online ISBN: 978-3-319-19551-3
eBook Packages: Computer Science, Computer Science (R0)
