Abstract
Scientific data generation in the world is continuous. However, scientific studies once published do not take advantage of new data. In order to leverage this incoming flow of data, we present Neuro-DISK, an end-to-end framework to continuously process neuroscience data and update the assessment of a given hypothesis as new data become available. Our scope is within the ENIGMA consortium, a large international collaboration for neuro-imaging and genetics whose goal is to understand brain structure and function. Neuro-DISK includes an ontology and framework to organize datasets, cohorts, researchers, tools, working groups and organizations participating in multi-site studies, such as those of ENIGMA, and an automated discovery framework to continuously test hypotheses through the execution of scientific workflows. We illustrate the usefulness of our approach with an implemented example.
D. Garijo and S. Fakhraei—Co-first author.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Gil, Y., et al.: Automated hypothesis testing with large scientific data repositories. In: Proceedings of the Fourth Annual Conference on Advances in Cognitive Systems (ACS) (2016)
Thompson, P., et al.: ENIGMA and global neuroscience: a decade of large-scale studies of the brain in health and disease across more than 40 Countries, July 2019
Shalev-Shwartz, S., et al.: Online learning and online convex optimization. Found. Trends Mach. Learn. 4(2), 107–194 (2012)
Gama, J.: A survey on learning from data streams: current and future trends. Prog. Artif. Intell. 1(1), 45–55 (2012)
King, R.D., et al.: The automation of science. Science 324(5923), 85–89 (2009)
Soldatova, L.N., King, R.D.: An ontology of scientific experiments. J. R. Soc. Interface 3(11), 795–803 (2006)
Soldatova, L.N., Rzhetsky, A., De Grave, K., King, R.D.: Representation of probabilistic scientific knowledge. J. Biomed. Semant. 4, S7 (2013)
Thompson, P.M., et al.: The ENIGMA consortium: large-scale collaborative analyses of neuroimaging and genetic data. Brain Imaging Behav. 8(2), 153–182 (2014)
Gil, Y., Ratnakar, V., Hanson, P.C.: Organic data publishing: a novel approach to scientific data sharing. In: Second International Workshop on Linked Science: Tackling Big Data (LISC), Held in Conjunction with ISWC, Boston, MA (2012)
Jang, M., et al.: Towards automatic generation of portions of scientific papers for large multi-institutional collaborations based on semantic metadata. In: CEUR Workshop Proceedings, vol. 1931, pp. 63–70 (2017)
Gil, Y., et al.: Towards continuous scientific data analysis and hypothesis evolution. In: Thirty-First AAAI Conference on Artificial Intelligence (2017)
Gil, Y., et al.: Wings: intelligent workflow-based design of computational experiments. IEEE Intell. Syst. 26(1), 62–72 (2010)
Alzheimer’s Association, et al.: 2018 Alzheimer’s disease facts and figures. Alzheimer’s Dement. 14(3), 367–429 (2018)
Jack, C.R., et al.: Tracking pathophysiological processes in alzheimer’s disease: an updated hypothetical model of dynamic biomarkers. Lancet Neurol. 12(2), 207–216 (2013)
Lambert, J.-C., et al.: Meta-analysis of 74,046 individuals identifies 11 new susceptibility loci for Alzheimer’s disease. Nat. Genet. 45(12), 1452 (2013)
Schuff, N., et al.: MRI of hippocampal volume loss in early alzheimer’s disease in relation to ApoE genotype and biomarkers. Brain 132(4), 1067–1077 (2009)
Lyall, D.M., et al.: Is there association between APOE e4 genotype and structural brain ageing phenotypes, and does that association increase in older age in UK Biobank? (N = 8,395). bioRxiv (2017)
Fischl, B.: FreeSurfer. NeuroImage 62(2), 774–781 (2012)
Hibar, D.P., et al.: Novel genetic loci associated with hippocampal volume. Nat. Commun. 8, 13624 (2017)
Acknowledgments
We are grateful to the KAVLI foundation for their support of ENIGMA Informatics (PIs: Jahanshad and Gil). We also acknowledge support from the National Science Foundation under awards IIS-1344272 (PI: Gil), ICER-1541029 (Co-PI: Gil), and IIS-1344272 (PI: Gil), and from the National Institutes of Health’s Big Data to Knowledge Grant U54EB020403 for support for ENIGMA (PI: Thompson) and High resolution mapping of the genetic risk for disease in the aging brain grant R01AG059874 (PI: Jahanshad). Data used in preparing this article were obtained from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database (adni.loni.usc.edu), phases both 1 and 2. As such, many investigators within the ADNI contributed to the design and implementation of ADNI and/or provided data but did not participate in analysis or writing of this report. A complete listing of ADNI investigators is available online (http://adni.loni.usc.edu/wp-content/uploads/how_to_apply/ADNI_Acknowledgement_List.pdf). We also used the DLBS (http://fcon_1000.projects.nitrc.org/indi/retro/dlbs.html) dataset and the UK Biobank in this study. This research was conducted using the UK Biobank Resource under Application Number ‘11559’.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Garijo, D. et al. (2019). Towards Automated Hypothesis Testing in Neuroscience. In: Gadepally, V., et al. Heterogeneous Data Management, Polystores, and Analytics for Healthcare. DMAH Poly 2019 2019. Lecture Notes in Computer Science(), vol 11721. Springer, Cham. https://doi.org/10.1007/978-3-030-33752-0_18
Download citation
DOI: https://doi.org/10.1007/978-3-030-33752-0_18
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-33751-3
Online ISBN: 978-3-030-33752-0
eBook Packages: Computer ScienceComputer Science (R0)