Abstract
We discuss our work in progress towards the practical application of a system we have developed for automated schema mapping of medical datasets. While starting with a purely knowledge-driven approach to the schema mapping problem, based on information in data dictionaries, we are now incorporating machine-learning classification for determining mappings. We are further integrating the mapping system into a production medical informatics environment. We discuss our ongoing approach and progress in these areas, as well as current challenges.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
References
Ashish, N., Dewan, P., Ambite, J., Toga, A.: GEM: The GAAIN entity mapper. 11th DILS (2015)
Dewan, P., Ashish, N., Toga, A.: A schema-matching tool for Alzheimer’s disease data integration. In: 5th ACM BCB (2014)
Rubens, N., Kaplan, D., Sugiyama, M.: Recommender Systems Handbook. Springer, US (2011)
Tata, S., Patel, J.: Estimating the selectivity of tf-idf based cosine similarity predicates. Sigmod Rec. 36(2), 75–80 (2007)
Blei, D.M.: Probabilistic topic models. Commun. ACM 55(4), 77–84 (2012). doi:10.1145/2133806.2133826
Mueller, S.G., Weiner, M.W., Thal, L.J., Peterson, R.C., Jack, C., Jagust, W., Trojanowski, J.Q., Toga, A.W., Beckett, L.: Alzheimer’s disease neuroimaging initiative. Neuroimaging Clin. N. Am. 15(4), 869–877 (2008)
Shen, L., Thompson, P.M., Potkin, S.G., Bertram, L., Farrer, L.A., Foroud, T.M., Green, R.C., Hu, X., Huentelman, M.J., Kim, S., Kauwe, J.S., Li, Q., Liu, E., Macciardi, F., Moore, J.H., Munsie, L., Nho, K., Ramanan, V.K., Risacher, S.L., Stone, D.J., Swaminathan, S., Toga, A.W., Weiner, M.W., Saykin, A.J.: Generic analysis of quantitative phenotypes in AD and MCI: imaging, cognition and biomarkers. Brain Imaging Beh. 8(2), 183–207 (2014)
Beekly, D.L., Ramos, E.M., Lee, W.W., et al.: The National Alzheimer’s Coordinating Center (NACC) database: The uniform data set. Alzheimer Dis. Assoc. Disord. 21, 249–258 (2007)
Xie, S.X., Baek, Y., Grossman, M., Arnold, S.E., Karlawish, J., Siderowf, A., Hurtig, H., Elman, L., McCluskey, L., Van Deerlin, V., Lee, V.M., Trojanowski, J.Q.: Building an integrated neurodegenerative disease database at an academic health center. Alzheimer’s Dement. 7, e84–e93 (2011). doi:10.1016/j.jalz.2010.08
Wu, X., Li, J., Ayutyanont, N., Protas, H., Jagust, W., Fleisher, A., Reiman, E., Yao, L., Chen, K.: The receiver operational characteristic for binary classification with multiple indices and its application to the neuroimaging study of Alzheimer’s disease. IEEE/ACM Trans. Comput. Biol. Bioinform. 10, 173–180 (2013)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Dewan, P. (2015). The GAAIN Entity Mapper: Towards Practical Medical Informatics Application. In: Ashish, N., Ambite, JL. (eds) Data Integration in the Life Sciences. DILS 2015. Lecture Notes in Computer Science(), vol 9162. Springer, Cham. https://doi.org/10.1007/978-3-319-21843-4_22
Download citation
DOI: https://doi.org/10.1007/978-3-319-21843-4_22
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-21842-7
Online ISBN: 978-3-319-21843-4
eBook Packages: Computer ScienceComputer Science (R0)