Definition
Information integration refers to the field of study of techniques attempting to combine information from disparate sources despite differing conceptual, contextual and lexical representations. One goal of information integration, commonly held by the data management community, is to combine the information in such a way that the user gets a unified view of the data. In other words, the user should see and query the data as though it is present in a common, unified schema.
In the domain of scientific applications, the problems and expectations are different. In this domain the semantics of data play a very strong role in data integration, and semantic compatibility need to be ensured as part of the data integration process. Further, straightforward view-based data integration, which works well for commercial applications, does not always suit the needs of the scientific users. Finally, scientists need to ensure that the result of any query on integrated data is...
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Recommended Reading
Bowers S, Lin K, Ludaescher B. On integrating scientific resources through semantic registration. In: Proceedings of the 16th International Conference on Scientific and Statistical Database Management; 2004. p. 349.
Cluet S, Delobel C, Siméon J, Smaga K. Your mediators need data conversion! In: Proceedings of the ACM SIGMOD International Conference on Management of Data; 1998. p. 177–88.
Critchlow T, Fidelis K, Ganesh M, Musick R, Slezak T. Datafoundry: information management for scientific data. IEEE Trans Inf Technol Biomed. 2000;4(1):52–7.
Gupta A, Ludäscher B, Martone ME, Rajasekar A, Ross E, Qian X, Santini S, He H, Zaslavsky I. BIRN-M: a semantic mediator for solving real-world neuroscience problems. In: Proceedings of the ACM SIGMOD International Conference on Management of Data; 2003. p. 678.
Hakimpour F, Geppert A. Resolving semantic heterogeneity in schema integration. In: Proceedings of the 2nd International Conference on Formal Ontology in Information Systems; 2001. p. 297–308.
Halevy A. Why your data won’t mix. ACM Queue. 2003;3(8):50–8.
Ludäscher B, Gupta A, Martone ME. Model-based mediation with domain maps. In: Proceedings of the 17th International Conference on Data Engineering; 2001. p. 81–90.
Qi Y, Candan KS, Sapino ML, Kintigh KW. Integrating and querying taxonomies with quest in the presence of conflicts. In: Proceedings of the ACM SIGMOD International Conference on Management of Data; 2007. p. 1153–5.
Searls DB. Data integration: challenges for drug discovery. Nat Rev Drug Discov. 2005;4:45–58.
National Research Council (US) Committee on Applied and Theoretical Statistics. Steps Toward Large-Scale Data Integration in the Sciences: Summary of a Workshop. 2010. available from http://www.ncbi.nlm.nih.gov/books/NBK45678/.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Section Editor information
Rights and permissions
Copyright information
© 2018 Springer Science+Business Media, LLC, part of Springer Nature
About this entry
Cite this entry
Gupta, A. (2018). Information Integration Techniques for Scientific Data. In: Liu, L., Özsu, M.T. (eds) Encyclopedia of Database Systems. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-8265-9_1303
Download citation
DOI: https://doi.org/10.1007/978-1-4614-8265-9_1303
Published:
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4614-8266-6
Online ISBN: 978-1-4614-8265-9
eBook Packages: Computer ScienceReference Module Computer Science and Engineering