Abstract
Selection-dynamic data integration employs a set of known data sources attached to an integration system. To answer a given query, suitable sources are selected from this set and dynamically integrated. This procedure requires a method that determines how suitable each individual data source is within a short timeframe, which rules out conventional schema matching approaches. We developed a registry component for our DynaGrid virtual data source which analyzes data sources upon registration and constructs a catalog of schema fragments grouped by content and cohesion. Given a concrete query, it provides a ranked list of data sources capable of contributing to answering the query. In this paper, we first give an overview of dynamic data integration and the DynaGrid virtual data source. We then present the design and the functionality of the registry component and illustrate its task in the overall process of selection-dynamic data integration.
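The registration-then-ranking flow described in the abstract can be illustrated with a minimal Python sketch. It is not the DynaGrid registry: the class name `Registry`, the methods `register` and `rank`, the one-fragment-per-table rule, and the term-overlap score are all assumptions made for illustration only. The sketch shows only the general idea that the expensive analysis happens once at registration, so that a query can be answered with a cheap catalog lookup.

```python
from dataclasses import dataclass, field


@dataclass
class Registry:
    """Toy catalog mapping each source to its schema fragments (hypothetical)."""

    # source name -> list of fragments; each fragment is a set of attribute names
    catalog: dict[str, list[frozenset[str]]] = field(default_factory=dict)

    def register(self, source: str, tables: dict[str, list[str]]) -> None:
        """Analyze a source once, at registration time.

        Assumption: one fragment per table, holding its lower-cased attributes.
        """
        self.catalog[source] = [
            frozenset(attr.lower() for attr in attrs) for attrs in tables.values()
        ]

    def rank(self, query_terms: list[str]) -> list[tuple[str, float]]:
        """Return (source, score) pairs, best first; zero-score sources are dropped."""
        wanted = {term.lower() for term in query_terms}
        if not wanted:
            return []
        scores = []
        for source, fragments in self.catalog.items():
            # Assumed score: best fraction of query terms covered by one fragment.
            best = max(
                (len(wanted & frag) / len(wanted) for frag in fragments),
                default=0.0,
            )
            if best > 0:
                scores.append((source, best))
        # Sort by descending score, then by name for a deterministic order.
        return sorted(scores, key=lambda pair: (-pair[1], pair[0]))


registry = Registry()
registry.register("library_db", {"book": ["title", "author", "isbn"]})
registry.register("shop_db", {"product": ["title", "price"], "customer": ["name"]})
registry.register("hr_db", {"employee": ["name", "salary"]})

ranking = registry.rank(["title", "author"])
print(ranking)
```

In this toy setup, `library_db` covers both query terms within a single fragment and therefore ranks ahead of `shop_db`, which covers only one; `hr_db` covers none and is omitted. A real registry would presumably replace the exact string overlap with semantic grouping of fragments, as the abstract's mention of content and cohesion suggests.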
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Husemann, M., Ritter, N. (2010). Data Source Management and Selection for Dynamic Data Integration. In: Lacroix, Z. (eds) Resource Discovery. RED 2009. Lecture Notes in Computer Science, vol 6162. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-14415-8_4
DOI: https://doi.org/10.1007/978-3-642-14415-8_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-14414-1
Online ISBN: 978-3-642-14415-8
eBook Packages: Computer Science (R0)