Abstract
As XML is becoming a de facto standard data exchange format for web-based business applications, it is imperatively required to integrate semantically heterogeneous XML data sources. In this paper, we study a semantic integration of heterogeneous XML data sources. First, we consider a common data model that is designed to capture semantics of XML data. Second, we define semantic conflicts in the context of XML data, and resolve them using the rule-based method. Third, we develop a semantic integration technique of XML data using XML view mechanism. We describe how our approach has been used to integrate heterogeneous XML data sources providing various object-oriented abstraction facilities such as generalization, specialization and aggregation.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
S. Abiteboul, A. Bonner. Objects and Views, In Proceedings of the ACM SIGMOD Conference, Denver, Colorado, 1991.
S. Abiteboul, P. Buneman and D. Suciu. Data on the Web, Morgan Kaufmann publishers, 2000.
S. Abiteboul, S. Cluet and T. Milo. Correspondence and translation for heterogeneous data. In Proceedings of ICDT, pp. 351–363, 1997.
S. Abiteboul, On Views and XML, ACM SIGMOD Record, Vol. 28, No. 4, pp. 30–38, 1999.
R. Ahmed, et al., “The Pegasus Heterogeneous Multidatabase System”, IEEE Computer, Vol. 24, No. 12, pp. 19–27, 1991.
C. Batini, M. Lenzerini and S. B. Navathe. A Comparative Analysis of Methodologies for Database Schema Integration, ACM Computing Surveys, Vol. 18, No. 4, Dec. 1986.
S. Bergamaschi, S. Castano and M. Vincini, Semantic Integration of Semistructured and Data Sources. SIGMOD Record Special Issue on Semantic Interoperability in Global Information, Vol. 28, No. 1, March 1999.
T. Bray, J. Paoli and C. M. Sperberg-McQueen. Extensible Markup Language (XML) 1.0, http://www.w3c.org/TR/REC-xml.
M. J. Carey, et al., Towards Heterogeneous Multimedia Information Systems: The Garlic Approach, In Proceedings of the Fifth International Workshop on Research Issues in Data Engineering, 1995.
S. Cluet, C. Delobel, J. Simeon, and K. Sgaga. Your mediator needs data conversion! In Proceedings of ACM SIGMOD Conference, Seattle, Washington, June 1998.
A. Elmagarmid and C. Pu, eds., Special Issue on Heterogeneous Databases, ACM Computing Surveys, Vol. 22, No. 3, Sept. 1990.
D. C. Fallside, XML Schema Part 0: Primer, http://www.w3c.org/TR/xmlschema-0
C. Forgy. Rete: A Fast Algorithm for the Many Pattern/Many Object Pattern Match Problem. Artificial Intelligence, Vol. 19, No. 1, pp. 17–37, 1982.
H. Garcia-Molina et al. The TSIMMIS project: Integration of heterogeneous information sources. Journal of Intelligent Information Systems, Vol. 8, No. 2, pp. 117–132, 1997.
M. R. Genesereth and S. P. Ketchpel, Software Agent, Communications of the ACM, Vol. 37, No. 7, pp. 48–53, 1994.
JESS, The expert system schell for the java platform, http://herzberg.ca.sandia.gov/Jess
W. Kim, Modern Database Systems: The Object Model, Interoperability, and Beyond. Addison Wesley, 1995.
W. Kim, I. Choi, S. Gala and M. Scheevel, On resolving schematic heterogeneity in multidatabase systems. Distributed and Parallel Databases, Vol. 1, No. 3, pp. 251–279, 1993.
D. Lee and W. W. Chu, Comparative Analysis of Six XML Schema Languages, ACM SIGMOD Record, Vol. 29, No. 3, September, 2000.
B. Ludascher, Y. Papakonstantinou, P. Velikhov. Navigation-Driven Evaluation of Virtual Mediated View. In Proceedings of EDBT conference, Konstanz, Germany, March 2000.
Y. Papakonstantinou, H. Garcia-Molina and J. Widom. Object Exchange Across Heterogeneous Information Sources, In Proceedings of IEEE International Conference on Data Engineering, pp. 251–260, Taiwan, March, 1995.
Y. Papakonstantinou and P. Velikhov, Enhancing Semistructured Data Mediators with Document Type Definitions, In proceedings of the IEEE International Conference on Data Engineering, 1999.
Y. Papakonstantinou, S. Abiteboul, and H. Garcia-Molina. Object fusion in mediator systems. In Proceedings of VLDB Conference, 1996.
C. Reynaud, J. Sirot and D. Vodislav. Semantic Integration of XML Heterogeneous Data Sources. In Proceedings of the 2001 International Database Engineering & Applications Symposium, Grenoble, France, 2001.
G. Wiederhold. Mediators in the architecture of future information systems. IEEE Computer, Vol. 25, No. 3, pp. 38–49, March, 1992.
K. Zhang and D. Shasha. Simple fast algorithms for the editing distance between trees and related problems, SIAM Journal of Computing, Vol. 18, No. 6, pp. 1245–1262, Dec. 1989.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kim, H.H., Park, S.S. (2002). Semantic Integration of Heterogeneous XML Data Sources. In: Bellahsène, Z., Patel, D., Rolland, C. (eds) Object-Oriented Information Systems. OOIS 2002. Lecture Notes in Computer Science, vol 2425. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-46102-7_13
Download citation
DOI: https://doi.org/10.1007/3-540-46102-7_13
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44087-1
Online ISBN: 978-3-540-46102-9
eBook Packages: Springer Book Archive