Abstract
While most business applications typically operate on structured data that can be effectively managed using relational databases, some applications use more complex semistructured data that lacks a stable schema. XML techniques are available for the management of semistructured data, but such techniques tend to be ineffective when applied to large amounts of heterogeneous data, in particular in applications with complex query requirements. We describe an approach that relies on the mapping of multiple semistructured data sets to object-relational structures and uses an object-relational database to support complex query requirements. As an example we use weakly heterogeneous oceanographic data.
Similar content being viewed by others
References
Amornsinlaphachai, P., Rossiter, N. and Ali, M.A. (2006) Storing Linked XML Documents in Object-Relational DBMS. Journal of Computing and Information Technology (CIT) 14(3), 225–241.
Beyer, K., Cochrane, R.J., Josifovski, V., Kleewein, J., Lapis, G., Lohman, G., Lyle, B., Ozcan, F., Pirahesh, H., Seemann, N., Truong, T., Van der Linden, B., Vickery, B. and Zhang1, Ch. (2005): System RX: One Part Relational, One Part XML. In Proc. of the 2005 ACM SIGMOD Int. Conf. on Management of Data, Baltimore, Maryland, USA, ACM Press, pp. 347–358.
Boag, S., Chamberlin, D., Fernández, M.F., Florescu, D., Robie, J. and Siméon, J. (2005) XQuery 1.0: An XML Query Language, W3C Working Draft, 04 April 2005. Retrieved July 4, 2008 from: http://www.w3.org/TR/xquery/
Bourret, R. (2007) XML Database Products. Retrieved July 4, 2008 from: http://www.rpbourret.com/xml/XMLDatabaseProds.htm
Buneman, P. (1997) Semistructured Data. In Proc. of 1997 Symposium on Principles of Database Systems (PODS97), Tucson, Arizona, pp.117–121.
Clark, J. (1999): XSL Transformations (XSLT) Version 1.0. W3C Recomm. Nov 16, 1999.
Fiebig, T., Helmer, S., Kanne, C.-C., Moerkotte, G., Neumann, J., Schiele, R. and Westmann, T. (2002): Anatomy of a native XML base management system. VLDB Journal, 11(4), 292–314.
Harold, E.R. (2005) Managing XML data: Native XML databases. Retrieved July 4, 2008 from: http://www.ibm.com/developerworks/xml/library/x-mxd4.html
Isenor, A.W. and Keeley, J.R (2005) Modeling Generic Oceanographic Data Objects in XML. Computing in Science and Engineering, July/August, 58–65.
ISO/IEC 9075:2003 (2003) Information Technology, Database Languages, SQL. Part 2: Foundations.
Liu, Z.H., Krishnaprasad, M. and Arora, V. (2005): Native XQuery Processing in Oracle XMLDB. In Proc. of the ACM SIGMOD Int. Conf. on Management of Data. Baltimore, MD, pp. 828–833.
Mlýnková, I. and Pokorný, J. (2005) XML in the World of (Object-)Relational Database Systems. In Vasilecas, O., et al. (Eds.), Information Systems Development Advances in Theory, Practice, and Education 2004. Springer Science+Business Media, Inc., pp. 63–76.
OCG (2008) Geography Markup Language. Retrieved July 4, 2008 from: http://www.opengeospatial.org/standards/gml
Rahayu, J.W., Pardede, E. and Taniar, D. (2007) XML Databases: Trends, Issues, and Future Research. In The 9th Int. Conf. on Information Integration and Web-based Applications and Services (iiWAS 2007), ACS, 229, 9–10, Jakarta, Indonesia. http://www.ocg.at/publikationen/books/volumes/sr229.html
Runapongsa, K. and Patel, J.M. (2002) Storing and Querying XML Data in Object-Relational DBMSs. In XML-Based Data Management and Multimedia Engineering. EDBT 2002 Workshops, LNCS 2490/2002.
Ruttananontsatean, N. (2007) An Investigation of Query Techniques for Semistructured Data, PhD Thesis, Faculty of Information Technology, University of Technology, Sydney, Australia, November 2007.
Shanmugasundaram, J., Shekita, E., Barr, R., Carey, M., Lindsay, B., Pirahesh H. and Reinwald B. (2001) Efficiently Publishing Relational Data as XML Documents. VLDB Journal 10(2–3), 133–154.
Suresh, R., Shukla, P. and Schwenke G. (2000) XML-Based Data Systems for Earth Science Applications. In Proc. of Geoscience and Remote Sensing Symposiun (IGARSS), IEEE 2000 Int., 3, 1214–1216.
W3C (2004): Extensible Markup Language (XML) 1.1. (Second Edition), W3C Recommendation 16 August 2004. Retrieved July 4, 2008 from: http://www.w3.org/TR/xml11/
Acknowledgments
This research has been partially supported by the National Program of Research, Information Society Project No. 1ET100300419 and also by the grants of GACR No. GA201/09/0990, GA201/06/0175, and GA201/06/0648.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer Science+Business Media, LLC
About this chapter
Cite this chapter
Feuerlicht, G., Pokorný, J., Richta, K., Ruttananontsatean, N. (2009). Integration of Weakly Heterogeneous Semistructured Data. In: Papadopoulos, G., Wojtkowski, W., Wojtkowski, G., Wrycza, S., Zupancic, J. (eds) Information Systems Development. Springer, Boston, MA. https://doi.org/10.1007/b137171_8
Download citation
DOI: https://doi.org/10.1007/b137171_8
Published:
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-387-84809-9
Online ISBN: 978-0-387-84810-5
eBook Packages: Computer ScienceComputer Science (R0)