Abstract
As the popularity of eXtensible Markup Language (XML) continues to increase at an astonishing pace, data management systems for storing and querying large repositories of XML data are urgently needed. In this paper, we investigate an Object-Relational DBMS (ORDBMS) for storing and querying XML data. We present an algorithm, called XORator, for mapping XML documents to tables in an ORDBMS. An important part of this mapping is assigning a fragment of an XML document to a new XML data type. We demonstrate that using the XORator algorithm, an ORDBMS is usually more efficient than a Relational DBMS (RDBMS). Based on an actual implementation in DB2 V.7.2, we compare the performance of the XORator algorithm with a well-known algorithm for mapping XML data to an RDBMS. Our experiments show that the XORator algorithm requires less storage space, has much faster loading times, and in most cases can evaluate queries faster. The primary reason for this performance improvement is that the XORator algorithm results in a database that is smaller in size, and queries that usually have fewer number of joins.
This work was supported in part by the National Science Foundation under NSF grant IIS-0208852, by an IBM CAS Fellowship, and by gift donations from IBM and NCR.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Software AG. Tamino-The Information Server for Electronic Business, 2000. http://www.softwareag.com/tamino/.
P. Bohannon, J. Freire, P. Roy, and J. Simeón. From XML Schema to Relations: A Cost-Based Approach to XML Storage. In Proceedings IEEE International on Data Engineering, San Jose, California, February 2002.
J. Bosak. The Plays of Shakespeare in XML, July 1999. http://metalab.unc.edu/xml/examples/shakespeare/.
J. Bosak, T. Bray, D. Connolly, E. Maler, G. Nicol, C.M. Sperberg-McQueen, L. Wood, and J. Clark. W3C XML Specification DTD, June 1998. http://www.w3.org/XML/1998/06/xmlspec-report-19980910.htm.
T. Bray, J. Paoli, C.M. Sperberg-McQueen, and E. Maler. Extensible Markup Language (XML), October 2000. http://www.w3.org/TR/REC-xml.
M.J. Carey, J. Kiernan, J. Shanmugasundaram, E.J. Shekita, and S.N. Subramanian. XPERANTO: A Middleware for Publishing Object-Relational Data as XML Documents. In Proceedings International Conference Very Large Data Bases, pages 646–648, Cairo, Egypt, September 2000.
D. Chamberlin. Using The New DB2: IBM’s Object-Relational Database System. Morgan Kaufmann Publishers, Inc., San Francisco, California, 1996.
IBM Corporation. XML Parser for Java., February 1998. http://www.alphaworks.ibm.com/tech/xml4j.
IBM Corporation. IBM XML Generator, September 1999. http://www.alphaworks.ibm.com/tech/xmlgenerator.
IBM Corporation. IBM DB2 UDB XML Extender Adminstration and Programming, March 2000. http://www-4.ibm.com/software/data/db2/extenders/xmlext/docs/v71wrk/dxx%awmst.pdf.
Oracle Corporation. Oracle XML SQL Utility. http://otn.oracle.com/tech/xml/oracle_xsu/content.html.
A. Deutsch, M. F. Fernandez, and D. Suciu. Storing Semistructured Data with STORED. In Proceedings of the ACM SIGMOD International Conference on Management of Data, pages 431–442. ACM Press, 1999.
M. F Fernandez, Dan Suciu, and Wang-Chiew Tan. SilkRoute: Trading between Relations and XML. In Proceedings of Ninth International World Wide Web Conference, 2000.
D. Florescu, G. Graefe, G. Moerkotte, H. Pirahesh, and H. Schöning. Panel: XML Data Management: Go Native or Spruce up Relational Systems? In Proceedings of the ACM SIGMOD International Conference on Management of Data, Santa Barbara, California, May 2001. (Panel Chair: Per-Ake Larson).
D. Florescu and D. Kossmann. Storing and Querying XML Data using an RDBMS. Bulletin of the Technical Committee on Data Engineering, 22(3):27–34, 1999.
G. Kappel, E. Kapsammer, S. Raush-Schott, and W. Retschzegger. X-Ray-Towards Integrating XML and Relational Database Systems. In International Conference on Conceptual Modeling (ER), pages 339–353, Utah, USA, October 2000.
M. Klettke and H. Meyer. XML and Object-Relational Database Systems-Enhancing Structural Mappings Based on Statistics. In International Workshop on the Web and Databases, Dallas, Texas, May 2000.
D. Lee and W. W. Chu. Constraints-Preserving Transformation from XML Document Type Definition to Relational Schema. In International Conference on Conceptual Modeling (ER), October 2000.
H. Liefke and D. Suciu. XMill: an Efficient Compressor for XML Data. In Proceedings of the ACM SIGMOD International Conference on Management of Data, pages 153–164, Dallas, Texas, May 2000.
J. McHugh, S. Abiteboul, R. Goldman, D. Quass, and J. Widom. Lore: A Database Management System for Semistructured Data. SIGMOD Record, 26(3):54–66, September 1997.
Sigmod Record. Sigmod Record: XML Edition. http://www.dia.uniroma3.it/Araneus/Sigmod/Record/DTD/.
K. Runapongsa and J. M. Patel. Storing and Querying XML Data in ORDBMSs. University of Michigan, Technical Report, 2002.
M. Rys. State-of-the-art Support in RDBMS:Microsoft SQL Server’s XML Features. Bulletin of the Technical Committee on Data Engineering, 24(2):3–11, June 2001.
A.R. Schmidt, M.L. Kersten, M. Windhouwer, and F. Waas. Efficient Relational Storage and Retrieval of XML Documents. In WebDB’2000 Third International Workshop on the Web and Databases, Dallas, Texas, May 2000.
J. Shanmugasundaram, K. Tufte, G. He, C. Zhang, D. DeWitt, and J. Naughton. Relational Databases for Querying XML Documents: Limitations and Opportunities. In Proceedings International Conference Very Large Data Bases, pages 302–314, Edinburgh, Scotland, September 1999.
T. Shimura, M. Yoshikawa, and S. Uemura. Storage and Retrieval of XML Documents Using Object-Relational Databases. In International Conference on Database and Expert Systems Applications, pages 206–217, Florence, Italy, September 1999.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Runapongsa, K., Patel, J.M. (2002). Storing and Querying XML Data in Object-Relational DBMSs. In: Chaudhri, A.B., Unland, R., Djeraba, C., Lindner, W. (eds) XML-Based Data Management and Multimedia Engineering — EDBT 2002 Workshops. EDBT 2002. Lecture Notes in Computer Science, vol 2490. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36128-6_15
Download citation
DOI: https://doi.org/10.1007/3-540-36128-6_15
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-00130-0
Online ISBN: 978-3-540-36128-2
eBook Packages: Springer Book Archive