Storing and Querying XML Data in Object-Relational DBMSs

  • Conference paper
XML-Based Data Management and Multimedia Engineering — EDBT 2002 Workshops (EDBT 2002)

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 2490)

Abstract

As the popularity of eXtensible Markup Language (XML) continues to increase at an astonishing pace, data management systems for storing and querying large repositories of XML data are urgently needed. In this paper, we investigate the use of an Object-Relational DBMS (ORDBMS) for storing and querying XML data. We present an algorithm, called XORator, for mapping XML documents to tables in an ORDBMS. An important part of this mapping is assigning a fragment of an XML document to a new XML data type. We demonstrate that, using the XORator algorithm, an ORDBMS is usually more efficient than a Relational DBMS (RDBMS). Based on an actual implementation in DB2 V.7.2, we compare the performance of the XORator algorithm with that of a well-known algorithm for mapping XML data to an RDBMS. Our experiments show that the XORator algorithm requires less storage space, has much faster loading times, and in most cases can evaluate queries faster. The primary reason for this performance improvement is that the XORator algorithm produces a smaller database and queries that usually have fewer joins.
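
The contrast the abstract draws between the two mappings can be illustrated with a minimal sketch. It is not the XORator algorithm itself, whose rules are not given here. It assumes a hypothetical sample document, hypothetical table and function names, and SQLite with a plain TEXT column standing in for the XML data type of an ORDBMS. The first mapping shreds every repeated element into its own table, and the second keeps a fragment of the document in a single column.

```python
import sqlite3
import xml.etree.ElementTree as ET

# Hypothetical sample document; not taken from the paper.
DOC = (
    "<act>"
    "<speech><speaker>A</speaker><line>one</line><line>two</line></speech>"
    "<speech><speaker>B</speaker><line>three</line></speech>"
    "</act>"
)

def load_shredded(conn, root):
    """Relational-style mapping: each repeated element gets its own table."""
    conn.execute("CREATE TABLE speech (id INTEGER PRIMARY KEY, speaker TEXT)")
    conn.execute("CREATE TABLE line (speech_id INTEGER, pos INTEGER, body TEXT)")
    for sid, speech in enumerate(root.iter("speech")):
        conn.execute("INSERT INTO speech VALUES (?, ?)",
                     (sid, speech.findtext("speaker")))
        for pos, line in enumerate(speech.findall("line")):
            conn.execute("INSERT INTO line VALUES (?, ?, ?)", (sid, pos, line.text))

def load_fragment(conn, root):
    """Fragment mapping: the <line> subtree is kept as one column value.
    TEXT is an assumed stand-in for an XML data type."""
    conn.execute(
        "CREATE TABLE speech_x (id INTEGER PRIMARY KEY, speaker TEXT, lines_xml TEXT)")
    for sid, speech in enumerate(root.iter("speech")):
        fragment = "".join(ET.tostring(l, encoding="unicode")
                           for l in speech.findall("line"))
        conn.execute("INSERT INTO speech_x VALUES (?, ?, ?)",
                     (sid, speech.findtext("speaker"), fragment))

def lines_shredded(conn, speaker):
    """Needs a join between the parent and child tables."""
    rows = conn.execute(
        "SELECT l.body FROM speech s JOIN line l ON l.speech_id = s.id "
        "WHERE s.speaker = ? ORDER BY s.id, l.pos", (speaker,))
    return [r[0] for r in rows]

def lines_fragment(conn, speaker):
    """Single-table lookup; the fragment is unpacked after retrieval."""
    rows = conn.execute(
        "SELECT lines_xml FROM speech_x WHERE speaker = ? ORDER BY id", (speaker,))
    out = []
    for (frag,) in rows:
        wrapper = ET.fromstring("<f>" + frag + "</f>")
        out.extend(l.text for l in wrapper.findall("line"))
    return out

def row_count(conn, table):
    return conn.execute(f"SELECT COUNT(*) FROM {table}").fetchone()[0]

conn = sqlite3.connect(":memory:")
root = ET.fromstring(DOC)
load_shredded(conn, root)
load_fragment(conn, root)
```

In this toy setup both query functions return the same lines for a given speaker, but the fragment mapping stores fewer rows and answers the query without a join. In an ORDBMS the unpacking step would presumably be done by methods on the XML type rather than in application code.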

This work was supported in part by the National Science Foundation under NSF grant IIS-0208852, by an IBM CAS Fellowship, and by gift donations from IBM and NCR.

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Runapongsa, K., Patel, J.M. (2002). Storing and Querying XML Data in Object-Relational DBMSs. In: Chaudhri, A.B., Unland, R., Djeraba, C., Lindner, W. (eds) XML-Based Data Management and Multimedia Engineering — EDBT 2002 Workshops. EDBT 2002. Lecture Notes in Computer Science, vol 2490. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36128-6_15

  • DOI: https://doi.org/10.1007/3-540-36128-6_15

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-00130-0

  • Online ISBN: 978-3-540-36128-2

  • eBook Packages: Springer Book Archive
