Skip to main content

DBMSs with Native XML Support: Towards Faster, Richer, and Smarter Data Management

  • Conference paper
Advances in Web and Network Technologies, and Information Management (APWeb 2007, WAIM 2007)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4537))

  • 2409 Accesses

Abstract

XML provides a natural mechanism for representing semi-structured and unstructured data. It becomes the basis for encoding a large variety of information, for example, the ontology. To exploit the full potential of XML in supporting advanced applications, we must solve two issues. First, the integration of structured (relational) data and unstructured or semi-structured data, and on a higher level, the integration of data and knowledge. In this talk, we will address these two issues by introducing a solution that leverages the power of pure XML support in DB2 9.

The semistructured and structured data models represent two seemingly conflicting philosophies: one focuses on being flexible and self-describing, and the other focuses on leveraging the rigid data schema for a wide range of benefits in traditional data management. For many applications such as e-commerce that depend heavily on semistructured data, the relational model, with its rigid schema requirements, fails to support them in an effective way; on the other hand, the flexibility of XML in modeling semistructured data comes with a big cost in terms of storage and query efficiency, which to a large extent has impeded the deployment of pure XML databases to handle such data. We introduce a new approach called eXtricate that taps on the advantages of both philosophies. We argue that semistructured documents, such as data in an E-catalog, often share a considerable amount of information, and by regarding each document as consisting of a shared framework and a small diff script, we can leverage the strengths of relational and XML data- bases at the same time to handle such data effectively. We also show that our approach can be seamlessly integrated into the emerging support of native XML data in commercial DBMSs (e.g., IBM’s recent DB2 9 release with Native XML Support). Our experiments validate the amount of redundancy in real e-catalog data and show the effectiveness of our method.

The database community is on a constant quest for better integration of data management and knowledge management. Recently, with the increasing use of ontology in various applications, the quest has become more concrete and urgent. However, manipulating knowledge along with relational data in DBMSs is not a trivial undertaking. In this paper, we introduce a novel, unified framework for managing data and domain knowledge. We provide the user with a virtual view that unifies the data, the domain knowledge and the knowledge inferable from the data using the domain knowledge. Because the virtual view is in the relational format, users can query the data and the knowledge in a seamlessly integrated manner. To facilitate knowledge representation and inferencing within the database engine, our approach leverages native XML support in hybrid relational-XML DBMSs. We provide a query rewriting mechanism to bridge the difference between logical and physical data modeling, so that queries on the virtual view can be automatically transformed to components that execute on the hybrid relational-XML engine in a way that is transparent to the user.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Author information

Authors and Affiliations

Authors

Editor information

Kevin Chen-Chuan Chang Wei Wang Lei Chen Clarence A. Ellis Ching-Hsien Hsu Ah Chung Tsoi Haixun Wang

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Wang, M. (2007). DBMSs with Native XML Support: Towards Faster, Richer, and Smarter Data Management. In: Chang, K.CC., et al. Advances in Web and Network Technologies, and Information Management. APWeb WAIM 2007 2007. Lecture Notes in Computer Science, vol 4537. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-72909-9_29

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-72909-9_29

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-72908-2

  • Online ISBN: 978-3-540-72909-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics