Abstract
In many cases, it is pretty difficult to choose an efficient storage method, such as native, xml-enabled or hybrid, for storing XML documents in a relational database. We provide multiple storage approaches for XML documents in our hybrid XML-relational database PXRDB( Pure XML-Relational DataBase). Further, another problem is how to automatically choose storage method for a given XML document and whether different documents in same column can be stored in different formats. In this paper, we provide a content-aware adaptive storage approach for XML in PXRDB. This novel storage approach automatically selects one better storage scheme for a specific XML document from three candidate schemata, i.e., native storage, flat stream and multi-relations after fast-checking its content. Our approach frees end-users or administrators from either having no choice or having to specify the specific storage scheme for large number of XML documents manually. It also allows different XML documents in same relational column to be stored in different formats while being accessed indistinctively. By providing unified access interfaces, new storage approaches can be easily registered in our system. The performance evaluation illustrates our approach is feasible and effective.
Partly supported by National 863 High Tech. Project (No. 2009AA01Z149), the Important National Science & Technology Specific Projects of China ("HGJ" Projects, Grant No.2010ZX01042-002-002-03), and National Natural Science Foundation of China (Grant No.61070054).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Rys, M.: XML and Relational Database Management Systems: inside Microsoft SQL Server 2005. In: Proc. ACM SIGMOD Int. Conf. on Management of Data, pp. 958–962 (2005)
Microsoft. White Paper: What’s New for XML in SQL Server, White Paper (2008)
Shanmugasundaram, J., Tufte, K., He, G., Zhang, C., DeWitt, D., Naughton, J.: Relational Databases for Querying XML Documents: Limitations and Opportunities. In: VLDB 1999 (1999)
Tian, F., DeWitt, D., Chen, J., Zhang, C.: The Design and Performance Evaluation of Alternative XML Storage Strategies. ACM Sigmod Record 31(1) (March 2002)
Florescu, D., Kossmann, D.: A Performance Evaluation of Alter native mapping Schemas for Storing XML Data in a Relational Database. In: Proc. of the VLDB 1999 (1999)
Shanmugasundaram, J., Tufte, K., He, G., et al.: Relational Databases for Querying XML Documents: Limitations and Opportunities. In: VLDB 1999 (1999)
Boncz, P.A., Grust, T., Keulen, M., Manegold, S., Rittinger, J., Teubner, J.: MonetDB/XQuery: a fast XQuery processor powered by a relational engine. In: SIGMOD Conference 2006, pp. 479–490 (2006)
Ennser, L., Delporte, C., Oba, M., Sunil, K.: Integrating XML with DB2 XML Extender and DB2 Text Extender. IBM Redbooks (2001)
Profressional XML, Wrox Press (2000)
DB2 goes hybrid Integrating native XML and XQuery with relational data and SQL 2006 (2006)
Liu, Z.H., Chandrasekar, S., Baby, T., Chang, H.J.: Towards a Physical XML independent XQuery/SQL/XML Engine. In: Proc. 34th Int. Conf. on Very Large Data Bases, pp. 1356–1367 (2008)
Chebotko, A.: Reconstructing XML subtrees from Relational Storage of XML documents. In: ICDE 2004 (2004)
Florescu, D., Kossmann, D.: A Performance Evaluation of Alternative Mapping Schemes for Storing XML in A Relational Database. Technical Report 3680, INRIA (1999)
Kanne, C.-C., Moerkotte, G.: A Linear Time Algorithm for Optimal Tree Sibling Partitioning and Approximation Algorithms in Natix. In: Proc. 32th Int. Conf. on Very Large Data Bases, (September 2006)
Fiebig, T., Helmer, S., Kanne, C.-C., Mildenberger, J., Moerkotte, G., Schiele, R., Westmann, T.: Anatomy of a Native XML Base Management System. The VLDB Journal 11(4), 292–314 (2002)
Kanne, C.-C., Moerkotte, G.: Efficient Storage of XML Data. In: Proc. 16th Int. Conf. on Data Engineering, pp. 198–209 (2000)
Beyer, K.S., Cochrane, R., Josifovski, V., Kleewein, J., Lapis, G., Lohman, G.M., Lyle, B., Ozcan, F., Pirahesh, H., Seemann, N., Truong, T.C., der Linden, B.V., Vickery, B., Zhang, C.: System RX: One Part Relational, One Part XML. In: Proc. ACM SIGMOD Int. Conf. on Management of Data, pp. 347–358 (2005)
Funderburk, J.E., Kiernan, G., Shanmugasundaram, J., Shekita, E., Wei, C.: XTABLES: Bridging relational technology and XML. IBM Systems Journal 41(4), 616–641 (2002)
Zhang, N., Agarwal, N., Chandrasekar, S.: Sum Idicula.: Binary XML Storage and Query Processing in Oracle 11g. In: Proc. 35th Int. Conf. on Very Large Data Bases (June 2009)
Jagadish, H.V., Al-Khalifa, S., Chapman, A., Lakshmanan, L.V.S., Nierman, A., Paparizos, S., Patel, J.M., Srivastava, D., Wiwatwattana, N., Wu, Y., Yu, C.: TIMBER: A Native XML Database. VLDB Journal 11(1), 274–291 (2002)
Meier, W.: eXist: An Open Source Native XML Database, http://exist-db.org
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wang, X., Zhang, X., Du, X., Wang, S., Liu, K. (2011). A Content-Aware Adaptive Storage Approach for XML in PXRDB. In: Xu, J., Yu, G., Zhou, S., Unland, R. (eds) Database Systems for Adanced Applications. DASFAA 2011. Lecture Notes in Computer Science, vol 6637. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-20244-5_44
Download citation
DOI: https://doi.org/10.1007/978-3-642-20244-5_44
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-20243-8
Online ISBN: 978-3-642-20244-5
eBook Packages: Computer ScienceComputer Science (R0)