ABSTRACT
XML parsing is generally known to have poor performance characteristics relative to transactional database processing. Yet, its potentially fatal impact on overall database performance is being underestimated. We report real-word database applications where XML parsing performance is a key obstacle to a successful XML deployment. There is a considerable share of XML database applications which are prone to fail at an early and simple road block: XML parsing. We analyze XML parsing performance and quantify the extra overhead of DTD and schema validation. Comparison with relational database performance shows that the desired response times and transaction rates over XML data can not be achieved without major improvements in XML parsing technology. Thus, we identify research topics which are most promising for XML parser performance in database systems.
- Barton, C., Charles, P., Goyal, D., Raghavachari, M., Josifovski, V., and Fontoura, M.,: Streaming XPath Processing with Forward and Backward Axes. ICDE 2003Google Scholar
- Bourret, R.: XML Database Products. http://www.rpbourret.com/xml/XMLDatabaseProds.htmGoogle Scholar
- Expway: Bin-XMLTM for encoding XML documents. http://www.expway.com/graph/Bin-XMLTechnical%20White%20Paper-jan03.pdf, 2003Google Scholar
- IBM XML for C++, http://www.alphaworks.ibm.com/tech/xml4cGoogle Scholar
- Josifovski, V., Fontoura, M., and Barta, A.: Enabling relational engines to query XML streams. IBM Internal publ., 2002Google Scholar
- Kim, S., Lee, M., and Lee, K.: Immediate and Partial Validation Mechanism for the Conflict Resolution of Update Operations in XML Databases. Advances in Web-Age Information Management (WAIM), 2002: 387--396 Google ScholarDigital Library
- Tatarinov, I., Viglas, S., Beyer, K., Shanmugasundaram, J., Shekita, E. J., and Zhang C.: Storing and querying ordered XML using a relational database system. SIGMOD Conference 2002: 204--215 Google ScholarDigital Library
- XML Applications and Initiatives, http://xml.coverpages.org/xmlApplications.htmGoogle Scholar
- XML on Wall Street, http://lighthouse-partners.com/xmlGoogle Scholar
Index Terms
- XML parsing: a threat to database performance
Recommendations
XML screamer: an integrated approach to high performance XML parsing, validation and deserialization
WWW '06: Proceedings of the 15th international conference on World Wide WebThis paper describes an experimental system in which customized high performance XML parsers are prepared using parser generation and compilation techniques. Parsing is integrated with Schema-based validation and deserialization, and the resulting ...
A space efficient XML DOM parser
In many XML applications, parsing is a key operation. When the processing involves modifying data, random access, and/or in an order different from the one in which elements are stored, a DOM parser has to be used. A major problem with using a DOM ...
Mapping of bibliographical standards into XML
The most popular bibliographical standards, which prescribe the exchange of bibliographical data in machine readable form, are MARC (Machine Readable Cataloguing) and UNIMARC (Universal Machine Readable Cataloguing). This paper presents two schemas, ...
Comments