Abstract
As the first step of most XML processing algorithms, one usually extracts token content out of the source document into many discrete string objects. We propose a “non-extractive” tokenization approach that maintains the source document intact in memory. Using a binary encoding specification called Virtual Token Descriptor (VTD), the processing model represents tokens exclusively using starting offset and length. To create a hierarchical view of the data encapsulated in the SOAP message, the parser further indexes elements of same depths using directory-like structures we call location cache. Through a demonstration of navigating the document hierarchy using VTD and location caches, we show that it is indeed possible to create a cursor-based API that retains most of DOM’s random-access capabilities at a fraction of its memory usage. Furthermore, by analyzing key design constraints of custom hardware, we reason that the memory conserving characteristics of the processing model simultaneously make possible “SOAP on a chip” and “binary-enhanced SOAP.” The benchmark results show that the reference implementation of our processing model significantly outperforms Xerces DOM in terms of both memory and processing performance.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
DOM, SAX and JDOM, http://www.brics.dk/~amoeller/XML/programming/
XML Applications and Initiatives, http://xml.coverpages.org/xmlApplications.htm
XML protocol requirements, http://www.w3.org/TR/2002/WD-xmlp-reqs-20020626
Bosworth, A.: Loosely speaking, http://www.fawcette.com/xmlmag/2002_04/magazine/departments/endtag/
Matthias, N., Jasmi, J.: XML Parsing: A Threat to Database Performance. In: CIKM 2003 (2003)
Salz, R.: Processing SOAP header, http://webservices.xml.com/pub/a/ws/2002/07/17/salz.html
Apache Axis User’s Guide, http://ws.apache.org/axis/java/user-guide.html
Micron’s Samurai DDR Chipset, http://www.sysopt.com/articles/samurai/
Zhang, C., Dewitt, D., Chen, J., Tian, F.: The Design and Performance Evaluation of Alternative XML Storage Strategies. SIGMOD Record 31(1), 5–10 (2002)
Chiu, K., Govindaraju, M., Bramley, R.: Investigating the limits of SOAP performance for scientific computing. In: The 11th IEEE International Symposium on High Performance Distributed Computing HPDC-11 2002 (HPDC 2002) (July 2002)
Introduction to DES, http://www.rsasecurity.com/rsalabs/node.asp?id=2226
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zhang, J. (2004). SOAP Processing: A Non-extractive Approach. In: Zhang, LJ.(., Jeckle, M. (eds) Web Services. ECOWS 2004. Lecture Notes in Computer Science, vol 3250. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30209-4_12
Download citation
DOI: https://doi.org/10.1007/978-3-540-30209-4_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23202-5
Online ISBN: 978-3-540-30209-4
eBook Packages: Springer Book Archive