Abstract
To answer queries, many XML management systems perform structural joins, i.e., they determine all occurences of parent/child or ancestor/descendant relationships between node sets. These joins are often one of the most time-consuming phases in query evaluation, so it is desirable to reduce the size of the node sets before performing the joins. This problem has earlier been approached by using signatures built on the content of the nodes, but in this paper, we propose a novel method in which the nodes are filtered based on the structural properties of their subtrees. To achieve this, we use a schema graph which summarizes the structures of XML documents more accurately than conventional summarization methods.
Chapter PDF
References
World Wide Web Consortium. Extensible Markup Language (XML) 1.0 (2000), http://www.w3c.org/TR/REC-xml
Chaudri, A.B., Rashid, A., Zicari, R.: XML Data Management: Native XML and XML-Enabled Database Systems. Addison-Wesley, Reading (2003)
World Wide Web Consortium. XML Path Language (XPath) Version 1.0 (2000), http://www.w3c.org/TR/xpath
Dietz, P.F.: Maintaining order in a linked list. In: Proc. of the 14th ACM Symposium on Theory of Computing, pp. 122–127 (1982)
Kha, D.D., Yoshikawa, M., Uemura, S.: An XML indexing structure with relative region coordinate. In: Proc. of the 17th IEEE Intl Conf. on Data Engineering, pp. 212–220 (2001)
Lee, Y.K., Yoo, S., Yoon, K., Berra, B.: Index structures for structured documents. In: Proc. of the 1st Intl Conf. on Digital Libraries, pp. 91–99 (1996)
Luoma, O.: Indexing XML data with a schema graph. In: Proc. of the IASTED Intl Conf. on Databases and Applications, pp. 274–279 (2004)
McHugh, J., Abiteboul, S., Goldman, R., Quass, R., Widom, J.: Lore: A database management system for semistructured data. SIGMOD Record 26(3), 54–66 (1997)
Shin, D., Jang, H., Jin, H.: BUS: An effective indexing and retrieval scheme in structured documents. In: Proc. of the 3rd ACM Intl Conf. on Digital Libraries, pp. 235–243 (1998)
Tak, T.W., Annevelink, J.: Integrating a structured-text retrieval system with an object-oriented database system. In: Proc. of the 20th Intl Conf. on Very Large Databases, pp. 740–749 (1994)
Kanne, C.C., Moerkotte, G.: Efficient storage of XML data. In: Poster abstract in Proc. of the 16th Intl Conf. on Data Engineering, p. 198 (2000)
Chen, Y., Aberer, K.: Combining pat-trees and signature files for query evaluation in document databases. In: Proc. of the 10th Intl Conf. on Database and Expert Systems Applications, pp. 473–484 (1999)
Park, S., Kim, H.J.: A new query processing techique for XML based on signature. In: Proc. of the 7th Intl Conf. on Database Systems for Advanced Applications, pp. 22–31 (2001)
Yoshikawa, M., Amagasa, T., Shimura, T., Uemura, S.: XRel: A path-based approach to storage and retrieval of XML documents using relational databases. ACM Transactions on Internet Technologies 1(1), 110–141 (2001)
Bosak, J.: The complete plays of Shakespeare marked up in XML, http://www.ibiblio.org/xml/examples/shakespeare
Busse, R., Carey, M., Florescu, D., Kersten, M., Manolescu, I., Schmidt, A., Waas, F.: XMark - an XML benchmark project, http://monetdb.cwi.nl/xml/index.html
Ley, M.: Digital bibliography library project, http://dblp.uni-trier.de/
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Luoma, O. (2004). A Structure-Based Filtering Method for XML Management Systems. In: Galindo, F., Takizawa, M., Traunmüller, R. (eds) Database and Expert Systems Applications. DEXA 2004. Lecture Notes in Computer Science, vol 3180. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30075-5_39
Download citation
DOI: https://doi.org/10.1007/978-3-540-30075-5_39
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22936-0
Online ISBN: 978-3-540-30075-5
eBook Packages: Springer Book Archive