Synonyms
Dataguide; Path index; Sketch; Structural index; Structural summary; Synopsis
Definition
Structure indexing creates summaries of the structure present in semi-structured data collections by grouping data items with similar structure, providing a mechanism to index such items. Since semi-structured data models are commonly represented by labeled graphs or trees (the XML data model being a prime example), structural indexes or summaries are naturally described as graphs where nodes represent sets of data items (called extents), and where edges represent structural relationships between the corresponding extents derived from the instance data. A concrete physical index can be created by selecting appropriate data structures to store the graph and the extents.
Structure indexing helps to find data items that satisfy structural constraints in queries by locating nodes in the structural summary graph that satisfy the query conditions (expecting far less summary nodes than data...
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsRecommended Reading
Buneman P, Choi B, Fan W, Hutchison R, Mann R, Viglas S. Vectorizing and querying large XML repositories. In: Proceedings of the 21st International Conference on Data Engineering; 2005. p. 261–72.
Chung C-W, Min J-K, Shim K. APEX: an adaptive path index for XML data. In: Proceedings of the ACM SIGMOD International Conference on Management of Data; 2002. p. 121–32.
Consens MP, Milo T. Optimizing queries on files. In: Proceedings of the ACM SIGMOD International Conference on Management of Data; 1994. p. 301–12.
Consens MP, Rizzolo F, Vaisman AA. AxPRE summaries: exploring the (semi-)structure of XML web collections. In: Proceedings of the 24th International Conference on Data Engineering; 2008.p. 1519–21.
Freire J, Haritsa JR, Ramanath M, Roy P, Simeon J. StatiX: making XML count. In: Proceedings of the ACM SIGMOD International Conference on Management of Data; 2002. p. 181–91.
Goldman R, Widom J. Dataguides: enabling query formulation and optimization in semistructured databases. In: Proceedings of the 23th International Conference on Very Large Data Bases; 1997. p. 436–45.
Kaushik R, Bohannon P, Naughton JF, Korth HF. Covering indexes for branching path queries. In: Proceedings of the ACM SIGMOD International Conference on Management of Data; 2002. p. 133–44.
Kaushik R, Shenoy P, Bohannon P, Gudes E. Exploiting local similarity for indexing paths in graph-structured data. In: Proceedings of the 18th International Conference on Data Engineering; 2002. p. 129–40.
Milo T, Suciu D. Index structures for path expressions. In: Proceedings of the 7th International Conference on Database Theory; 1999. p. 277–95.
Nestorov S, Ullman JD, Wiener JL, Chawathe SS. Representative objects: concise representations of semistructured, hierarchial data. In: Proceedings of the 13th International Conference on Data Engineering; 1997. p. 79–90.
Polyzotis N, Garofalakis MN. XSketch synopses for XML data graphs. ACM Trans Database Syst. 2006;31(3):1014–63.
Qun C, Lim A, Ong KW. D(k)-index: an adaptive structural summary for graph-structured data. In: Proceedings of the ACM SIGMOD International Conference on Management of Data; 2003. p. 134–44.
Rizzolo F, Mendelzon AO. Indexing XML data with ToXin. In: Proceedings of the 4th International Workshop on the Web and Databases; 2001. p. 49–54.
Young-Lai M, Tompa FW. One-pass evaluation of region algebra expressions. Inform Syst. 2003;28(3):159–68.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Section Editor information
Rights and permissions
Copyright information
© 2018 Springer Science+Business Media, LLC, part of Springer Nature
About this entry
Cite this entry
Consens, M.P. (2018). Structural Indexing. In: Liu, L., Özsu, M.T. (eds) Encyclopedia of Database Systems. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-8265-9_376
Download citation
DOI: https://doi.org/10.1007/978-1-4614-8265-9_376
Published:
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4614-8266-6
Online ISBN: 978-1-4614-8265-9
eBook Packages: Computer ScienceReference Module Computer Science and Engineering