
Indexing and Retrieval of XML-Encoded Structured Documents in Dynamic Environment

  • Conference paper

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 2480)

Abstract

To retrieve structured documents efficiently, much research has gone into indexing techniques that support fast, direct access to arbitrary elements as well as to whole documents. Business domains, however, also require an indexing technique that handles dynamic updates of structured documents quickly and efficiently. In this paper, we propose an inverted index structure that supports fast dynamic updates, including both structure and content updates. In addition to the horizontal term-based index found in a general inverted file structure, the proposed structure adds a vertical index that uses the element identifier as its key. With this dual index structure, updates to parts of a document, as well as to the whole document, can be performed quickly and efficiently, and the space and time spent on re-indexing are reduced dramatically.
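
The dual index idea can be illustrated with a minimal Python sketch. This is not the authors' implementation, since the abstract gives no layout details. The class name `DualIndex`, its method names, the whitespace tokenizer, and the path-like element identifiers are all assumptions made for illustration. The sketch only shows why a second index keyed by element identifier lets an update touch just the posting lists of the affected element.

```python
from collections import defaultdict


class DualIndex:
    """Toy dual index (hypothetical): term -> element ids, and element id -> terms."""

    def __init__(self):
        # Horizontal index: the usual inverted file, keyed by term.
        self.by_term = defaultdict(set)
        # Vertical index: keyed by element identifier (assumed layout).
        self.by_element = {}

    def add(self, element_id, text):
        # Naive whitespace tokenizer; a real system would stem and filter terms.
        terms = set(text.lower().split())
        self.by_element[element_id] = terms
        for term in terms:
            self.by_term[term].add(element_id)

    def remove(self, element_id):
        # The vertical index names exactly the posting lists to touch,
        # so no scan over the whole term dictionary is needed.
        for term in self.by_element.pop(element_id, set()):
            postings = self.by_term[term]
            postings.discard(element_id)
            if not postings:
                del self.by_term[term]

    def update(self, element_id, text):
        # Content update of one element: re-index only that element.
        self.remove(element_id)
        self.add(element_id, text)

    def search(self, term):
        # Look up one term in the horizontal index.
        return sorted(self.by_term.get(term.lower(), ()))


index = DualIndex()
index.add("doc1/sec1/p1", "inverted index structure")
index.add("doc1/sec1/p2", "dynamic update of index")
index.update("doc1/sec1/p1", "vertical index")
print(index.search("index"))     # ['doc1/sec1/p1', 'doc1/sec1/p2']
print(index.search("inverted"))  # []
```

In this sketch, removing an element stands in for a structure update and `update` stands in for a content update. Both cost time proportional to the number of distinct terms in the one element, not to the size of the whole index.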




Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Kim, S.W., Lee, J., Lim, H.C. (2002). Indexing and Retrieval of XML-Encoded Structured Documents in Dynamic Environment. In: Han, Y., Tai, S., Wikarski, D. (eds) Engineering and Deployment of Cooperative Information Systems. EDCIS 2002. Lecture Notes in Computer Science, vol 2480. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45785-2_11


  • DOI: https://doi.org/10.1007/3-540-45785-2_11


  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-44222-6

  • Online ISBN: 978-3-540-45785-5

  • eBook Packages: Springer Book Archive
