Hash-Search: An Efficient SLCA-Based Keyword Search Algorithm on XML Documents

  • Conference paper
Database Systems for Advanced Applications (DASFAA 2009)

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 5463)

Abstract

XML is a de facto standard for exchanging and presenting information, and keyword search over XML documents has become an active research topic. However, semi-structured XML data pose many challenges for conventional information retrieval technologies. To return highly related data nodes and improve the quality of keyword search results, SLCA (Smallest Lowest Common Ancestor)-based keyword search on XML data has recently attracted growing attention in the database community. In this paper, we design an efficient index and propose a hash-based method to answer SLCA-based keyword search queries. Our approach outperforms the Incremental Multiway-SLCA approach, which is the most efficient algorithm in the literature. We demonstrate the effectiveness of our algorithms analytically and experimentally.
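
To make the SLCA semantics named in the abstract concrete, the following is a minimal brute-force sketch. It is not the paper's Hash-Search algorithm or its index, which are not described on this page. It assumes nodes are identified by Dewey labels (tuples of child positions from the root) and that a hypothetical `matches` dictionary maps each keyword to the labels of the nodes containing it. A node is reported if its subtree contains every query keyword and no proper descendant's subtree does.

```python
from typing import Dict, List, Optional, Set, Tuple

# Assumption: a node is identified by its Dewey label, e.g. (0, 2, 1).
Dewey = Tuple[int, ...]


def ancestors_or_self(label: Dewey) -> Set[Dewey]:
    """Return the label itself plus every ancestor label up to the root."""
    return {label[:i] for i in range(1, len(label) + 1)}


def slca(matches: Dict[str, List[Dewey]], keywords: List[str]) -> List[Dewey]:
    """Brute-force SLCA: nodes covering all keywords with no covering descendant."""
    covering: Optional[Set[Dewey]] = None
    for kw in keywords:
        # All nodes whose subtree contains at least one match for this keyword.
        covered: Set[Dewey] = set()
        for label in matches.get(kw, []):
            covered |= ancestors_or_self(label)
        covering = covered if covering is None else covering & covered
    if not covering:
        return []
    # The covering set is closed under ancestors, so a node has a covering
    # descendant exactly when it is the parent of some covering node.
    has_covering_child = {c[:-1] for c in covering if len(c) > 1}
    return sorted(covering - has_covering_child)


# Hypothetical toy document: root (0,) with three subtrees.
matches = {
    "xml": [(0, 0, 0), (0, 1, 0), (0, 2, 1)],
    "search": [(0, 0, 1), (0, 2, 0, 0)],
}
print(slca(matches, ["xml", "search"]))  # [(0, 0), (0, 2)]
```

The root `(0,)` also contains both keywords, but it is excluded because its children `(0, 0)` and `(0, 2)` already do; this is the "smallest" condition in SLCA. The sketch materializes every ancestor of every match, so it is only suitable for illustrating the definition on small inputs.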

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Wang, W., Wang, X., Zhou, A. (2009). Hash-Search: An Efficient SLCA-Based Keyword Search Algorithm on XML Documents. In: Zhou, X., Yokota, H., Deng, K., Liu, Q. (eds) Database Systems for Advanced Applications. DASFAA 2009. Lecture Notes in Computer Science, vol 5463. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-00887-0_44

  • DOI: https://doi.org/10.1007/978-3-642-00887-0_44

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-00886-3

  • Online ISBN: 978-3-642-00887-0

  • eBook Packages: Computer Science, Computer Science (R0)
