Abstract
XML is a de-facto standard for exchanging and presenting information and keyword search over XML documents has become an interesting topic. However semi-structured XML data give rise to many challenges of conventional information retrieval technologies. In order to return highly-related data nodes and improve the quality of keyword search result, SLCA(Smallest Lowest Common Ancestor )-based keyword search on XML data is recently attracting more and more attention in the database community. In this paper, we design efficient index and propose hash-based method to answer SLCA-based keyword search queries. Our approach outperforms Incremental Multiway-SLCA approach, which is the most efficient algorithms in the literature. We demonstrate the effectiveness of our algorithms analytically and experimentally.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Sun, C., Chan, C.Y., Goenka, A.K.: Multiway SLCA-based Keyword Search in XML Data. In: WWW (2007)
Xu, Y., Papakonstantinou, Y.: Efficient Keyword Search for Smallest LCAs in XML Databases. In: SIGMOD Conference (2005)
Xu, Y., Papakonstantinou, Y.: Efficient LCA based Keyword Search in XML Data. In: CIKM (2007)
Li, Y., Yu, C., Jagadish, H.V.: Schema-Free XQuery. In: VLDB (2004)
Bender, M., Colton, M.F.: The LCA problem revisited. In: Latin American Theoretical Informatics (2000)
Harel, D., Tarjan, R.E.: Fast algorithm for finding nearest common ancestors. SIAM Journal on Computing (1984)
Sleepycat Software, The Berkeley Database (Berkeley DB), http://www.sleepycat.com
Schmidt, A., Kersten, M., Windhouwer, M.: Querying XML Document Made Easy:Nearest Concept Queries. In: ICDE (2001)
Guo, L., Shao, F., Botev, C., Shanmugasundaram, J.: XRANK: Ranked Keyword Search over XML Documents. In: SIGMOD (2003)
Botev, C., Amer-Yahia, S., Shanmugasundaram, J.: Expressiveness and performance of full-text search languages. In: Ioannidis, Y., Scholl, M.H., Schmidt, J.W., Matthes, F., Hatzopoulos, M., Böhm, K., Kemper, A., Grust, T., Böhm, C. (eds.) EDBT 2006. LNCS, vol. 3896, pp. 349–367. Springer, Heidelberg (2006)
Liu, Z., Chen, Y.: Identifying Meaningful Return Information for XML Keyword Search. In: SIGMOD (2007)
Liu, Z., Chen, Y.: Reasoning and Identifying Relevant Matches for XML Keyword Search. In: VLDB (2008)
Li, G., Ooi, B.C., Feng, J., Wang, J., Zhou, L.: EASE: An Effective 3-in-1 Keyword Search Method for Unstructured, Semi-structured and Structured Data. In: SIGMOD (2008)
DBLP XML records, http://dblp.uni-trier.de/xml/
INEX XML data sets
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wang, W., Wang, X., Zhou, A. (2009). Hash-Search: An Efficient SLCA-Based Keyword Search Algorithm on XML Documents. In: Zhou, X., Yokota, H., Deng, K., Liu, Q. (eds) Database Systems for Advanced Applications. DASFAA 2009. Lecture Notes in Computer Science, vol 5463. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-00887-0_44
Download citation
DOI: https://doi.org/10.1007/978-3-642-00887-0_44
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-00886-3
Online ISBN: 978-3-642-00887-0
eBook Packages: Computer ScienceComputer Science (R0)