Effective Keyword Search with Synonym Rules over XML Document

Zhang, Linlin; Liu, Qing; Lu, Jiaheng

doi:10.1007/978-3-642-33050-6_28

Linlin Zhang²⁵,
Qing Liu²⁵ &
Jiaheng Lu²⁵

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 7419))

Included in the following conference series:

International Conference on Web-Age Information Management

793 Accesses

Abstract

Keyword search is a friendly way for user to find the information they are interested in from XML documents without having to learn a complex query language or needing prior knowledge of the structure of the underlying data. However, the existing methods are usually limited to the input keywords. In this paper, we introduced the notion of synonyms, acronym, abbreviations and so on to capture user query intentions. We propose a SLCA based keyword search with synonym rules over xml documents which are orthogonal to various of xml keyword search techniques. In addition, we also use this to give a effective and efficient slca based keyword search.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Arasu, A., Chaudhuri, S., Kaushik, R.: Transformation-based framework for record matching. In: ICDE, pp. 40–49 (2008)
Google Scholar
Arasu, A., Ganti, V., Kaushik, R.: Efficient exact set-similarity joins. In: VLDB, pp. 918–929 (2006)
Google Scholar
Bao, Z., Ling, T.W., Chen, B., Lu, J.: Effective xml keyword search with relevance oriented ranking. In: ICDE, pp. 517–528 (2009)
Google Scholar
Bilenko, M., Mooney, R.J.: Adaptive duplicate detection using learnable string similarity measures. In: KDD, pp. 39–48 (2003)
Google Scholar
Chaudhuri, S., Ganti, V., Kaushik, R.: A primitive operator for similarity joins in data cleaning. In: ICDE, p. 5 (2006)
Google Scholar
Cohen, S., Mamou, J., Kanza, Y., Sagiv, Y.: Xsearch: A semantic search engine for xml. In: VLDB, pp. 45–56 (2003)
Google Scholar
Guo, L., Shao, F., Botev, C., Shanmugasundaram, J.: Xrank: Ranked keyword search over xml documents. In: SIGMOD Conference, pp. 16–27 (2003)
Google Scholar
Kondrak, G.: N-Gram Similarity and Distance. In: Consens, M.P., Navarro, G. (eds.) SPIRE 2005. LNCS, vol. 3772, pp. 115–126. Springer, Heidelberg (2005)
Chapter Google Scholar
Koudas, N., Sarawagi, S., Srivastava, D.: Record linkage: similarity measures and algorithms. In: SIGMOD Conference, pp. 802–803 (2006)
Google Scholar
Li, G., Feng, J., Wang, J., Zhou, L.: Effective keyword search for valuable lcas over xml documents. In: CIKM, pp. 31–40 (2007)
Google Scholar
Miller, D.R.H., Leek, T., Schwartz, R.M.: A hidden markov model information retrieval system. In: SIGIR, pp. 214–221 (1999)
Google Scholar
Murray, G.C., Teevan, J.: Query log analysis: social and technological challenges. SIGIR Forum 41(2), 112–120 (2007)
Article Google Scholar
Salton, G., Buckley, C.: Term-weighting approaches in automatic text retrieval. Inf. Process. Manage. 24(5), 513–523 (1988)
Article Google Scholar
Sarawagi, S., Kirpal, A.: Efficient set joins on similarity predicates. In: SIGMOD Conference, pp. 743–754 (2004)
Google Scholar
Tsuruoka, Y., McNaught, J., Tsujii, J.-i., Ananiadou, S.: Learning string similarity measures for gene/protein name dictionary look-up using logistic regression. Bioinformatics 23(20), 2768–2774 (2007)
Article Google Scholar
Wikipedia, http://en.wikipedia.org/
Winkler, W.E.: The state of record linkage and current research problems. Technical report, Statistical Research Division, U.S. Census Bureau (1999)
Google Scholar
Xu, Y., Papakonstantinou, Y.: Efficient keyword search for smallest lcas in xml databases. In: SIGMOD Conference, pp. 537–538 (2005)
Google Scholar
Xu, Y., Papakonstantinou, Y.: Efficient lca based keyword search in xml data. In: CIKM, pp. 1007–1010 (2007)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Information, Renmin University of China, Beijing, China
Linlin Zhang, Qing Liu & Jiaheng Lu

Authors

Linlin Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Qing Liu
View author publications
You can also search for this author in PubMed Google Scholar
Jiaheng Lu
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Computing, National University of Singapore, Singapore
Zhifeng Bao
College of Computer Science and Technology, Zhejiang University, 38 ZheDa Road, 310027, Hangzhou, China
Yunjun Gao
Northeastern University, Shenyang, China
Yu Gu
Heilongjiang University, 150080, Harbin, China
Longjiang Guo
Department of Computer Science, Georgia State University, 34 Peachtree Street, Suite 1413, 30303, Atlanta, GA, USA
Yingshu Li
Renmin University of China, Beijing, China
Jiaheng Lu
School of Computer Science, Hangzhou Dianzi University, Hangzhou, China
Zujie Ren
School of Software, Tsinghua University, Beijing, China
Chaokun Wang
School of Information, Renmin University of China, 100872, Beijing, China
Xiao Zhang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zhang, L., Liu, Q., Lu, J. (2012). Effective Keyword Search with Synonym Rules over XML Document. In: Bao, Z., et al. Web-Age Information Management. WAIM 2012. Lecture Notes in Computer Science, vol 7419. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33050-6_28

Download citation

DOI: https://doi.org/10.1007/978-3-642-33050-6_28
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-33049-0
Online ISBN: 978-3-642-33050-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics