Nearest Keyword Search on Probabilistic XML Data

Zhao, Yue; Yuan, Ye; Wang, Guoren

doi:10.1007/978-3-319-11116-2_43

Conference paper

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 8709)

Abstract

This paper addresses the nearest keyword (NK) problem on probabilistic XML data (NK-P). NK search plays an important role in information discovery, information extraction, and many other areas. Answering an NK-P query is more expensive than answering an NK query on traditional XML data because of the large number of possible worlds. NK-P can be viewed as an NK problem over many traditional XML documents. Given a node q and a keyword k, an NK-P query returns the node nearest to q among all the nodes associated with k in all the possible worlds. NK-P search is useful not only as an independent operator but also as an important building block for keyword search. First, we propose a new NK concept on probabilistic XML data based on possible worlds. Next, we present an indexing algorithm that answers NK-P queries efficiently. Finally, extensive experimental results show that our approach is effective on probabilistic XML data and significantly reduces execution time.
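To make the possible-worlds reading of an NK-P query concrete, here is a minimal brute-force sketch in Python. It is not the paper's indexing algorithm, which the preview does not describe. It assumes a simplified model in which every node exists independently with a given probability once its parent exists, measures distance as the number of edges on the tree path, and reports, for each candidate node, the total probability of the worlds in which that node is the nearest one carrying k. The toy tree, the names `TREE`, `distance`, `possible_worlds`, and `nk_p`, and the choice to aggregate by probability are all illustrative assumptions.

```python
from itertools import product

# Hypothetical toy document: node -> (parent, probability that the node
# exists given that its parent exists, keywords attached to the node).
TREE = {
    "r": (None, 1.0, set()),
    "a": ("r", 1.0, set()),
    "q": ("a", 1.0, set()),
    "x": ("a", 0.5, {"k"}),
    "y": ("r", 0.8, {"k"}),
}


def path_to_root(node):
    """Return the list of nodes from `node` up to the root, inclusive."""
    path = []
    while node is not None:
        path.append(node)
        node = TREE[node][0]
    return path


def distance(u, v):
    """Number of edges on the tree path between u and v."""
    steps_from_u = {n: i for i, n in enumerate(path_to_root(u))}
    for j, n in enumerate(path_to_root(v)):
        if n in steps_from_u:  # first common ancestor
            return steps_from_u[n] + j
    raise ValueError("nodes are not in the same tree")


def possible_worlds():
    """Enumerate (set of present nodes, probability) by brute force."""
    names = list(TREE)
    for flips in product([True, False], repeat=len(names)):
        kept = dict(zip(names, flips))
        prob = 1.0
        for n in names:
            p = TREE[n][1]
            prob *= p if kept[n] else 1.0 - p
        if prob == 0.0:
            continue
        # A node is present only if it and all of its ancestors are kept.
        present = {n for n in names if all(kept[m] for m in path_to_root(n))}
        yield present, prob


def nk_p(q, k):
    """Map each node to the probability that it is the nearest k-node to q."""
    answer_prob = {}
    for present, prob in possible_worlds():
        if q not in present:
            continue
        candidates = [n for n in present if n != q and k in TREE[n][2]]
        if not candidates:
            continue
        # Ties are broken by node name so the result is deterministic.
        nearest = min(candidates, key=lambda n: (distance(q, n), n))
        answer_prob[nearest] = answer_prob.get(nearest, 0.0) + prob
    return answer_prob
```

On this toy tree, `nk_p("q", "k")` assigns about 0.5 to `x` (distance 2, nearest whenever it exists) and about 0.4 to `y` (distance 3, nearest only when `x` is absent). The enumeration visits 2^n coin-flip combinations for n nodes, which illustrates why the abstract describes NK-P as expensive without an index.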

Author information

Authors and Affiliations

College of Information Science and Engineering, Northeastern University, China
Yue Zhao, Ye Yuan & Guoren Wang

Editor information

Editors and Affiliations

Beijing Institute of Spacecraft System Engineering, Beijing, China
Lei Chen
School of Computer Science, National University of Defense Technology, 410073, Changsha, Hunan, China
Yan Jia
RMIT University, Melbourne, Australia
Timos Sellis
School of Computer Science and Technology, Soochow University, 215006, Suzhou, China
Guanfeng Liu

About this paper

Cite this paper

Zhao, Y., Yuan, Y., Wang, G. (2014). Nearest Keyword Search on Probabilistic XML Data. In: Chen, L., Jia, Y., Sellis, T., Liu, G. (eds) Web Technologies and Applications. APWeb 2014. Lecture Notes in Computer Science, vol 8709. Springer, Cham. https://doi.org/10.1007/978-3-319-11116-2_43

DOI: https://doi.org/10.1007/978-3-319-11116-2_43
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-11115-5
Online ISBN: 978-3-319-11116-2
eBook Packages: Computer Science, Computer Science (R0)
