XEdge: An Efficient Method for Returning Meaningful Clustered Results for XML Keyword Search

Liang, Wenxin; Gan, Yuanyuan; Zhang, Xianchao

doi:10.1007/978-3-319-08608-8_19

Wenxin Liang¹⁷,
Yuanyuan Gan¹⁷ &
Xianchao Zhang¹⁷

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8506))

Included in the following conference series:

Australasian Database Conference

1155 Accesses

Abstract

In this paper, we investigate the problem of returning meaningful clustered results for XML keyword search. We begin by presenting a multi-granularity computing methodology, in order to make full use of the structural information of XML trees to extract features. In this method, we first propose the concept of Cluster Compactness Granularity (CCG) to partition the search results into different clusters, which enable users to precisely and quickly seek their desired answers, according to the connection compactness between LCA nodes. We then propose the concept of Subtree Compactness Granularity (SCG) to rank individual results within clusters and measure the query result relevance. Furthermore, we define a novel semantics of Compact LCA (CLCA), which not only improves the accuracy by eliminating redundant LCAs that do not contribute to meaningful answers, but also overcomes the shielding effects of SLCA-based methods. Using the proposed CCG and SCG features and the CLCA semantics, we finally implement an efficient algorithm called XEdge for generating meaningful clustered results. Comparing with the existing methods such as XSeek and XKLUSTER, the experimental results demonstrate the effectiveness of the proposed multi-granularity clustering methodology and validity of the complemented ranking strategy, as well as the meaningfulness of CLCA semantics.

This work was partially supported by NSFC (No. 61272374, 61300190), Program for NCET in University of China (No. NCET-11-0056), Specialized RFDP of Higher Education (No.20120041110046), Key Project of Chinese Ministry of Education(No. 313011) and the Fundamental Research Funds for the Central Universities (No. DUT13JR04).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Liu, Z., Chen, Y.: Identifying meaningful return information for xml keyword search. In: SIGMOD, pp. 329–340 (2007)
Google Scholar
Liu, Z., Chen, Y.: Return specification inference and result clustering for keyword search on xml. ACM TODS 35(2), 1–47 (2010)
Google Scholar
Yang, W., Zhu, H.: Semantic-distance based clustering for xml keyword search. In: Zaki, M.J., Yu, J.X., Ravindran, B., Pudi, V. (eds.) PAKDD 2010. LNCS, vol. 6119, pp. 398–409. Springer, Heidelberg (2010)
Chapter Google Scholar
Liu, Z., Chen, Y.: Processing keyword search on xml: A survey. World Wide Web 14(5-6), 671–707 (2011)
Article Google Scholar
Zhou, R., Liu, C., Li, J., Yu, J.X.: Elca evaluation for keyword search on probabilistic xml data. World Wide Web 16(2), 171–193 (2013)
Article Google Scholar
Liu, X., Wan, C., Chen, L.: Returning clustered results for keyword search on xml documents. IEEE TKDE 23(12), 1811–1825 (2011)
Google Scholar
Washington xml data repository, http://www.cs.washington.edu/research/xmldatasets/

Download references

Author information

Authors and Affiliations

School of Software, Dalian University of Technology, Dalian, 116620, China
Wenxin Liang, Yuanyuan Gan & Xianchao Zhang

Authors

Wenxin Liang
View author publications
You can also search for this author in PubMed Google Scholar
Yuanyuan Gan
View author publications
You can also search for this author in PubMed Google Scholar
Xianchao Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Centre for Applied Informatics (CAI), College of Engineering and Science, Victoria University, Ballarat Road, 8001, Footscray, VIC, Australia
Hua Wang
Faculty of Engineering, Architecture and Information Technology, School of Information Technology and Electrical Engineering, The University of Queensland, St. Lucia, 4072, Brisbane, QLD, Australia
Mohamed A. Sharaf

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Liang, W., Gan, Y., Zhang, X. (2014). XEdge: An Efficient Method for Returning Meaningful Clustered Results for XML Keyword Search. In: Wang, H., Sharaf, M.A. (eds) Databases Theory and Applications. ADC 2014. Lecture Notes in Computer Science, vol 8506. Springer, Cham. https://doi.org/10.1007/978-3-319-08608-8_19

Download citation

DOI: https://doi.org/10.1007/978-3-319-08608-8_19
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-08607-1
Online ISBN: 978-3-319-08608-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics