Abstract
Extensible Markup Language (XML) is used for not only describing structured documents but also for describing data just for generating XML from relational data. The former is called document-centric XML, and the latter is called data-centric XML. From studies on retrieving data-centric XML by using keyword searches, methods based on LCA have been proposed, while from studies on retrieving document-centric XML, methods based on information retrieval that focus on the granularity of XML elements have been proposed. However, documents generally have both data-centric and document-centric elements, so there are cases in which desired results cannot be returned by using existing research. We propose a method for constructing suitable search results for XML documents that include both data-centric and document-centric elements by considering a user’s query intention and element features (data-centric or document-centric). Our experiments show that both data-centric and document-centric elements need to be considered for actual XML documents.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Pradhan, S.: An algebraic query model for effective and efficient retrieval of XML fragments. In: VLDB, pp. 295–306 (2006)
Ley, M.: DBLP - Some Lessons Learned. PVLDB, 1493–1500 (2009)
Denoyer, L., Gallinari, P.: The Wikipedia XML Corpus. SIGIR Forum, 64–69 (2006)
Xu, Y., Papakonstantinou, Y.: Efficient Keyword Search for Smallest LCAs in XML Databases. In: SIGMOD Conference, pp. 527–538 (2005)
Liu, Z., Chen, Y.: Identifying Meaningful Return Information for XML Keyword Search. SIGMOD, 329–340 (2007)
Bao, Z., Ling, T.W., Bo, C., Jiaheng, L.: Effective XML Keyword Search with Relevance Oriented Ranking. In: ICDE, pp. 517–528 (2009)
Motomura, T., Shimizu, T., Yoshikawa, M.: Alternative Query Generation for XML Keyword Search and Its Optimization. In: Hameurlain, A., Liddle, S.W., Schewe, K.-D., Zhou, X. (eds.) DEXA 2011, Part I. LNCS, vol. 6860, pp. 410–424. Springer, Heidelberg (2011)
Supasitthimethee, U., Shimizu, T., Yoshikawa, M., Porkaew, K.: XSemantic: An Extension of LCA Based XML Semantic Search. IEICE Transactions on Information and Systems, 1079–1092 (2009)
Hatano, K., Kinutani, H., Yoshikawa, M., Uemura, S.: Information Retrieval System for XML Documents. In: Hameurlain, A., Cicchetti, R., Traunmüller, R. (eds.) DEXA 2002. LNCS, vol. 2453, pp. 758–767. Springer, Heidelberg (2002)
Fujimoto, K., Shimizu, T., Terada, N., Hatano, K., Suzuki, Y., Amagasa, T., Kinutani, H., Yoshikawa, M.: Implementation of a High-Speed and High-Precision XML Information Retrieval System on Relational Databases. In: Fuhr, N., Lalmas, M., Malik, S., Kazai, G. (eds.) INEX 2005. LNCS, vol. 3977, pp. 254–267. Springer, Heidelberg (2006)
Malik, S., Kazai, G., Lalmas, M., Fuhr, N.: Overview of INEX 2005. In: Fuhr, N., Lalmas, M., Malik, S., Kazai, G. (eds.) INEX 2005. LNCS, vol. 3977, pp. 1–15. Springer, Heidelberg (2006)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Tanabe, T., Shimizu, T., Yoshikawa, M. (2012). Effective Keyword-Based XML Retrieval Using the Data-Centric and Document-Centric Features. In: Hou, Y., Nie, JY., Sun, L., Wang, B., Zhang, P. (eds) Information Retrieval Technology. AIRS 2012. Lecture Notes in Computer Science, vol 7675. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35341-3_38
Download citation
DOI: https://doi.org/10.1007/978-3-642-35341-3_38
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-35340-6
Online ISBN: 978-3-642-35341-3
eBook Packages: Computer ScienceComputer Science (R0)