Abstract
This paper presents a method (named DOSR) to support the semantic retrieval of XML documents in a specific domain. It takes the entity as the basic unit of information processing to guarantee the semantic integrity of returned results. An efficient index method named Entity-based index is designed for indexing the entities. It can greatly reduce the size of the index file while guarantee the speed of parsing entity. In order to rank the querying results, the Stratified-Weight-Method is proposed to make an improvement towards the traditional technology. Experimental results show that DOSR can infer users’ search intention effectively, locate the search target quickly and return the exact results in accordance with users’ expectation. The results processed by DOSR guarantees semantic integrity and reasonable ranking results.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Czumaj, A., Kowaluk, M., Lingas, A.: Faster algorithms for finding lowest common ancestors in directed acyclic graphs. Theoretical Computer Science 380(1-2) (July 2007)
Schmidt, A., Kersten, M.L., Windhouwer, M.: Querying XML Documents Made Easy: Nearest Concept Queries. In: Proc. of the Intl. Conf. on Data Engineering, Washington, USA, pp. 321–329 (2001)
Sun, C., Chan, C., Goenka, A.K.: Multiway SLCA-Based Keyword Search in XML Data. In: Proc. of the Intl. Conf. on World Wide Web, New York, USA, pp. 1043–1052 (2007)
Li, G., Feng, J., Wang, J., Zhou, L.: Effective keyword search for valuable lcas over xml documents. In: Proc. of the Intl. Conf. on Information and Knowledge Management, New York, USA, pp. 31–40 (2007)
Guo, L., Shao, F., Botev, C., Shanmugasundaram, J.: XRANK: Ranked keyword search over xml documents. In: Proc. of the Intl. Conf. on Management of Data, New York, USA, pp. 16–27 (2003)
Hristidis, V., Koudas, N., Papakonstantinou, Y., Srivastava, D.: Keyword Proximity Search in XML Trees. IEEE Transactions on Knowledge and Data Engineering 18(4) (April 2006)
Li, Y., Yu, C., Jagadish, H.V.: Schema-Free XQuery. VLDB Endowment Very Large Database Endowment 30 (2004)
Xu, Y., Papakonstantinou, Y.: Efficient Keyword Search for Smallest LCAs in XML Databases. In: Proc. of the Intl. Conf. on Management of Data, New York, USA, pp. 527–538 (June 2005)
Huang, Y., Liu, Z., Chen, Y.: Query biased snippet generation in xml search. In: Proc. of the Intl. Conf. on Management of Data, New York, USA, pp. 315–326 (2008)
Bao, Z., Lu, J., Ling, T.W., Chen, B.: Towards an Effective XML Keyword Search. IEEE Transactions on Knowledge and Data Engineering 22(8) (August 2010)
Liu, Z., Chen, Y.: Identifying Meaningful Return Information for XML Keyword Search. In: Proc. of the Intl. Conf. on Management of Data, New York, USA (2007)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Feng, J., Tang, Z., Huang, R. (2012). DOSR: A Method of Domain-Oriented Semantic Retrieval in XML Data. In: Watanabe, T., Watada, J., Takahashi, N., Howlett, R., Jain, L. (eds) Intelligent Interactive Multimedia: Systems and Services. Smart Innovation, Systems and Technologies, vol 14. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-29934-6_22
Download citation
DOI: https://doi.org/10.1007/978-3-642-29934-6_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-29933-9
Online ISBN: 978-3-642-29934-6
eBook Packages: EngineeringEngineering (R0)