Abstract
We developed a passage retrieval system for XML documents using the vector space model. To be more flexible for the query, we also developed a method of unification of multiple retrieved elements and a fragment indexing system. Our system is composed of an inverted file and an XML Path Language (XPath) path list. The validity of the method was tested as part of the ad hoc track in the Initiative for the Evaluation of XML Retrieval (INEX) 2006.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
XML Path Language (XPath) Version 1.0., http://www.w3.org/TR/xpath
Baeza-Yates, R., Ribeiro-Neto, B.: Modern Information Retrieval (Acm Press Series), pp. 1–69, 141–162. Addison-Wesley, London (1999)
Salton, G., Wong, A., Yang, C.S.: A vector space model for automatic indexing. Communications of the ACM. 18, 613–620 (1975)
Evans, D., Lefferts, R.: Design and Evaluation of the CLARIT-TREC-2 system. In Harman, D.K. (ed.) Proceedings of the Second Text REtrieval Conference (TREC-2). pp. 500–548. NIST Special Publication (1994)
Tanioka, H., Yamamoto, K.: A Distributed Retrieval System for NTCIR-5 Patent Retrieval Task. In: The 5th NTCIR Workshop Meeting (2005)
Grabs, T., Schek, H.-J.: Flexible Information Retrieval on XML Documents. In: Blanken, H.M., Grabs, T., Schek, H.-J., Schenkel, R., Weikum, G. (eds.) Intelligent Search on XML Data. LNCS, vol. 2818, pp. 95–106. Springer, Heidelberg (2003)
Kazai, G., Gvert, N., Lalmas, M., Fuhr, N.: The INEX evaluation initiative. In: Blanken, H.M., Grabs, T., Schek, H.-J., Schenkel, R., Weikum, G. (eds.) Intelligent Search on XML Data. LNCS, vol. 2818, pp. 279–293. Springer, Heidelberg (2003)
Sigurbjornsson, B., Kamps, J., de Rijke, M.: An element-based approach to XML retrieval. In: INEX Workshop Proceedings. pp. 19–26 ( 2003)
Geva, S., Leo-Spork, M.: XPath Inverted File for Information Retrieval. In: INEX Workshop Proceedings. pp. 110–117 ( 2003)
Kelly, W., Geva, S., Sahama, T., Loke, W.: Distributed XML Information Retrieval. In: INEX Workshop Proceedings. pp. 126–133 ( 2003)
Mihajlovic, V., Ramirez, G., Westerveld, T., Hiemstra, D., Blok, H.E., de Vries, A.P.: TIJAH Scratches INEX 2005: Vague Element Selection, Image Search, Overlap, and Relevance Feedback. In: Fuhr, N., Lalmas, M., Malik, S., Kazai, G. (eds.) INEX 2005. LNCS, vol. 3977, pp. 72–87. Springer, Heidelberg (2006)
Trotman, A., Geva, S.: Passage Retrieval and XML-Retrieval Tasks. In: Proceedings of the SIGIR 2006, Workshop on XML Element Retrieval Methodology. pp. 43–50 ( 2006)
Zobel, J., Moffat, A.: Inverted files for text search engines. ACM Computing Surveys (CSUR)Â 38(2), Article 6 (2006)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Tanioka, H. (2007). A Method of Preferential Unification of Plural Retrieved Elements for XML Retrieval Task . In: Fuhr, N., Lalmas, M., Trotman, A. (eds) Comparative Evaluation of XML Information Retrieval Systems. INEX 2006. Lecture Notes in Computer Science, vol 4518. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-73888-6_5
Download citation
DOI: https://doi.org/10.1007/978-3-540-73888-6_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-73887-9
Online ISBN: 978-3-540-73888-6
eBook Packages: Computer ScienceComputer Science (R0)