Abstract
Structured document retrieval aims at exploiting the structure together with the content of documents to improve retrieval results. Several aspects of traditional information retrieval applied on flat documents have to be reconsidered. These include in particular, document representation, storage, indexing, retrieval, and ranking. This paper outlines the architecture of our system and the adaptation of the standard vector space model to achieve focussed retrieval.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Baeza-Yates, R., Ribeiro-Neto, B.: Modern Information Retrieval. Addison Wesley, ACM Press, New York, Essex, England (1999)
Salton, G., Lesk, M.E.: Computer evaluation of indexing and text processing. Journal of the ACM 15, 8–36 (1968)
Salton, G.: The SMART Retrieval System - Experiments in Automatic Document Processing. Prentice Hall Inc., Englewood Cliffs (1971)
Grosjohann, K., Fuhr, N., Effing, D., Kriewel, S.: A user interface for XML document retrieval. In: 32, GI-Jahrestagung. Springer, Heidelberg (2002)
Grosjohann, K., Fuhr, N., Effing, D., Kriewel, S.: Query formulation and result visualization for XML retrieval. In: Proceedings ACM SIGIR 2002 Workshop on XML and Information Retrieval. ACM, New York (2002)
Fuhr, N., Grosjohann, K., Kriewel, S.: A query language and user interface for XML information retrieval. In: Blanken, H.M., Grabs, T., Schek, H.-J., Schenkel, R., Weikum, G. (eds.) Intelligent Search on XML Data. LNCS, vol. 2818, pp. 59–75. Springer, Heidelberg (2003)
Fuhr, N., Grosjohann, K.: XIRQL: A query language for information retrieval in XML documents. In: Proc. of the 24th ACM SIGIR, pp. 172–180. ACM Press, New York (2001)
Gövert, N.: Bilingual information retrieval with hyREX and internet translation services. In: Peters, C. (ed.) CLEF 2000. LNCS, vol. 2069, pp. 237–244. Springer, Heidelberg (2001)
Grust, T.: Accelerating XPath location steps. In: Proc. of the 2002 ACM SIGMOD, pp. 109–120. ACM Press, New York (2002)
Hiemstra, D.: A database approach to content-based xml retrieval. In: INitiative for the Evaluation of XML Retrieval (INEX, Workshop), ERCIM, pp. 111–118 (2003)
Florescu, D., Kossmann, D.: A performance evaluation of alternative mapping schemes for storing XML data in a relational database. Technical report (1999)
Abolhassani, M., Fuhr, N.: Applying the divergence from randomness approach for content-only search in XML documents. In: McDonald, S., Tait, J.I. (eds.) ECIR 2004. LNCS, vol. 2997, pp. 409–419. Springer, Heidelberg (2004)
Kazai, G., Lalmas, M., Rölleke, T.: Focussed structured document retrieval. In: Laender, A.H.F., Oliveira, A.L. (eds.) SPIRE 2002. LNCS, vol. 2476, pp. 241–247. Springer, Heidelberg (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Hassler, M., Bouchachia, A. (2006). Searching XML Documents – Preliminary Work. In: Fuhr, N., Lalmas, M., Malik, S., Kazai, G. (eds) Advances in XML Information Retrieval and Evaluation. INEX 2005. Lecture Notes in Computer Science, vol 3977. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-34963-1_9
Download citation
DOI: https://doi.org/10.1007/978-3-540-34963-1_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-34962-4
Online ISBN: 978-3-540-34963-1
eBook Packages: Computer ScienceComputer Science (R0)