Abstract
Information retrieval in XML documents was first investigated by the database community. More recently, the Information Retrieval (IR) community has begun to address XML search, adapting traditional IR models to process XML documents and rank results by relevance. In this paper, we describe an IR approach for handling queries that combine content and structure conditions. The XFIRM model we propose is designed to be as flexible as possible in processing such queries. It is based on a complete query language derived from XPath and on a method for propagating relevance values. The proposed method is evaluated within the INEX evaluation initiative. Results show that our system achieves relatively high precision.
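The abstract does not give the propagation formula, so the following is only a rough illustration of the general idea of propagating relevance values through an XML tree, not the XFIRM method itself. It assumes that leaf text is scored by simple term counts and that each child's score is passed to its parent scaled by a fixed factor. The names `leaf_score` and `propagate` and the `decay` parameter are hypothetical.

```python
import xml.etree.ElementTree as ET


def leaf_score(text, terms):
    """Toy content score: number of query-term occurrences in the text."""
    words = text.lower().split()
    return float(sum(words.count(t) for t in terms))


def propagate(node, terms, decay=0.5, scores=None):
    """Score every element bottom-up.

    An element's score is the score of its own text plus the scores of its
    children, each scaled by `decay` (an assumed parameter, not taken from
    the paper). Tail text after child elements is ignored for brevity.
    """
    if scores is None:
        scores = {}
    total = leaf_score(node.text or "", terms)
    for child in node:
        propagate(child, terms, decay, scores)
        total += decay * scores[child]
    scores[node] = total
    return scores


doc = ET.fromstring(
    "<article><title>xml search</title>"
    "<sec><p>xml retrieval of xml</p><p>other text</p></sec></article>"
)
scores = propagate(doc, ["xml"])

# Rank elements of any granularity by their propagated score.
ranked = sorted(scores.items(), key=lambda kv: kv[1], reverse=True)
for element, score in ranked:
    print(element.tag, score)
```

With a scaling factor below 1, a small element that contains the query terms directly can outrank its ancestors, which is one way an element-level ranking might favour focused answers over whole documents.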
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Sauvagnat, K., Boughanem, M., Chrisment, C. (2004). Searching XML Documents Using Relevance Propagation. In: Apostolico, A., Melucci, M. (eds) String Processing and Information Retrieval. SPIRE 2004. Lecture Notes in Computer Science, vol 3246. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30213-1_35
DOI: https://doi.org/10.1007/978-3-540-30213-1_35
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23210-0
Online ISBN: 978-3-540-30213-1
eBook Packages: Springer Book Archive