Abstract
A common approach for developing XML element retrieval systems is to adapt text retrieval systems to retrieve elements from documents. Two key challenges in this approach are to effectively score structural queries and to control overlap in the output across different search tasks. In this paper, we continue our research into the use of navigation models for element scoring as a way to represent the user’s preferences for the structure of retrieved elements. Our goal is to improve search systems using structural scoring by boosting the score of desirable elements and to post-process results to control XML overlap. This year we participated in the Ad-hoc Focused, Efficiency, and Entity Ranking Tracks, where we focused our attention primarily on the effectiveness of small navigation models. Our experiments involved three modifications to our previous work; (i) using separate summaries for boosting and post-processing, (ii) introducing summaries that are generated from user study data, and (iii) confining our results to using small models. Our results suggest that smaller models can be effective but more work needs to be done to understand the cases where different navigation models may be appropriate.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Apache Lucene Java (2008), http://lucene.apache.org
Ali, M.S., Consens, M.P., Kazai, G., Lalmas, M.: Structural relevance: a common basis for the evaluation of structured document retrieval. In: CIKM 2008, pp. 1153–1162. ACM Press, New York (2008)
Ali, M.S., Consens, M.P., Khatchadourian, S.: XML retrieval by improving structural relevance measures obtained from summary models. In: Fuhr, N., Kamps, J., Lalmas, M., Trotman, A. (eds.) INEX 2007. LNCS, vol. 4862, pp. 34–48. Springer, Heidelberg (2008)
Ali, M.S., Consens, M.P., Khatchadourian, S., Rizzolo, F.: DescribeX: Interacting with AxPRE Summaries. In: ICDE 2008, pp. 1540–1543. IEEE Computer Society Press, Los Alamitos (2008)
Ali, M.S., Consens, M.P., Lalmas, M.: Structural Relevance in XML Retrieval Evaluation. In: SIGIR 2007 Workshop on Focused Retrieval, pp. 1–8 (2007)
Ali, M.S., Consens, M.P., Larsen, B.: Representing user navigation in XML retrieval with structural summaries. In: ECIR 2009 (in press, 2009)
Clarke, C.: Controlling overlap in content-oriented XML retrieval. In: SIGIR 2005, pp. 314–321. ACM Press, New York (2005)
Consens, M.P., Rizzolo, F., Vaisman, A.A.: AxPRE Summaries: Exploring the (Semi-)Structure of XML Web Collections. In: ICDE 2008, pp. 1519–1521 (2008)
Guo, L., Shao, F., Botev, C., Shanmugasundaram, J.: XRANK: Ranked keyword search over xml documents. In: SIGMOD 2003. ACM Press, New York (2003)
Kazai, G., Lalmas, M., de Vries, A.P.: The overlap problem in content-oriented xml retrieval evaluation. In: SIGIR 2004, pp. 72–79. ACM Press, New York (2004)
Malik, S., Tombros, A., Larsen, B.: The Interactive Track at INEX 2006. In: Fuhr, N., Lalmas, M., Trotman, A. (eds.) INEX 2006. LNCS, vol. 4518, pp. 387–399. Springer, Heidelberg (2007)
Piwowarski, B., Gallinari, P., Dupret, G.: Precision recall with user modeling (PRUM): Application to structured information retrieval. ACM Trans. Inf. Syst. 25(1), 1 (2007)
Ross, S.M.: Introduction to Probability Models, 8th edn. Academic Press, New York (2003)
Theobald, M., Schenkel, R., Weikum, G.: An efficient and versatile query engine for TopX search. In: Proc. VLDB Conf., pp. 625–636 (2005)
Trotman, A., Sigurbjörnsson, B.: Narrowed Extended XPath I (NEXI). In: Fuhr, N., Lalmas, M., Malik, S., Szlávik, Z. (eds.) INEX 2004. LNCS, vol. 3493, pp. 16–40. Springer, Heidelberg (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ali, M.S., Consens, M.P., Helou, B., Khatchadourian, S. (2009). Exploiting User Navigation to Improve Focused Retrieval. In: Geva, S., Kamps, J., Trotman, A. (eds) Advances in Focused Retrieval. INEX 2008. Lecture Notes in Computer Science, vol 5631. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-03761-0_20
Download citation
DOI: https://doi.org/10.1007/978-3-642-03761-0_20
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-03760-3
Online ISBN: 978-3-642-03761-0
eBook Packages: Computer ScienceComputer Science (R0)