Evaluating Query-Independent Object Features for Relevancy Prediction

Masegosa, Andres R.; Joho, Hideo; Jose, Joemon M.

doi:10.1007/978-3-540-71496-5_27

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 4425)

Included in the following conference series: European Conference on Information Retrieval

Abstract

This paper presents a series of experiments investigating how effectively query-independent features extracted from retrieved objects predict relevancy. Features were grouped into conceptual categories and evaluated individually using click-through data collected in a laboratory-based user study. The results showed that textual and visual features were useful for relevancy prediction in a topic-independent condition, whereas a range of features could be effective when topic knowledge was available. We also revisited the original study from the perspective of the significant features identified by our experiments.

This work was supported by the ALGRA project (TIN2004-06204-C03-02), an FPU scholarship (AP2004-4678), and EPSRC (Ref: EP/C004108/1).

Author information

Authors and Affiliations

Department of Computer Science and A.I., University of Granada, Spain
Andres R. Masegosa
Department of Computing Science, University of Glasgow, UK
Hideo Joho & Joemon M. Jose

Editor information

Giambattista Amati, Claudio Carpineto, Giovanni Romano

Cite this paper

Masegosa, A.R., Joho, H., Jose, J.M. (2007). Evaluating Query-Independent Object Features for Relevancy Prediction. In: Amati, G., Carpineto, C., Romano, G. (eds) Advances in Information Retrieval. ECIR 2007. Lecture Notes in Computer Science, vol 4425. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-71496-5_27

DOI: https://doi.org/10.1007/978-3-540-71496-5_27
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-71494-1
Online ISBN: 978-3-540-71496-5
eBook Packages: Computer Science, Computer Science (R0)
