Abstract
This paper explores the feasibility of including image information embedded in Web pages in relevance computation to improve search performance. In determining the ranking of Web pages against a given query, most (if not all) modern Web search engines consider two kinds of factors: text information (including title, URL, body text, anchor text, etc) and static ranking (e.g. PageRank [1]). Although images have been widely used to help represent Web pages and carry valuable information, little work has been done to take advantage of them in computing the relevance score of a Web page given a query. We propose, in this paper, a framework to contain image information in ranking functions. Preliminary experimental results show that, when image information is used properly, ranking results can be improved.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Brin, S., Page, L.: The Anatomy of a Large-Scale Hypertextual Web Search Engine. In: WWW-7 (1998)
Cai, D., et al.: Hierarchical Clustering of WWW Image Search Results Using Visual, Textual and Link Information. In: Proc. Of the 12th ACM international conference on Multimedia, ACM Press, New York (2004)
Frankel, C., Swain, M., Athitsos, V.: Webseer: an Image Search Engine for the World Wide Web. University of Chicago Technical Report TR96-14 (1996)
Hu, J., Bagga, A.: Categorizing Images in Web Documents. In: SPIE Document Recognition and Retrieval (DR&R X), Santa Clara (January 2003)
Ivory, M.Y., Hearst, M.A.: Improving Web Site Design. IEEE Internet Computing 6(2), 56–63 (2002)
Jarvelin, K., Kekalainen, J.: IR evaluation methods for retrieving highly relevant documents. In: SIGIR (2000)
Joachims, T.: Optimizing Search Engines Using Clickthrough Data. In: Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining, ACM Press, New York (2002)
Lempel, R., Soffer, A.: PicASHOW: Pictorial Authority Search by Hyperlinks on the Web. In: Proc. 10th Int. World Wide Web Conf, Hong Kong, China, pp. 438–448 (2001)
Robertson, S.E., Walker, S., Beaulieu, M.: Okapi at TREC–7: Automatic Ad-hoc, Filtering, VLC and Filtering Tracks. In: Proceedings of TREC’99 (1999)
Shi, S., Song, R., Wen, J.R.: Latent Additivity: Combining Homogeneous Evidence. Technique report, MSR-TR-2006-110, Microsoft Research (August 2006)
Smith, J.R., Chang, S.: VisualSeek: A Fully Automated Content-Based Image Query System. In: Proc. ACM Conf. Multimedia, ACM Press, New York (1996)
Song, R., et al.: Learning Block Importance Models for Web Pages. In: WWW’2004 (2004)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer Berlin Heidelberg
About this paper
Cite this paper
Yu, Q., Shi, S., Li, Z., Wen, JR., Ma, WY. (2007). Improve Ranking by Using Image Information. In: Amati, G., Carpineto, C., Romano, G. (eds) Advances in Information Retrieval. ECIR 2007. Lecture Notes in Computer Science, vol 4425. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-71496-5_62
Download citation
DOI: https://doi.org/10.1007/978-3-540-71496-5_62
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-71494-1
Online ISBN: 978-3-540-71496-5
eBook Packages: Computer ScienceComputer Science (R0)