Abstract
Automatic image annotation is fundamental for effective image browsing and search. With the increasing size of image collections such as web images, it is infeasible to manually label large numbers of images. Meanwhile, the textual information contained in the hosting web pages can be used as approximate image description. However, such information is not accurate enough. In this paper, we propose a framework to utilize the visual content, the textual context, and the semantic relations between keywords to refine the image annotation. The hypergraph is used to model the textual information and the semantic relation is deduced by WordNet. Experiments on large-scale dataset demonstrate the effectiveness of the proposed method.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Blei, D., Jordan, M.: Modeling Annotated Data. In: 26th International Conference on Research and Development in Information Retrieval, ACM Press, New York (2003)
Barnard, K., Duygulu, P., Forsyth, D.: Clustering Art. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 2, p. 434 (2001)
Budanitsky, A., Hirst, G.: Semantic distance in WordNet: An experimental, Application-oriented Evaluation of Five measure. In: Workshop on WordNet and Other Lexical Resources, the North American Chapter of the ACL, Pittsburgh (2001)
Carneiro, G., Vasconcelos, N.: Formulating Semantic Image Annotation as a Supervised Learning Problem. In: CVPR 2005, Washington, pp. 163–168 (2005)
Duygulu, P., Barnard, K., Freitas, J., Forsyth, D.: Object Recognition as Machine Translation: Learning a Lexicon for a Fixed Image Vocabulary. In: Heyden, A., Sparr, G., Nielsen, M., Johansen, P. (eds.) ECCV 2002. LNCS, vol. 2353, pp. 97–112. Springer, Heidelberg (2002)
Feng, S., Manmatha, R., Laverenko, V.: Multiple Bernoulli Relevance Models for Image and Video Annotation. In: CVPR 2004, pp. 1002–1009 (2004)
He, J., Li, M., Zhang, H.-J., Tong, H., Zhang, C.: Manifold-ranking based Image Retrieval. In: Proceedings of ACM Multimedia 2004, pp. 9–16 (2004)
Jiang, J., Conrath, D.: Semantic Similarity Based on Corpus Statistics and Lexical Taxonomy. In: Proceedings of Intl. Conf. Research on Computational Linguistics (1997)
Jin, Y., Khan, L., Wang, L., Awad, M.: Image Annotations By Combining Multiple Evidence & WordNet. In: ACM Multimedia 2005, Singapore, pp. 706–715 (2005)
Lavrenko, V., Manmatha, R., Jeon, J.: A Model for Learning the Semantics of Pictures. In: Proceedings of Advance in Neutral Information Processing (2003)
Miller, G.A.: WordNet: A lexical database for English. Communication of ACM 38(11), 4–39 (1995)
Perdersen, T., Patwardhan, S., Michelizzi, J.: WordNet:Similarity – Measuring the Relatedness of Concepts. In: North American Chapter of ACL, May 3-5, 2004, Boston (2004)
Porter, M.F.: An Algorithm for Suffix Stripping. Program 14(3), 130–137 (1980)
Pucher, M.: Performance Evaluation of WordNet-based Semantic Relatedness Measures for Word Prediction in Conversational Speech. In: Sixth International Workshop on Computational Semantics, Tilburg, Netherlands (2005)
Resnik, P.: Using Information Content to Evaluate Semantic Similarity in a taxonomy. In: Proceedings of International Joint Conference on Artificial Intelligence, pp. 448–453 (1995)
Shashua, A., Zass, R., Hazan, T.: Multi-way Clustering Using Super-Symmetric Non-negative Tensor Factorization. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006. LNCS, vol. 3954, pp. 595–608. Springer, Heidelberg (2006)
Smeulders, A.W.M., Worring, M., Santini, S., Gupta, A., Jain, R.: Content-based image retrieval at the end of the early years. IEEE PAMI 22(12), 1349–1380 (2000)
Tong, H., He, J., Li, M., Zhang, C., Ma, W.-Y.: Graph Based Multi-Modality Learning. In: Proceedings of ACM Multimedia 2005, pp. 862–871 (2005)
Zhou, D., Bousquet, O., Lal, T.N., Weston, J., Schölkopf, B.: Learning with Local and Global Consistency. In: NIPS, pp. 237–244 (2003)
Zhou, D., Huang, J.: Schölkopf B., Beyond Pairwise Classification and Clustering Using Hypergraphs. MPI Technical Report (143), Tübingen, Germany (2005)
Zhou, X., Huang, T.: Unifying keywords and Visual Contents in Image Retrieval. IEEE Multimedia Magazine (April-June), 23–33 (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wang, B., Li, Z., Li, M. (2006). Automatic Refinement of Keyword Annotations for Web Image Search. In: Cham, TJ., Cai, J., Dorai, C., Rajan, D., Chua, TS., Chia, LT. (eds) Advances in Multimedia Modeling. MMM 2007. Lecture Notes in Computer Science, vol 4351. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-69423-6_26
Download citation
DOI: https://doi.org/10.1007/978-3-540-69423-6_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-69421-2
Online ISBN: 978-3-540-69423-6
eBook Packages: Computer ScienceComputer Science (R0)