ABSTRACT
Annotating or tagging multimedia objects is an important task for enhancing multimedia information retrieval processes. In the context of the Web, automatic tagging deals with many issues, such as loosely tagged images and huge collections of images with no textual data at all. Recently, graph representations have been shown useful for modeling relationships between images and their associated semantics. Using these types of graphs, it is possible to represent images and their textual labels as nodes, and the relationship between them as edges, under the assumption that visual similarity implies semantic similarity. In this work, we present an algorithm for automatic tag propagation in such a graph structure, called the visual-semantic graph. This graph has been used in prior work only for the task of image retrieval re-ranking. The goal of our work, is to show how the visual-semantic graph can be used for efficient tag propagation to unlabeled images. More specifically, our contributions are: (1) An algorithm to propagate tags automatically based on the breadth-first traversal and (2) A set of heuristics for pruning this approach for large size collections.
- D. Bhat and S. Nayar. Ordinal measures for images correspondence. IEEE transactions on pattern analysis and machine intelligence, 20(4):415--423, 1998. Google ScholarDigital Library
- X. Chen, Y. Mu, S. Yan, and T.-S. Chua. Efficient large-scale image annotation by probabilistic collaborative multi-label propagation. In Proceedings of the international conference on Multimedia, MM '10, pages 35--44, New York, NY, USA, 2010. ACM. Google ScholarDigital Library
- S. Gao, Z. Wang, L.-T. Chia, and I. W.-H. Tsang. Automatic image tagging via category label and web data. In Proceedings of the international conference on Multimedia, MM '10, pages 1115--1118, New York, NY, USA, 2010. ACM. Google ScholarDigital Library
- R. Likert. A technique for the measurement of attitudes. Archives of psychology, 1932.Google Scholar
- D. Liu, X.-S. Hua, M. Wang, and H.-J. Zhang. Image retagging. In Proceedings of the international conference on Multimedia, MM '10, pages 491--500, New York, NY, USA, 2010. ACM. Google ScholarDigital Library
- D. Liu, S. Yan, Y. Rui, and H.-J. Zhang. Unified tag analysis with multi-edge graph. In Proceedings of the international conference on Multimedia, MM '10, pages 25--34, New York, NY, USA, 2010. ACM. Google ScholarDigital Library
- B. Poblete, B. Bustos, M. Mendoza, and J. Barrios. Visual-semantic graph: Towards reducing the semantic gap in web image retrieval. 2010.Google Scholar
- Y. Shen and J. Fan. Leveraging loosely-tagged images and inter-object correlations for tag recommendation. In Proceedings of the international conference on Multimedia, MM '10, pages 5--14, New York, NY, USA, 2010. ACM. Google ScholarDigital Library
- A. W. M. Smeulders, M. Worring, S. Santini, A. Gupta, and R. Jain. Content-based image retrieval at the end of the early years. IEEE Transactions on Pattern Analysis and Machine Intelligence, 22(12):1349--1380, 2000. Google ScholarDigital Library
- J. Wang, F. Wang, C. Zhang, H. C. Shen, and L. Quan. Linear neighborhood propagation and its applications. IEEE Transactions on Pattern Analysis and Machine Intelligence, 31:1600--1615, 2009. Google ScholarDigital Library
Index Terms
- Automatic image tagging through information propagation in a query log based graph structure
Recommendations
Automatic video tagging using content redundancy
SIGIR '09: Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrievalThe analysis of the leading social video sharing platform YouTube reveals a high amount of redundancy, in the form of videos with overlapping or duplicated content. In this paper, we show that this redundancy can provide useful information about ...
Semi-Automatic Tagging of Photo Albums via Exemplar Selection and Tag Inference
As one of the emerging Web 2.0 activities, tagging becomes a popular approach to manage personal media data, such as photo albums. A dilemma in tagging behavior is the users' manual efforts and the tagging accuracy: exhaustively tagging all photos in an ...
Content redundancy in YouTube and its application to video tagging
The emergence of large-scale social Web communities has enabled users to share online vast amounts of multimedia content. An analysis of YouTube reveals a high amount of redundancy, in the form of videos with overlapping or duplicated content. We use ...
Comments