ABSTRACT
In this paper, an automatic image-text alignment algorithm is developed for achieving more accurate indexing and retrieval of large-scale web images. First, large-scale web pages are crawled, where the informative images and their most relevant auxiliary text blocks are extracted. Second, parallel image clustering is performed to partition large-scale informative web images into a large number of clusters. By grouping the visually-similar (near-duplicate) web images into the same cluster, our parallel image clustering algorithm can significantly reduce the huge uncertainty on the relatedness between the web images and their auxiliary text terms, which can provide a good starting point for supporting automatic image-text alignment. Finally, a relevance re-ranking algorithm is developed to identify the most relevant visual text terms for the visually-similar web images in the same cluster. Our experiments on large-scale web images have obtained very positive results.
- S. Feng, V. Lavrenko, R. Manmatha, "Multiple Bernoulli relevance models for image and video annotation", ACM SIGIR, 2004.Google ScholarCross Ref
- T. L. Berg, A. C. Berg, J. Edwards, D. A. Forsyth, "Who's in the picture", NIPS, 2004.Google Scholar
- N. Zhou, J. Fan, "Automatic image-text alignment for large-scale web image indexing and retrieval", Pattern Recognition, vol. 48, no. 1, pp. 205--219, 2015.Google ScholarDigital Library
- R. Fergus, L. Fei-Fei, P. Perona, A. Zisserman, "Learning object categories from Google's image search", IEEE CVPR, 2006.Google Scholar
- D. Cai, X. He, Z. Li, W.-Y. Ma, J.-R. Wen, "Hierarchical clustering of WWW image search resultsusing visual, textual, and link information", ACM Multimedia, 2004. Google ScholarDigital Library
- X.-J. Wang, W.-Y. Ma, G.-R. Xue, X. Li, "Multi-modal similarity propagation and its applicationfor web image retrieval", ACM Multimedia, 2004. Google ScholarDigital Library
- Y. Gao, J. Peng, H. Luo, D.A. Keim, J. Fan, "An interactive approach for filtering out junk images from keyword-based Google search results", IEEE Trans. Circuits Syst. Video Techn., vol. 19, no. 12, pp. 1851--1865, 2009. Google ScholarDigital Library
- J. Fan, Y. Shen, N. Zhou, Y. Gao, "Harvesting large-scale weakly-tagged image databases from the web", IEEE CVPR, pp. 802--809, 2010.Google ScholarCross Ref
- P. Pham, M. Moens, T. Tuytelaars, "Cross-media alignment of names and faces", IEEE Trans. on Multimedia, vol. 12, no. 1, 2010. Google ScholarDigital Library
- B. Frey, D. Dueck, "Clustering by passing messages between data points", Science, vol. 315, pp. 972--976, 2007.Google ScholarCross Ref
- D. Liu, X.-S, Hua, L. Yang, M. Wang, H.-J. Zhang, "Tag ranking", WWW, 2009.Google Scholar
- Y. Shen, J. Fan, "Leveraging loosely-tagged images and inter-object correlations for tag recommendation." ACM Multimedia, pp.5--14, 2010. Google ScholarDigital Library
- I. Givoni, C. Chung, B. J. Frey, "Hierarchical affinity propagation", UAI, 2011.Google Scholar
- Y. Jia, J. Wang, C. Zhang, X. S. Hua, "Finding image exemplars using fast sparse affinity propagation", ACM Multimedia, 2008. Google ScholarDigital Library
- W. Hsu, L. Kennedy, S. F. Chang, "Video search reranking via information bottleneck principle", ACM Multimedia, 2006. Google ScholarDigital Library
- N. Zhou, Y. Shen, J. Peng, X. Feng, J. Fan, "Leveraging auxiliary text terms for automatic image annotation", WWW, 2011. Google ScholarDigital Library
- J. Liu, W. Lai, X.-S. Hua, Y. Huang, S. Li, "Video search re-ranking via multi-graph propagation", ACM Multimedia, 2007. Google ScholarDigital Library
- C. Wang, F. Jing, L. Zhang, H. J. Zhang, "Image annotation refinement using random walk with restarts", ACM Multimedia, 2006. Google ScholarDigital Library
Index Terms
- Parallel AP Clustering and Re-ranking for Automatic Image-Text Alignment and Large-Scale Web Image Search
Recommendations
An automatic image-text alignment method for large-scale web image retrieval
For reducing huge uncertainty on the relatedness between the web images and their auxiliary text terms, an automatic image-text alignment algorithm is developed to achieve more accurate indexing and retrieval of large-scale web images by assigning the ...
Automatic image-text alignment for large-scale web image indexing and retrieval
In this paper, an automatic image-text alignment algorithm is developed to achieve more effective indexing and retrieval of large-scale web images by aligning web images with their most relevant auxiliary text terms or phrases. First, a large number of ...
Automatic image annotation by using relevant keywords extracted from auxiliary text documents
VLS-MCMR '10: Proceedings of the international workshop on Very-large-scale multimedia corpus, mining and retrievalIn this paper, a novel algorithm is developed to enable automatic image annotation by aligning web images with their most relevant auxiliary text terms. First, large-scale web pages are crawled and automatic web page segmentation is performed to extract ...
Comments