research-article

Image Retrieval by Cross-Media Relevance Fusion

Authors:
Jianfeng Dong

Zhejiang University, Hangzhou, China

Zhejiang University, Hangzhou, China
View Profile

,
Xirong Li

Renmin University of China, Beijing, China

Renmin University of China, Beijing, China
View Profile

,
Shuai Liao

Renmin University of China, Beijing, China

Renmin University of China, Beijing, China
View Profile

,
Jieping Xu

Renmin University of China, Beijing, China

Renmin University of China, Beijing, China
View Profile

,
Duanqing Xu

Zhejiang University, Hangzhou, China

Zhejiang University, Hangzhou, China
View Profile

,
Xiaoyong Du

Renmin University of China, Beijing, China

Renmin University of China, Beijing, China
View Profile

MM '15: Proceedings of the 23rd ACM international conference on MultimediaOctober 2015Pages 173–176https://doi.org/10.1145/2733373.2809929

Published:13 October 2015Publication History

MM '15: Proceedings of the 23rd ACM international conference on Multimedia

Pages 173–176

ABSTRACT

How to estimate cross-media relevance between a given query and an unlabeled image is a key question in the MSR-Bing Image Retrieval Challenge. We answer the question by proposing cross-media relevance fusion, a conceptually simple framework that exploits the power of individual methods for cross-media relevance estimation. Four base cross-media relevance functions are investigated, and later combined by weights optimized on the development set. With DCG25 of 0.5200 on the test dataset, the proposed image retrieval system secures the first place in the evaluation.

References

Y. Bai, W. Yu, T. Xiao, C. Xu, K. Yang, W.-Y. Ma, and T. Zhao. Bag-of-words based deep neural network for image retrieval. In ACM MM, 2014. Google ScholarDigital Library
Q. Fang, H. Xu, R. Wang, S. Qian, T. Wang, J. Sang, and C. Xu. Towards msr-bing challenge: Ensemble of diverse models for image retrieval. In MSR-Bing IRC 2013 Workshop, 2013.Google Scholar
R. Goulden, P. Nation, and J. Read. How large can a receptive vocabulary be? Applied Linguistics, 11(4):341--363, 1990.Google ScholarCross Ref
X. S. Hua, L. Yang, J. Wang, J. Wang, M. Ye, K. Wang, Y. Rui, and J. Li. Clickage: Towards bridging semantic and intent gaps via mining click logs of search engines. ACM MM, 2013. Google ScholarDigital Library
Y. Jia, E. Shelhamer, J. Donahue, S. Karayev, J. Long, R. Girshick, S. Guadarrama, and T. Darrell. Caffe: Convolutional architecture for fast feature embedding. arXiv:1408.5093, 2014.Google Scholar
X. Li, S. Liao, W. Lan, X. Du, and G. Yang. Zero-shot image tagging by hierarchical semantic embedding. SIGIR, 2015. Google ScholarDigital Library
X. Li, C. Snoek, M. Worring, and A. Smeulders. Fusing concept detection and geo context for visual search. In ICMR, 2012. Google ScholarDigital Library
D. Metzler and B. Croft. Linear feature-based models for information retrieval. Inf. Retr., 10(3):257--274, 2007. Google ScholarDigital Library
T. Mikolov, K. Chen, G. Corrado, and J. Dean. Efficient estimation of word representations in vector space. In ICLR, 2013.Google Scholar
M. Norouzi, T. Mikolov, S. Bengio, Y. Singer, J. Shlens, A. Frome, G. Corrado, and J. Dean. Zero-shot learning by convex combination of semantic embeddings. ICLR, 2014.Google Scholar
Y. Pan, T. Yao, X. Tian, H. Li, and C.-W. Ngo. Click-through-based subspace learning for image search. In ACM MM, 2014. Google ScholarDigital Library
Y. Pan, T. Yao, K. Yang, H. Li, C.-W. Ngo, J. Wang, and T. Mei. Image search by graph-based label propagation with image representation from dnn. In ACM MM, 2013. Google ScholarDigital Library
C.-C. Wu, K.-Y. Chu, Y.-H. Kuo, Y.-Y. Chen, W.-Y. Lee, and W. H. Hsu. Search-based relevance association with auxiliary contextual cues. In ACM MM, 2013. Google ScholarDigital Library
Z. Xu, Y. Yang, A. Kassim, and S. Yan. Cross-media relevance mining for evaluating text-based image search engine. In ICME, 2014.Google Scholar
B. Zhou, A. Lapedriza, J. Xiao, A. Torralba, and A. Oliva. Learning deep features for scene recognition using places database. NIPS, 2014.Google ScholarDigital Library

Index Terms

Image Retrieval by Cross-Media Relevance Fusion
1. Information systems
  1. Information retrieval

Recommendations

Cross-media Relevance Computation for Multimedia Retrieval
MM '17: Proceedings of the 25th ACM international conference on Multimedia

In this paper, we summarize our works for cross-media retrieval where the queries and retrieval content are of different media types. We study cross-media retrieval in the context of two applications, i.e., ~image retrieval by textual queries, and ...
Read More
Semantic-based cross-media image retrieval
ICAPR'05: Proceedings of the Third international conference on Pattern Recognition and Image Analysis - Volume Part II

In this paper, we propose a novel method for cross-media semantic-based information retrieval, which combines classical text- based and content-based image retrieval techniques. This semantic-based approach aims at determining the strong relationships ...
Read More
Cross-Language and Cross-Media Image Retrieval: An Empirical Study at ImageCLEF2007
Advances in Multilingual and Multimodal Information Retrieval

This paper summarizes our empirical study of cross-language and cross-media image retrieval at the CLEF image retrieval track (ImageCLEF2007). In this year, we participated in the ImageCLEF photo retrieval task, in which the goal of the retrieval task ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
MM '15: Proceedings of the 23rd ACM international conference on Multimedia
October 2015
1402 pages
ISBN:9781450334594
DOI:10.1145/2733373
General Chairs:
Xiaofang Zhou
The University of Queensland, Australia
,
Alan F. Smeaton
Dublin City University, Ireland
,
Qi Tian
The University of Texas at San Antonio, USA
,
Program Chairs:
Dick C.A. Bulterman
FXPAL, USA
,
Heng Tao Shen
The University of Queensland, Australia
,
Ketan Mayer-Patel
The University of North Carolina, USA
,
Shuicheng Yan
National University of Singapore, Singapore
Copyright © 2015 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 13 October 2015
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
cross-media relevance fusion
image retrieval challenge
Qualifiers
- research-article
Conference

Acceptance Rates
MM '15 Paper Acceptance Rate56of252submissions,22%Overall Acceptance Rate995of4,171submissions,24%
More
Upcoming Conference
MM '24

Sponsor:

sigmm

MM '24: The 32nd ACM International Conference on Multimedia

October 28 - November 1, 2024

Melbourne , VIC , Australia
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 7
  Total Citations
  View Citations
- 366
  Total Downloads
- Downloads (Last 12 months)8
- Downloads (Last 6 weeks)3
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Image Retrieval by Cross-Media Relevance Fusion

MM '15: Proceedings of the 23rd ACM international conference on Multimedia

ABSTRACT

References

Cited By

Index Terms

Recommendations

Cross-media Relevance Computation for Multimedia Retrieval

Semantic-based cross-media image retrieval

Cross-Language and Cross-Media Image Retrieval: An Empirical Study at ImageCLEF2007