skip to main content
10.1145/2802558.2814644acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

Score Propagation Based on Similarity Shot Graph for Improving Visual Object Retrieval

Published: 30 October 2015 Publication History

Abstract

The Visual Object Retrieval problem consists in locating the occurrences of a specific entity in an image/video dataset. In this work, we focus on discovering new occurrences of an entity by propagating the detection scores of already computed candidates to other video segments. The score propagation follows the edges of a pre-computed Similarity Shot Graph (SSG). The SSG connects video segments that are similar according to some criterion. Four methods for creating the SSG are presented: two based on computing and comparing low-level visual features, one based on comparing text transcriptions, and other based on computing and comparing high-level concepts.
The score propagation is evaluated on the INS 2014 dataset. The results show that the detection performance can be slightly improved by the proposed algorithm. However, the performance is variable and depends on the properties of the SSG and the target entity. It is part of the future work to automatically decide the kind of SSG that will be used to propagate scores given a set of detection candidates.

References

[1]
Feature Detectors and Descriptors: The State Of The Art and Beyond. Feature Detection Code., 2010. http://kahlan.eps.surrey.ac.uk/featurespace/web/.
[2]
R. Arandjelovic and A. Zisserman. Three things everyone should know to improve object retrieval. In Proc. of CVPR, pages 2911--2918. IEEE, 2012.
[3]
J. M. Barrios, J. M. Saavedra, F. Ramirez, and D. Contreras. Orand at trecvid 2014: Instance search and multimedia event detection. In Proc. of TRECVID. NIST, USA, 2014.
[4]
J. Delhumeau, P.-H. Gosselin, H. Jégou, and P. Pérez. Revisiting the vlad image representation. In Proc. of ACM Multimedia, pages 653--656. ACM, 2013.
[5]
H. Jégou, M. Douze, and C. Schmid. Hamming embedding and weak geometric consistency for large scale image search. In Proc. of ECCV, pages 304--317. Springer, 2008.
[6]
Y. Jia, E. Shelhamer, J. Donahue, S. Karayev, J. Long, R. Girshick, S. Guadarrama, and T. Darrell. Caffe: Convolutional architecture for fast feature embedding. arXiv preprint arXiv:1408.5093, 2014.
[7]
D.-D. Le, C.-Z. Zhu, S. Poullot, V. Q. Lam, D. A. Duong, and S. Satoh. National institute of informatics, japan at trecvid 2011. In Proc. of TRECVID. NIST, USA, 2011.
[8]
P. Over, G. Awad, M. Michel, J. Fiscus, G. Sanders, W. Kraaij, A. F. Smeaton, and G. Quéenot. Trecvid 2014 -- an overview of the goals, tasks, data, evaluation mechanisms and metrics. In Proceedings of TRECVID 2014. NIST, USA, 2014.
[9]
L. Pang, W. Zhang, H.-K. Tan, and C.-W. Ngo. Video hyperlinking: Libraries and tools for threading and visualizing large video collection. In Proc. of ACM Multimedia, pages 1461--1464. ACM, 2012.
[10]
J. Philbin and A. Zisserman. Object mining using a matching graph on very large image collections. In Proc. of ICVGIP, pages 738--745. IEEE, 2008.
[11]
S. Poullot, M. Crucianu, and O. Buisson. Scalable mining of large video databases using copy detection. In Proc. of ACM Multimedia, pages 61--70. ACM, 2008.
[12]
A. F. Smeaton, P. Over, and W. Kraaij. Evaluation campaigns and TRECVid. In Proc. of MIR, pages 321--330. ACM, 2006.
[13]
P. Turcot and D. G. Lowe. Better matching with fewer features: The selection of useful features in large database recognition problems. In Proc. of ICCV, pages 2109--2116. IEEE, 2009.

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
SLAM '15: Proceedings of the Third Edition Workshop on Speech, Language & Audio in Multimedia
October 2015
48 pages
ISBN:9781450337496
DOI:10.1145/2802558
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 30 October 2015

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. content-based search
  2. multimedia information retrieval
  3. similarity graph
  4. visual object retrieval

Qualifiers

  • Research-article

Funding Sources

Conference

MM '15
Sponsor:
MM '15: ACM Multimedia Conference
October 30, 2015
Brisbane, Australia

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 57
    Total Downloads
  • Downloads (Last 12 months)1
  • Downloads (Last 6 weeks)0
Reflects downloads up to 05 Mar 2025

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media