skip to main content
10.1145/1873951.1874125acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
short-paper

Semantic video indexing by fusing explicit and implicit context spaces

Published: 25 October 2010 Publication History

Abstract

This paper addresses the problem of context-based concept fusion (CBCF) for concept detection and semantic video indexing. We introduce a novel framework based on constructing context spaces of concepts, such that the contextual correlations are used to improve the performance of concept detectors. Different from traditional CBCF approach, we present two kinds of such context spaces: explicit context space for modeling the correlation of pairwise concepts, and implicit context space for representing latent themes trained from a set of concepts. The final concept detection scores are then directly fused from explicit and implicit context spaces. Experiments are presented on TRECVid 2006 benchmark and the comparisons with several state-of-the-art approaches demonstrate the effectiveness of proposed framework.

References

[1]
W. Jiang, S.-F. Chang, and A. C. Loui. Active context-based concept fusion with partial user labels. In ICIP, pages 2917--2920, 2006.
[2]
W. Jiang, S.-F. Chang, and A. C. Loui. Context-based concept fusion with boosted conditional random fields. In ICASSP, pages 949--952, 2007.
[3]
Y.-G. Jiang, J. Wang, S.-F. Chang, and C.-W. Ngo. Domain adaptive semantic diffusion for large scale context-based video annotation. In ICCV, 2009.
[4]
Y.-G. Jiang, J. Yang, C.-W. Ngo, and A. G. Hauptmann. Representations of keypoint-based semantic concept detection: A comprehensive study. IEEE Transaction on Multimedia, 12(1):42--53, 2010.
[5]
L. S. Kennedy and S.-F. Chang. A reranking approach for context-based concept fusion in video indexing and retrieval. In CIVR, pages 333--340, 2007.
[6]
H. Lee, A. Battle, R. Raina, and A. Y. Ng. Efficient sparse coding algorithms. In NIPS, pages 801--808, 2007.
[7]
M. Naphade, J. R. Smith, J. Tesic, S. F. Chang, W. Hsu, L. Kennedy, A. Hauptmann, and J. Curtis. Large-scale concept ontology for multimedia. IEEE Multimedia, 13(3):86--91, 2006.
[8]
G.-J. Qi, X.-S. Hua, Y. Rui, J. Tang, T. Mei, and H.-J. Zhang. Correlative multi-label video annotation. In ACM Multimedia, pages 17--26, 2007.
[9]
A. F. Smeaton, P. Over, and W. Kraaij. Evaluation campaigns and TRECVid. In MIR, pages 321--330, 2006.
[10]
A. F. Smeaton, P. Over, and W. Kraaij. High-level feature detection from video in TRECVid: a 5-year retrospective of achievements. In Multimedia Content Analysis, Theory and Applications, pages 151--174. 2009.
[11]
J. R. Smith, M. Naphade, and A. Natsev. Multimedia semantic indexing using model vectors. In ICME, 2003.
[12]
C. G. M. Snoek, M.Worring, J. C. van Gemert, J.-M. Geusebroek, and A. W. M. Smeulders. The challenge problem for automated detection of 101 semantic concepts in multimedia. In ACM Multimedia, pages 421--430, 2006.
[13]
K. E. A. van de Sande, T. Gevers, and C. G. M. Snoek. Evaluating color descriptors for object and scene recognition. IEEE Transaction on Pattern Analysis and Machine Intelligence, 32(9):1582--1596, 2010.
[14]
X.-Y. Wei, Y.-G. Jiang, and C.-W. Ngo. Exploring inter-concept relationship with context space for semantic video indexing. In CIVR, 2009.
[15]
M. F. Weng and Y. Y. Chuang. Multi-cue fusion for semantic video indexing. In ACM Multimedia, pages 71--80, 2008.
[16]
A. Yanagawa, S.-F. Chang, L. Kennedy, and W. Hsu. Columbia University's baseline detectors for 374 LSCOM semantic visual concepts. Technical report, Columbia University, 2007.
[17]
Z.-J. Zha, T. Mei, X.-S. Hua, G.-J. Qi, and Z. Wang. Refining video annotation by exploiting pairwise concurrent relation. In ACM Multimedia, pages 345--348, 2007.

Cited By

View all
  • (2018)Precise Temporal Action Localization by Evolving Temporal ProposalsProceedings of the 2018 ACM on International Conference on Multimedia Retrieval10.1145/3206025.3206029(388-396)Online publication date: 5-Jun-2018
  • (2015)Efficient Heuristic Methods for Multimodal Fusion and Concept Fusion in Video Concept DetectionIEEE Transactions on Multimedia10.1109/TMM.2015.239819517:4(498-511)Online publication date: Apr-2015
  • (2013)Exploiting Semantic and Visual Context for Effective Video AnnotationIEEE Transactions on Multimedia10.1109/TMM.2013.225026615:6(1400-1414)Online publication date: 1-Oct-2013
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
MM '10: Proceedings of the 18th ACM international conference on Multimedia
October 2010
1836 pages
ISBN:9781605589336
DOI:10.1145/1873951
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 25 October 2010

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. TRECVid
  2. context space
  3. context-based concept fusion
  4. semantic video indexing
  5. sparse coding

Qualifiers

  • Short-paper

Conference

MM '10
Sponsor:
MM '10: ACM Multimedia Conference
October 25 - 29, 2010
Firenze, Italy

Acceptance Rates

Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)7
  • Downloads (Last 6 weeks)1
Reflects downloads up to 28 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2018)Precise Temporal Action Localization by Evolving Temporal ProposalsProceedings of the 2018 ACM on International Conference on Multimedia Retrieval10.1145/3206025.3206029(388-396)Online publication date: 5-Jun-2018
  • (2015)Efficient Heuristic Methods for Multimodal Fusion and Concept Fusion in Video Concept DetectionIEEE Transactions on Multimedia10.1109/TMM.2015.239819517:4(498-511)Online publication date: Apr-2015
  • (2013)Exploiting Semantic and Visual Context for Effective Video AnnotationIEEE Transactions on Multimedia10.1109/TMM.2013.225026615:6(1400-1414)Online publication date: 1-Oct-2013
  • (2013)Clustering based rescoring for semantic indexing of multimedia documents2013 11th International Workshop on Content-Based Multimedia Indexing (CBMI)10.1109/CBMI.2013.6576550(41-46)Online publication date: Jun-2013
  • (2012)A Two-View Concept Correlation Based Video Annotation RefinementIEEE Signal Processing Letters10.1109/LSP.2012.218938619:5(259-262)Online publication date: May-2012
  • (2012)Empowering Cross-Domain Internet Media with Real-Time Topic Learning from Social StreamsProceedings of the 2012 IEEE International Conference on Multimedia and Expo10.1109/ICME.2012.105(49-54)Online publication date: 9-Jul-2012
  • (2012)Two-layers re-ranking approach based on contextual information for visual concepts detection in videos2012 10th International Workshop on Content-Based Multimedia Indexing (CBMI)10.1109/CBMI.2012.6269837(1-6)Online publication date: Jun-2012
  • (2012)Temporal-Spatial refinements for video concept fusionProceedings of the 11th Asian conference on Computer Vision - Volume Part III10.1007/978-3-642-37431-9_42(547-559)Online publication date: 5-Nov-2012

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media