skip to main content
10.1145/2072298.2071979acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
short-paper

Multimodal fusion for video copy detection

Published: 28 November 2011 Publication History

Abstract

Content-based video copy detection algorithms (CBCD) focus on detecting video segments that are identical or transformed versions of segments in a known video. In recent years some systems have proposed the combination of orthogonal modalities (e.g. derived from audio and video) to improve detection performance, although not always achieving consistent results. In this paper we propose a fusion algorithm that is able to combine as many modalities as available at the decision level. The algorithm is based on the weighted sum of the normalized scores, which are modified depending on how well they rank in each modality. This leads to a virtually parameter-free fusion algorithm. We performed several tests using 2010 TRECVID VCD datasets and obtain up to 46% relative improvement in min-NDCR while also improving the F1 metric on the fused results in comparison to just using the best single modality.

References

[1]
J. M. Barrios and B. Bustos. Content-based video copy detection: PRISMA at trecvid 2010. In Proc. NIST-TRECVID Workshop, 2010.
[2]
J. Haitsma and T. Kalker. A highly robust audio fingerprinting system. In Proc. ISMIR, 2002.
[3]
H. Jegou, M. Douze, G. Gravier, C. Schmid, and P. Gros. INRIA LEAR-TEXMEX: Video copy detection task. In Proc. NIST-TRECVID Workshop, 2010.
[4]
A. Joly, O. Buisson, and C. Frélicot. Content-based copy retrieval using distortion-based probabilistic similarity search. IEEE Trans. on Multimedia, 9(2):293--306, 2007.
[5]
D.-D. Le, S. Poullot, M. Crucianu, X. Wu, M. Nett, M. E. Houle, and S. Satoh. National institute of informatics, japan at TRECVID 2009. In Proc. NIST-TRECVID Workshop, 2009.
[6]
Y. Liang, B. Cao, J. Li, C. Zhu, Y. Zhang, C. Tan, G. Chen, C. Sun, J. Yuan, M. Xu, and B. Zhang. THU-ING at TRECVID 2009. In Proc. NIST-TRECVID Workshop, 2009.
[7]
B. S. Manjunath, J.-R. Ohm, V. V. Vasudevan, and A. Yamada. Color and texture descriptors. IEEE Trans. on Circuits and Systems for Video Tech., 11(6):703--715, 2001.
[8]
D. Marimon, A. Bonnin, T. Adamek, and R. Gimeno. DARTs: Efficient scale-space extraction of daisy keypoints. In Proc. CVPR, 2010.
[9]
R. Mukai, T. Kurozumi, K. Hiramatsu, T. Kawanishi, H. Nagano, and K. Kashiro. NTT communications science laboratories at TRECVID 2010 content-based copy detection. In Proc. NIST-TRECVID Workshop, 2010.
[10]
A. Narsev, S. Bao, J. Chang, M. Hill, M. Merler, J. R. Smith, D. Wang, L. Xie, R. Yan, and Y. Zhang. IBM research TRECVID-2009 video retrieval system. In Proc. NIST-TRECVID Workshop, 2009.
[11]
A. Saracoglu, E. Esen, T. Ates, B. O. Acar, U. Zubari, E. C. Ozan, E. Ozalp, A. A. Alatan, and T. Ciloglu. Content based copy detection with coarse audio-visual fingerprints. In Proc. CBMI, pages 213--218, June 2009.
[12]
A. F. Smeaton, P. Over, and W. Kraaij. Evaluation campaigns and trecvid. In MIR, pages 321--330, New York, NY, USA, 2006. ACM Press.
[13]
Y. Uchida, S. Sakazawa, M. Agrawal, and M. Akbacak. KDDI labs and SRI international at trecvid 2010: Content-based copy detection. In Proc. NIST-TRECVID Workshop, 2010.
[14]
E. Younessian, X. Anguera, T. Adamek, N. Oliver, and D. Marimon. Telefonica research at trecvid 2010 content-based copy detection. In Proc. NIST-TRECVID Workshop, 2010.

Cited By

View all
  • (2015)TASCACM Transactions on Information Systems10.1145/269966233:2(1-34)Online publication date: 17-Feb-2015
  • (2014)An information theoretic similarity measure for unified multimedia document retrieval7th International Conference on Information and Automation for Sustainability10.1109/ICIAFS.2014.7069625(1-6)Online publication date: Dec-2014
  • (2014)An Unified Approach for Multimedia Document Representation and Document SimilarityProceedings of the 2014 IEEE 17th International Conference on Computational Science and Engineering10.1109/CSE.2014.76(249-256)Online publication date: 19-Dec-2014
  • Show More Cited By

Index Terms

  1. Multimodal fusion for video copy detection

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    MM '11: Proceedings of the 19th ACM international conference on Multimedia
    November 2011
    944 pages
    ISBN:9781450306164
    DOI:10.1145/2072298
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 28 November 2011

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. content-based video copy detection
    2. fusion
    3. multimodal

    Qualifiers

    • Short-paper

    Conference

    MM '11
    Sponsor:
    MM '11: ACM Multimedia Conference
    November 28 - December 1, 2011
    Arizona, Scottsdale, USA

    Acceptance Rates

    Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)4
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 22 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2015)TASCACM Transactions on Information Systems10.1145/269966233:2(1-34)Online publication date: 17-Feb-2015
    • (2014)An information theoretic similarity measure for unified multimedia document retrieval7th International Conference on Information and Automation for Sustainability10.1109/ICIAFS.2014.7069625(1-6)Online publication date: Dec-2014
    • (2014)An Unified Approach for Multimedia Document Representation and Document SimilarityProceedings of the 2014 IEEE 17th International Conference on Computational Science and Engineering10.1109/CSE.2014.76(249-256)Online publication date: 19-Dec-2014
    • (2014)Rotation and flipping robust region binary patterns for video copy detectionJournal of Visual Communication and Image Representation10.1016/j.jvcir.2013.12.00325:2(373-383)Online publication date: 1-Feb-2014

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media