skip to main content
10.1145/2526188.2526204acmotherconferencesArticle/Chapter ViewAbstractPublication PageswebmediaConference Proceedingsconference-collections
research-article

An efficient access method for multimodal video retrieval

Published:05 November 2013Publication History

ABSTRACT

Efficient and effective handling of video documents depends on the availability of indexes. Manual indexing is unfeasible for large video collections. Video combines different types of data from different modalities. Using information from multiple modalities may result in a more robust and accurate video retrieval. Therefore, effective indexing for video retrieval requires a multimodal approach in which either the most appropriate modality is selected or the different modalities are used in collaborative fashion. This paper presents a new metric access method -- Slim2-tree -- which combines information from multiple modalities within a single index structure for video retrieval. Experimental studies on a large real dataset show the video similarity search performance of the proposed technique. Additionally, we present experiments comparing our method against state-of-the-art of multimodal solutions. Comparative test results demonstrate that our technique improves the performance of video similarity queries.

References

  1. J. Almeida, E. Valle, R. S. Torres, and N. J. Leite. DAHC-tree: An effective index for approximate search in high-dimensional metric spaces. JIDM, 1(3):375--390, 2010.Google ScholarGoogle Scholar
  2. P. K. Atrey, M. A. Hossain, A. El Saddik, and M. S. Kankanhalli. Multimodal fusion for multimedia analysis: a survey. Multimedia Systems, 16(6):345--379, Apr. 2010.Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. P. H. Bugatti. Analise da in uencia de funcoes de distancia para o processamento de consultas por similaridade em recuperacao de imagens por conteudo. Master's thesis, Universidade de Sao Paulo, 2008.Google ScholarGoogle Scholar
  4. B. Bustos, S. Kreft, and T. Skopal. Adapting metric indexes for searching in multi-metric spaces. Multimedia Tools Appl., 58(3):467--496, June 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. E. Chavez, G. Navarro, R. Baeza-Yates, and J. L. Marroquín. Searching in metric spaces. ACM Comput. Surv., 33(3):273--321, Sept. 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. P. Ciaccia and M. Patella. The M2-tree: Processing complex multi-feature queries with just one index. In 1st DELOS Workshop: ISSQDL, 2000.Google ScholarGoogle Scholar
  7. P. Ciaccia, M. Patella, and P. Zezula. M-tree: An efficient access method for similarity search in metric spaces. In Proc. 23rd VLDB'97, pages 426--435, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. D. B. Coimbra and R. Goularte. Segmentação multimodal de cenas em telejornais. In XVII Webmedia, pages 229--236, 2011.Google ScholarGoogle Scholar
  9. M. Doller, F. Stegmaier, S. Jans, and H. Kosch. TempoM2: A multi feature index structure for temporal video search. In AMM, volume 7131, pages 323--333. Springer, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. V. Gaede and O. Gunther. Multidimensional access methods. ACM Comput. Surv., 30(2):170--231, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. T. Ganchev, N. Fakotakis, and G. Kokkinakis. Comparative evaluation of various MFCC implementations on the speaker verification task. In Proc. SPECOM, pages 191--194, 2005.Google ScholarGoogle Scholar
  12. S.-T. Goh and K.-L. Tan. MOSAIC: A fast multi-feature image retrieval system. Data & Knowledge Engineering, 33(3):219 -- 239, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Y. He and J. Yu. MFI-tree: An effective multi-feature index structure for weighted query application. Comput. Sci. Inf. Syst., 7(1):139--152, 2010.Google ScholarGoogle ScholarCross RefCross Ref
  14. A. Oliva and A. Torralba. Modeling the shape of the scene: A holistic representation of the spatial envelope. IJCV, 42(3):145--175, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. J. Shao, H. T. Shen, and X. Zhou. Challenges and Techniques for Effective and Efficient Similarity Search in Large Video Databases. PLDV, 1(2):1598--1603, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. C. Traina, A. Traina, B. Seeger, and C. Faloutsos. Slim-trees: High performance metric trees minimizing overlap between nodes. In 7th EDBT, pages 51--65, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. T. G. Vespa. Operacao de carga-rapida (bulk-loading) em metodos de acesso metricos. Master's thesis, Universidade de S~ao Paulo, Sao Carlos, 2007.Google ScholarGoogle Scholar
  18. R. Yan and A. G. Hauptmann. A review of text and image retrieval approaches for broadcast news video. Inf. Retr., 10(4-5):445--484, Oct. 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. P. Zezula, G. Amato, V. Dohnal, and M. Batko. Similarity Search: The Metric Space Approach. Advances in Database Systems. Springer, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. An efficient access method for multimodal video retrieval

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      WebMedia '13: Proceedings of the 19th Brazilian symposium on Multimedia and the web
      November 2013
      360 pages
      ISBN:9781450325592
      DOI:10.1145/2526188

      Copyright © 2013 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 5 November 2013

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      WebMedia '13 Paper Acceptance Rate29of87submissions,33%Overall Acceptance Rate270of873submissions,31%
    • Article Metrics

      • Downloads (Last 12 months)0
      • Downloads (Last 6 weeks)0

      Other Metrics

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader