ABSTRACT
Efficient and effective handling of video documents depends on the availability of indexes. Manual indexing is unfeasible for large video collections. Video combines different types of data from different modalities. Using information from multiple modalities may result in a more robust and accurate video retrieval. Therefore, effective indexing for video retrieval requires a multimodal approach in which either the most appropriate modality is selected or the different modalities are used in collaborative fashion. This paper presents a new metric access method -- Slim2-tree -- which combines information from multiple modalities within a single index structure for video retrieval. Experimental studies on a large real dataset show the video similarity search performance of the proposed technique. Additionally, we present experiments comparing our method against state-of-the-art of multimodal solutions. Comparative test results demonstrate that our technique improves the performance of video similarity queries.
- J. Almeida, E. Valle, R. S. Torres, and N. J. Leite. DAHC-tree: An effective index for approximate search in high-dimensional metric spaces. JIDM, 1(3):375--390, 2010.Google Scholar
- P. K. Atrey, M. A. Hossain, A. El Saddik, and M. S. Kankanhalli. Multimodal fusion for multimedia analysis: a survey. Multimedia Systems, 16(6):345--379, Apr. 2010.Google ScholarDigital Library
- P. H. Bugatti. Analise da in uencia de funcoes de distancia para o processamento de consultas por similaridade em recuperacao de imagens por conteudo. Master's thesis, Universidade de Sao Paulo, 2008.Google Scholar
- B. Bustos, S. Kreft, and T. Skopal. Adapting metric indexes for searching in multi-metric spaces. Multimedia Tools Appl., 58(3):467--496, June 2012. Google ScholarDigital Library
- E. Chavez, G. Navarro, R. Baeza-Yates, and J. L. Marroquín. Searching in metric spaces. ACM Comput. Surv., 33(3):273--321, Sept. 2001. Google ScholarDigital Library
- P. Ciaccia and M. Patella. The M2-tree: Processing complex multi-feature queries with just one index. In 1st DELOS Workshop: ISSQDL, 2000.Google Scholar
- P. Ciaccia, M. Patella, and P. Zezula. M-tree: An efficient access method for similarity search in metric spaces. In Proc. 23rd VLDB'97, pages 426--435, 1997. Google ScholarDigital Library
- D. B. Coimbra and R. Goularte. Segmentação multimodal de cenas em telejornais. In XVII Webmedia, pages 229--236, 2011.Google Scholar
- M. Doller, F. Stegmaier, S. Jans, and H. Kosch. TempoM2: A multi feature index structure for temporal video search. In AMM, volume 7131, pages 323--333. Springer, 2012. Google ScholarDigital Library
- V. Gaede and O. Gunther. Multidimensional access methods. ACM Comput. Surv., 30(2):170--231, 1998. Google ScholarDigital Library
- T. Ganchev, N. Fakotakis, and G. Kokkinakis. Comparative evaluation of various MFCC implementations on the speaker verification task. In Proc. SPECOM, pages 191--194, 2005.Google Scholar
- S.-T. Goh and K.-L. Tan. MOSAIC: A fast multi-feature image retrieval system. Data & Knowledge Engineering, 33(3):219 -- 239, 2000. Google ScholarDigital Library
- Y. He and J. Yu. MFI-tree: An effective multi-feature index structure for weighted query application. Comput. Sci. Inf. Syst., 7(1):139--152, 2010.Google ScholarCross Ref
- A. Oliva and A. Torralba. Modeling the shape of the scene: A holistic representation of the spatial envelope. IJCV, 42(3):145--175, 2001. Google ScholarDigital Library
- J. Shao, H. T. Shen, and X. Zhou. Challenges and Techniques for Effective and Efficient Similarity Search in Large Video Databases. PLDV, 1(2):1598--1603, 2008. Google ScholarDigital Library
- C. Traina, A. Traina, B. Seeger, and C. Faloutsos. Slim-trees: High performance metric trees minimizing overlap between nodes. In 7th EDBT, pages 51--65, 2000. Google ScholarDigital Library
- T. G. Vespa. Operacao de carga-rapida (bulk-loading) em metodos de acesso metricos. Master's thesis, Universidade de S~ao Paulo, Sao Carlos, 2007.Google Scholar
- R. Yan and A. G. Hauptmann. A review of text and image retrieval approaches for broadcast news video. Inf. Retr., 10(4-5):445--484, Oct. 2007. Google ScholarDigital Library
- P. Zezula, G. Amato, V. Dohnal, and M. Batko. Similarity Search: The Metric Space Approach. Advances in Database Systems. Springer, 2010. Google ScholarDigital Library
Index Terms
- An efficient access method for multimodal video retrieval
Recommendations
An efficient access method for multimodal video retrieval
This paper presents the Slim 2 -tree, an efficient and effective content-based video retrieval technique allowing the use of multiple modalities within a single index structure. Slim 2 -tree is capable of dealing with different distance measures for the ...
On the significance of cluster-temporal browsing for generic video retrieval: a statistical analysis
MM '06: Proceedings of the 14th ACM international conference on MultimediaIn this paper, we test statistically the effect of content-based browsing in generic video retrieval. Using TRECVID 2004 and 2005 experiments, we demonstrate that content-based browsing improves retrieval over sequential queries and relevance feedback. ...
Efficient content-based video retrieval by mining temporal patterns
MDM '08: Proceedings of the 9th International Workshop on Multimedia Data Mining: held in conjunction with the ACM SIGKDD 2008In recent years, multimedia content processing has become a hot topic with the rapid development of information technology and popularity of World Wide Web. Among the emerging research topics, content-based video retrieval is an attractive and ...
Comments