Abstract
Query by video clip (QVC) has attracted wide research interests in multimedia information retrieval. In general, QVC may include feature extraction, similarity measure, database organization, and search or query scheme. Towards an effective and efficient solution, diverse applications have different considerations and challenges on the abovementioned phases. In this paper, we firstly attempt to broadly categorize most existing QVC work into 3 levels: concept based video retrieval, video title identification, and video copy detection. This 3-level categorization is expected to explicitly identify typical applications, robust requirements, likely features, and main challenges existing between mature techniques and hard performance requirements. A brief survey is presented to concretize the QVC categorization. Under this categorization, in this paper we focus on the copy detection task, wherein the challenges are mainly due to the design of compact and robust low level features (i.e. an effective signature) and a kind of fast searching mechanism. In order to effectively and robustly characterize the video segments of variable lengths, we design a novel global visual feature (a fixed-size 144-d signature) combining the spatial-temporal and the color range information. Different from previous key frame based shot representation, the ambiguity of key frame selection and the difficulty of detecting gradual shot transition could be avoided. Experiments have shown the signature is also insensitive to color shifting and variations from video compression. As our feature can be extracted directly from MPEG compressed domain, lower computational cost is required. In terms of fast searching, we employ the active search algorithm. Combining the proposed signature and the active search, we have achieved an efficient and robust solution for video copy detection. For example, we can search for a short video clip among the 10.5 hours MPEG-1 video database in merely 2 seconds in the case of unknown query length, and in 0.011 second when fixing the query length as 10 seconds.
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
http://www-nlpir.nist.gov/projects/trecvid/ Web site (2004)
Sebe, N., et al.: The state of the art in image and video retrieval. In: Proc. of CIVR 2003 (2003)
Jain, A.K., et al.: Query by video clip. Multimedia System 7, 369–384 (1999)
DeMenthon, D., et al.: Video retrieval using spatio-temporal descriptors. In: Proc. of ACM Multimedia 2003, pp. 508–517 (2003)
Cho, C.-Y., et al.: Efficient motion-vector-based video search using query by clip. In: Proc. of ICME 2004, Taiwan (2004)
Duan, L.-Y., et al.: A unified framework for semantic shot classification in sports video. In: IEEE Transaction on Multimedia (2004) (to appear)
Duan, L.-Y., et al.: Mean shift based video segment representation and applications to replay detection. In: Proc. of ICASSP 2004, pp. 709–712 (2004)
Duan, L.-Y., et al.: A Mid-level Representation Framework for Semantic Sports Video Analysis. In: Proc. of ACM Multimedia 2003, pp. 33–44 (2003)
Zhang, D.-Q., et al.: Detection image near-duplicate by stochastic attribute relational graph matching with learning. In: Proc. of ACM Multimedia 2004, NewYork (2004)
Jaimes, A., Chang, S.-F., Loui, A.C.: Detection of non-identical duplicate consumer photographs. In: Proc. of PCM 2003, Singapore (2003)
Cheung, S., Zakhor, A.: Efficient video similarity measurement with video signature. IEEE Trans. on Circuits and System for Video Technology 13, 59–74 (2003)
Cheung, S.-C., Zakhor, A.: Fast similarity search and clustering of video sequences on the world-wide-web. In: IEEE Transactions on Multimedia (2004) (to appear)
Chen, L., Chua, T.S.: A match and tiling approach to content-based video retrieval. In: Proc. of ICME 2001, pp. 301–304 (2001)
Kulesh, V., et al.: Video clip recognition using joint audio-visual processing model. In: Proc. of ICPR 2002, vol. 1, pp. 500–503 (2002)
Naphade, M.R., et al.: A Novel Scheme for Fast and Efficient Video Sequence Matching Using Compact Signatures. In: Proc. SPIE, Storage and Retrieval for Media Databases 2000, vol. 3972, pp. 564–572 (2000)
Hampapur, A., Hyun, K., Bolle, R.: Comparison of Sequence Matching Techniques for Video Copy Detection. In: SPIE. Storage and Retrieval for Media Databases 2002, San Jose, CA, USA, vol. 4676, pp. 194–201 (January 2002)
Kashino, K., et al.: A Quick Search Method forAudio andVideo Signals Based on Histogram Pruning. IEEE Trans. on Multimedia 5(3), 348–357 (2003)
Kashino, K., et al.: Aquick video search method based on local and global feature clustering. In: Proc. of ICPR 2004, Cambridge, UK (August 2004)
Ferman, A.M., et al.: Robust color histogram descriptors for video segment retrieval and identification. IEEE Trans. on Image Processing 1(5) (May 2002)
Joly, A., Frelicot, C., Buisson, O.: Robust content-based video copy identification in a large reference database. In: Bakker, E.M., Lew, M., Huang, T.S., Sebe, N., Zhou, X.S. (eds.) CIVR 2003. LNCS, vol. 2728, pp. 414–424. Springer, Heidelberg (2003)
Pua, K.M., et al.: Real time repeated video sequence identification. Journal of Computer Vision and Image Understanding 93, 310–327 (2004)
Hoad, T.C., et al.: Fast video matching with signature alignment. In: SIGIR Multimedia Information Retrieval Workshop 2003 (MIR 2003), Toronto, pp. 263–269 (2003)
Kasutani, E., et al.: An adaptive feature comparison method for real-time video identification. In: Proc. of ICIP 2003 (2003)
Diakopoulos, N., et al.: Temporally Tolerant Video Matching. In: SIGIR Multimedia Information Retrieval Workshop 2003 (MIR 2003), Toronto, Canada (August 2003)
Yuan, J., et al.: Fast and Robust Short Video Clip Search Using an Index Structure. In: ACM Multimedia Workshop on Multimedia Information Retrieval, MIR 2004 (2004)
Yuan, J., et al.: Fast and Robust Search Method for Short Video Clips from Large Video Collection. In: Proc. of ICPR 2004, Cambridge, UK (August 2004)
Kim, S.H., Park, R.-H.: An efficient algorithm for video sequence matching using the modified Hausdorff distance and the directed divergence. IEEE Trans. On Circuits and Systems for Video Technology 12, 592–596 (2002)
Lienhart, R., et al.: VisualGREP: A Systematic method to compare and retrieve video sequences. In: SPIE. storage and Retrieval fro Image and Video Database VI, vol. 3312 (1998)
Oostveen, J., et al.: Feature extraction and a database strategy for video fingerprinting. In: Chang, S.-K., Chen, Z., Lee, S.-Y. (eds.) VISUAL 2002. LNCS, vol. 2314, pp. 117–128. Springer, Heidelberg (2002)
Fan, J., et al.: Classview: hierarchical video shot classification, indexing and accessing. IEEE Trans. on Multimedia 6(1) (February 2004)
Hoi, C.-H., et al.: A novel scheme for video similarity detection. In: Bakker, E.M., Lew, M., Huang, T.S., Sebe, N., Zhou, X.S. (eds.) CIVR 2003. LNCS, vol. 2728, pp. 373–382. Springer, Heidelberg (2003)
Kimura, A., et al.: A Quick Search Method for Multimedia Signals Using Feature Compression Based on Piecewise Linear Maps. In: Proc. of ICASSP 2002 (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Yuan, J., Duan, LY., Tian, Q., Ranganath, S., Xu, C. (2004). Fast and Robust Short Video Clip Search for Copy Detection. In: Aizawa, K., Nakamura, Y., Satoh, S. (eds) Advances in Multimedia Information Processing - PCM 2004. PCM 2004. Lecture Notes in Computer Science, vol 3332. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30542-2_59
Download citation
DOI: https://doi.org/10.1007/978-3-540-30542-2_59
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23977-2
Online ISBN: 978-3-540-30542-2
eBook Packages: Computer ScienceComputer Science (R0)