Abstract
This paper investigates automatic video temporal segmentation techniques, also named shot boundary detection (SBD) techniques. Firstly, the existing SBD algorithms are reviewed in detail. Then, a new SBD algorithm is proposed aiming to obtain fast and accurate detection, and its performances are evaluated and compared with existing works. This algorithm computes the frame difference/similarity by such simple features as pixel difference and histogram difference, adopts motion-based difference to resist camera or object movements in the same shot and uses the flash detection to avoid false positives caused by light changes or flashes. The adopted features are computational efficient, and the combination of various features improve the detection accuracy. These properties make the algorithm suitable for real-time applications, such as broadcasted news segmentation.
Similar content being viewed by others
References
Arman F, Hsu A, Chiu MY (1994) Image processing on encoded video sequences. Multimed Syst 1(5):211–219
Boreczky J (1996) Using video and audio data for content extraction and shot and scene boundary determination. Ph.D. dissertation, University of California Berkeley
Boreczky JS, Rowe LA (1996) Comparison of video shot boundary detection techniques. J Electron Imaging 5(2):122–128
Cernekova Z, Pitas I, Nikou C (2006) Information theory-based shot cut/fade detection and video summarization. IEEE Trans Circ Syst Video Tech 16(1):82–91
Divakaran A (2009) Multimedia content analysis: theory and applications. Springer, Boston
Dugad R, Ratakonda K, Ahuja N (1998) Robust video shot change detection. IEEE Workshop on Multimedia Signal Processing, December 1998
Ekin A, Tekalp AM, Mehrotra R (2003) Automatic soccer video analysis and summarization. IEEE Trans Image Process 12(7):796–807
Feng H, Fang W, Liu S, Fang Y (2005) A new general framework for shot boundary detection based on SVM. In Proc IEEE ICNN&B, IEEE Computer Society 2:1112–1117
Gao X, Tang X (2002) Unsupervised video-shot segmentation and model-free anchorperson detection for news video story parsing. IEEE Trans Circuits Syst Video Technol 12(9):765–776
Gong Y, Xu W (2007) Machine learning for multimedia content analysis. Springer, Secaucus
Hampapur A, Jain R, Weymouth T (1994) Digital video segmentation. Proceedings of the ACM Multimedia 94, San Francisco, CA, October, pp 357–364
Han B, Hu Y, Wang G, Wu W, Yoshigahara T (2007) Enhanced sports video shot boundary detection based on middle-level features and a unified model. IEEE Trans Consumer Electron 53(3):1168–1176
Joyce RA, Liu B (2006) Temporal segmentation of video using frame and histogram space. IEEE Trans Multimed 8(1):130–140
Kasturi R, Jain R (1991) Dynamic vision. In: Kasturi R, Jain R (eds) Computer vision: principles. IEEE Computer Society Press, Washington, DC
Kawai Y, Sumiyoshi H, Yagi N (2007) Shot boundary detection at TRECVID 2007. Proceedings of the TRECVID Workshop 2007. http://www-nlpir.nist.gov/projects/tvpubs/tv7.papers/nhk.pdf
Little TDC, Ahanger G, Folz RJ, Gibbon JF, Reeve FW, Schelleng DH, Venkatesh D (1993) A digital on-demand video service supporting content-based queries. In: Proceedings of the ACM Multimedia 93, Anaheim, CA, August, pp 427–436
Matsumoto K, Naito M, Hoashi K, Sugaya F (2006) SVM-based shot boundary detection with a novel feature. In Proceedings of the IEEE international conference multimedia and expo 2006, pp 1837–1840
Nagasaka A, Tanaka Y (1992) Automatic video indexing and full-video search for object appearances. In: Knuth E, Wegner L (eds) Visual database systems II. Elsevier, Amsterdam, pp 113–127
Nam J, Tewfik AH (2005) Detection of gradual transitions in video sequences using B-spline interpolation. IEEE Trans Multimed 7(4):667–679
Ngo CW (2003) A robust dissolve detector by support vector machine. In: Proceedings of the ACM international conference multimedia 2003, ACM, New York, pp 283–286
Shahraray B (1995) Scene change detection and content-based sampling of video sequences. In: Digital video compression: algorithms and technologies, In: Arturo R, Robert S, Edward D (eds) Proceedings of the SPIE 2419:2–13
Swanberg D, Shu CF, Jain R (1993) Knowledge-guided parsing and retrieval in video databases. In: Wayne N (ed) Storage and retrieval for image and video databases, Proceeding of the SPIE 1908:173–187
Truong BT, Dorai C, Venkatesh S (2000) New enhancements to cut, fade, and dissolve detection processes in video segmentation. In: Proceedings of the ACM multimedia 2000, ACM, New York, pp 219–227
Ueda H, Miyatake T, Yoshizawa S (1991) IMPACT: An interactive natural motion picture dedicated multimedia authoring system. In: Proceedings of CHI 1991 (New Orleans, Louisiana, April–May, 1991) ACM, New York, pp 343–350
Wang Y, Liu Z, Huang JC (2000) Multimedia content analysis: using both audio and visual clues. IEEE Signal Process Mag 17:12–36
Yuan J, Wang H, Xiao L, Zheng W, Li J, Lin F, Zhang B (2007) A formal study of shot boundary detection. IEEE Trans Circ Syst Video Technology 17(2):168–186
Yusoff Y, Christmas W, Kittler J (2000) Video shot cut detection using adaptive thresholding. Proceedings of the 11th British machine vision conference (BMVC2000)
Zabih R, Miller J, Mai, K (1993) A feature-based algorithm for detecting and classifying scene breaks. Proceedings of the ACM Multimedia 95, San Francisco, CA, November, 1993, pp 189–200
Zabih R, Miller J, Mai K (1999) A feature-based algorithm for detecting and classifying production effects. Multimed Syst 7(2):119–128
Zhang HJ, Kankanhalli A, Smoliar SW (1993) Automatic partitioning of full-motion video. Multimed Syst 1(1):10–28
Acknowledgments
This work was partially supported by the Invenio Project launched by France Telecom.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Lian, S. Automatic video temporal segmentation based on multiple features. Soft Comput 15, 469–482 (2011). https://doi.org/10.1007/s00500-009-0527-9
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00500-009-0527-9