Abstract
This paper presents a graph-based multilevel temporal video segmentation method. In each level of the segmentation, a weighted undirected graph structure is implemented. The graph is partitioned into clusters which represent the segments of a video. Three low-level features are used in the calculation of temporal segments’ similarities: visual content, motion content and shot duration. Our strength factor approach contributes to the results by improving the efficiency of the proposed method. Experiments show that the proposed video scene detection method gives promising results in order to organize videos without human intervention.
Similar content being viewed by others
References
Adams, B., Dorai, C., Venkatesh, S.: Novel approach to determining tempo and dramatic story sections in motion pictures. In: Proceedings of IEEE International Conference on Image Processing, vol. 2, pp. 283–286, 10–13 Sept, 2000
Ding, C.H.Q., He, X., Zha, H., Gu, M., Simon, H.D.: A min–max cut algorithm for graph partitioning and data clustering. In: Proceedings of IEEE International Conference on Data Mining, pp. 107–114, 29 Nov–2 Dec 2001
Gargi U., Kasturi R., Strayer S.H.: Performance characterization of video shot change detection methods. IEEE Trans. Circuits Syst. Video Technol. 10, 1–13 (2000)
Gong Y.: Summarizing audiovisual contents of a video program. EURASIP J. Appl. Signal Process. 2003, 160–169 (2003)
Gu, Z., Mei, T., Hua, X.-S., Wu, X., Li, S.: EMS: Energy minimization based video scene segmentation. In: Proceedings of IEEE International Conference on Multimedia and Expo, pp. 520–523, 2–5 July, 2007
Hanjalic A., Lagendijk R.L., Biemond J.: Automated high-level movie segmentation for advanced video-retrieval systems. IEEE Trans. Circuits Syst. Video Technol. 9, 580–588 (1999)
Hanjalic A.: Shot boundary detection: Unraveled and resolved?. IEEE Trans. Circuits Syst. Video Technol. 12, 90–105 (2002)
Hanjalic A.: Towards theoretical performance limits of video parsing. IEEE Trans. Circuits Syst. Video Technol. 17, 261–272 (2007)
Kender, J.R., Yeo, B.-L.: Video scene segmentation via continuous video coherence. In: Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 367–373, 23–25 June, 1998
Koprinska I., Carrato S.: Temporal video segmentation: A survey. Signal Process. Image Commun. 16, 477–500 (2001)
Lu, S., King, I., Lyu, M.R.: Novel video summarization framework for document preparation and archival applications. In: Proceedings of IEEE Aerospace Conference, pp. 1–10, 5–12 Mar 2005
Ngo C.W., Ma Y.F., Zhang H.J.: Video summarization and scene detection by graph modeling. IEEE Trans. Circuits Syst. Video Technol. 15, 296–305 (2005)
Odobez J.-M., Gatica-Perez D., Guillemot M.: Spectral structuring of home videos. In: Bakker, E.M., Lew, M.S., Huang, T.S., Sebe, N., Zhou, X.S. (eds) CIVR 2003. LNCS, vol. 2728, pp. 310–320. Springer, Heidelberg (2003)
Pavan M., Pelillo M.: Dominant sets and pairwise clustering. IEEE Trans. Pattern Anal. Mach. Intell. 29, 167–172 (2007)
Peng Y., Ngo C.W.: Clip-based similarity measure for query-dependent clip retrieval and video summarization. IEEE Trans. Circuits Syst. Video Technol. 16, 612–627 (2006)
Perona P., Freeman W.: A factorization approach to grouping. In: Burkhardt, H., Neumann, B. (eds) ECCV”98. LNCS, vol. 1406, pp. 655–670. Springer, Heidelberg (1998)
Petersohn, C.: Fraunhofer HHI at TRECVID 2004: Shot boundary detection system. TREC Video Retrieval Evaluation Online Proceedings. TRECVID 2004. http://www-nlpir.nist.gov/projects/tvpubs/tvpapers04/fraunhofer.pdf. Accessed 27 Apr 2006
Radhakrishnan, R., Divakaran, A., Xiong, Z., Otsuka, I.: A content-adaptive analysis and representation framework for audio event discovery from unscripted multimedia. EURASIP J. Appl. Signal Process. (2006). doi:10.1155/ASP/2006/89013
Rasheed Z., Shah M.: Detection and representation of scenes in videos. IEEE Trans. Multimedia 7, 1097–1105 (2005)
Rui Y., Huang T.S., Mehrotra S.: Constructing table of content for videos. Multimedia Syst. 7, 359–368 (1999)
Sakarya U., Telatar Z.: Graph-based multilevel temporal segmentation of scripted content videos. In: Escolano, F., Vento, M. (eds) GbRPR 2007. LNCS, vol. 4538, pp. 168–179. Springer, Heidelberg (2007)
Sakarya, U., Telatar, Z.: Graph partition based scene boundary detection. In: Petrou, M., Saramäki, T., Erçil, A., Lončarić, S. (eds.) Proceedings of the 5th International Symposium on Image and Signal Processing and Analysis (ISPA 2007), Istanbul, Turkey, pp. 544–549, 27–29 Sept 2007
Shi J., Malik J.: Normalized cuts and image segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 22, 888–905 (2000)
Smeaton, A.F., Over, P., Kraaij, W.: Evaluation campaigns and TRECVid. In: Proceedings of the 8th ACM International Workshop on Multimedia Information Retrieval (Santa Barbara, CA, USA, 26–27 Oct, 2006). MIR’06, pp. 321–330. ACM Press, New York (2006)
Song B.C., Ra J.B.: Automatic shot change detection algorithm using multi-stage clustering for MPEG-compressed videos. J. Vis. Commun. Image Represent. 12, 364–385 (2001)
Sundaram H., Chang S.F.: Computable scenes and structures in films. IEEE Trans. Multimedia 4, 482–491 (2002)
Tavanapong W., Zhou J.: Shot clustering techniques for story browsing. IEEE Trans. Multimedia 6, 517–527 (2004)
The Internet Movie Database. http://www.imdb.com/. Accessed 21 Jan 2008
Uchihashi, S., Foote, J.: Summarizing video using a shot importance measure and a frame-packing algorithm. In: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1999, vol. 6, pp. 3041–3044 (1999)
Vasconcelos N., Lippman A.: Statistical models of video structure for content analysis and characterization. IEEE Trans. Image Process. 9, 3–19 (2000)
Wang S., Siskind J.M.: Image segmentation with ratio cut. IEEE Trans. Pattern Anal. Mach. Intell. 25, 675–690 (2003)
Yaşaroğlu Y., Alatan A.A.: Summarizing video: Content, features, and HMM topologies. In: García, N., Martínez, J.M., Salgado, L. (eds) VLBV 2003. LNCS, vol. 2849, pp. 101–110. Springer, Heidelberg (2003)
Yeung M., Yeo B.L., Liu B.: Segmentation of video by clustering and graph analysis. Comput. Vis. Image Underst. 71, 94–109 (1998)
Yuan J., Wang H., Xiao L., Zheng W., Li J., Lin F., Zhang B.: A formal study of shot boundary detection. IEEE Trans. Circuits Syst. Video Technol. 17, 168–186 (2007)
Zhai Y., Shah M.: Video scene segmentation using Markov chain Monte Carlo. IEEE Trans. Multimedia 8, 686–697 (2006)
Zhao, Y., Wang, T., Wang, P., Hu, W., Du, Y., Zhang, Y., Xu, G.: Scene segmentation and categorization using Ncuts. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–7, 17–22 June, 2007
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by Changsheng Xu.
Rights and permissions
About this article
Cite this article
Sakarya, U., Telatar, Z. Graph-based multilevel temporal video segmentation. Multimedia Systems 14, 277–290 (2008). https://doi.org/10.1007/s00530-008-0145-x
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00530-008-0145-x