Abstract
This paper presents a scheme that automatically extracts the hierarchical structure of MPEG video by detecting cuts and scenes, together with a hierarchical video browser built on that structure. The scheme proceeds in four steps: it performs fast, robust cut detection; chooses representative frames for each shot through clustering; uses fuzzy reasoning to calculate the connectivity between shots from the similarities of each shot and a fixed number of its neighbors; and extracts scenes by examining changes in connectivity. Overall, the scheme is robust to threshold settings and to missed and spurious cut detections. Applied to excerpts from movies, it achieves cut and scene detection rates above 95% and 75%, respectively. Given that the scheme is fast and requires no knowledge of specific kinds of video, these detection rates are quite satisfactory. The scheme is also applicable to other kinds of video, provided they share the property that “scene consists of a group of similar shots”.
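The connectivity-based scene extraction described in the abstract can be illustrated with a minimal sketch. It is not the authors' method: plain averaging of pairwise similarities stands in for the fuzzy reasoning, histogram intersection is an assumed similarity measure, and the function names, the `window` size and the `threshold` value are all hypothetical. Each shot is assumed to be already reduced to one normalized color histogram, for example that of its representative frame.

```python
from typing import List, Sequence


def histogram_intersection(a: Sequence[float], b: Sequence[float]) -> float:
    """Similarity in [0, 1] between two normalized histograms (assumed measure)."""
    return sum(min(x, y) for x, y in zip(a, b))


def boundary_connectivity(shots: List[Sequence[float]], window: int = 2) -> List[float]:
    """Connectivity at each boundary between shot i and shot i + 1.

    Averages the similarity of every (left, right) pair drawn from up to
    `window` shots on each side of the boundary. Plain averaging is used
    here in place of the fuzzy reasoning mentioned in the abstract.
    """
    conn = []
    for i in range(len(shots) - 1):
        left = shots[max(0, i - window + 1): i + 1]
        right = shots[i + 1: i + 1 + window]
        sims = [histogram_intersection(l, r) for l in left for r in right]
        conn.append(sum(sims) / len(sims))
    return conn


def scene_boundaries(conn: List[float], threshold: float = 0.5) -> List[int]:
    """Indices i where a scene break is placed between shot i and shot i + 1.

    A break is declared where connectivity is a local minimum and falls
    below `threshold` (a hypothetical value, not taken from the paper).
    """
    breaks = []
    for i, c in enumerate(conn):
        prev_c = conn[i - 1] if i > 0 else float("inf")
        next_c = conn[i + 1] if i + 1 < len(conn) else float("inf")
        if c < threshold and c <= prev_c and c <= next_c:
            breaks.append(i)
    return breaks


# Toy input: three shots dominated by the first color bin, then three
# dominated by the second. The values are invented for illustration.
shots = [[0.9, 0.1], [0.8, 0.2], [0.9, 0.1],
         [0.1, 0.9], [0.2, 0.8], [0.1, 0.9]]
print(scene_boundaries(boundary_connectivity(shots)))
```

Because each connectivity value pools several neighboring shots, a single missed or spurious cut shifts the curve only slightly, which is one plausible reading of the robustness claimed in the abstract.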
Cite this article
Zhao, J., Hayasaka, R., Muranoi, R. et al. A MPEG video structure analysis scheme and its application to hierarchical video browser. Telecommunication Systems 9, 403–422 (1998). https://doi.org/10.1023/A:1019116511362
DOI: https://doi.org/10.1023/A:1019116511362