Automatic video data structuring through shot partitioning and key-frame computing

Xiong, Wei; Lee, Chung-Mong; Ma, Rui-Hua

doi:10.1007/s001380050059

Published: June 1997

Volume 10, pages 51–65, (1997)

Machine Vision and Applications

Wei Xiong¹,
Chung-Mong Lee¹ &
Rui-Hua Ma¹

Abstract.

In video processing, a common first step is to segment a video into physical units, generally called shots. A shot is a video segment that consists of one continuous action. In general, these physical units need to be clustered to form more semantically significant units, such as scenes, sequences, and programs. This is the so-called story-based video structuring. Automatic video structuring is of great importance for video browsing and retrieval. Shots or scenes are usually described by one or several representative frames, called key-frames. Viewed from a higher level, the key-frames of some shots may be semantically redundant. In this paper, we propose automatic solutions to three problems: (i) video partitioning, (ii) key-frame computing, and (iii) key-frame pruning. For the first problem, an algorithm called “net comparison” is devised. It is accurate and fast because it uses both statistical and spatial information in an image and does not have to process the entire image. For the last two problems, we develop an original image similarity criterion, which considers both the spatial layout and the detail content of an image. For this purpose, coefficients of the wavelet decomposition are used to derive parameter vectors accounting for these two aspects. The parameters exhibit (quasi-)invariant properties, making the algorithm robust to many types of object and camera motion and to changes in scale. The novel “seek and spread” strategy used in key-frame computing gives each key-frame a large representative range. Inter-shot redundancy of the key-frames is suppressed using the same image similarity measure. Experimental results demonstrate the effectiveness and efficiency of our techniques.
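The abstract does not spell out the “net comparison” algorithm, so the following is only a minimal sketch of the general idea it names: comparing consecutive frames using statistics (window means) taken at a sparse set of spatial positions, so that the entire image is never processed. The function names, the window size, the grid step, and both thresholds are hypothetical choices made for illustration and should not be read as the authors' method.

```python
def window_means(frame, win=4, step=8):
    """Mean intensity of small win x win windows sampled on a sparse grid.

    `frame` is a 2D list of grayscale values. Only the sampled windows
    are read, so most pixels of the frame are never touched.
    """
    height, width = len(frame), len(frame[0])
    means = []
    for top in range(0, height - win + 1, step):
        for left in range(0, width - win + 1, step):
            total = sum(
                frame[y][x]
                for y in range(top, top + win)
                for x in range(left, left + win)
            )
            means.append(total / (win * win))
    return means


def changed_fraction(prev, curr, win=4, step=8, mean_tol=20.0):
    """Fraction of sampled windows whose mean changed by more than mean_tol."""
    a = window_means(prev, win, step)
    b = window_means(curr, win, step)
    changed = sum(1 for p, q in zip(a, b) if abs(p - q) > mean_tol)
    return changed / len(a)


def partition_shots(frames, cut_ratio=0.5, **kwargs):
    """Return each index i where a cut is declared between frames[i-1] and frames[i]."""
    return [
        i
        for i in range(1, len(frames))
        if changed_fraction(frames[i - 1], frames[i], **kwargs) > cut_ratio
    ]


def flat_frame(value, height=16, width=16):
    """Synthetic test frame filled with a single gray level."""
    return [[value] * width for _ in range(height)]


# Three dark frames followed by two bright frames: one abrupt cut.
frames = [flat_frame(10)] * 3 + [flat_frame(200)] * 2
boundaries = partition_shots(frames)
print(boundaries)  # prints [3]
```

A per-window tolerance followed by a ratio test is one simple way to combine the two kinds of information: the window mean is the statistical part, and counting how many grid positions changed is the spatial part. Gradual transitions such as fades or dissolves would need additional handling that this sketch does not attempt.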

Author information

Authors and Affiliations

Department of Computer Science, Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong
Wei Xiong, Chung-Mong Lee & Rui-Hua Ma

About this article

Cite this article

Xiong, W., Lee, CM. & Ma, RH. Automatic video data structuring through shot partitioning and key-frame computing. Machine Vision and Applications 10, 51–65 (1997). https://doi.org/10.1007/s001380050059

Issue Date: June 1997
DOI: https://doi.org/10.1007/s001380050059

Key words: Automatic video data structuring – Video partitioning – Image similarity measure – Wavelet – Invariant parameters – Key-frame computing – Key-frame pruning
