Abstract
Video abstraction is defined as creating a video abstract which includes only important information in the original video streams. There are two general types of video abstracts, namely the dynamic and static ones. The dynamic video abstract is a 3-dimensional representation created by temporally arranging important scenes while the static video abstract is a 2-dimensional representation created by spatially arranging only keyframes of important scenes. In this paper, we propose a unified method of automatically creating these two types of video abstracts considering the semantic content targeting especially on broadcasted sports videos. For both types of video abstracts, the proposed method firstly determines the significance of scenes. A play scene, which corresponds to a play, is considered as a scene unit of sports videos, and the significance of every play scene is determined based on the play ranks, the time the play occurred, and the number of replays. This information is extracted from the metadata, which describes the semantic content of videos and enables us to consider not only the types of plays but also their influence on the game. In addition, user’s preferences are considered to personalize the video abstracts. For dynamic video abstracts, we propose three approaches for selecting the play scenes of the highest significance: the basic criterion, the greedy criterion, and the play-cut criterion. For static video abstracts, we also propose an effective display style where a user can easily access target scenes from a list of keyframes by tracing the tree structures of sports games. We experimentally verified the effectiveness of our method by comparing our results with man-made video abstracts as well as by conducting questionnaires.













Similar content being viewed by others
References
Babaguchi N, Kawai Y, Ogura T, Kitahashi T (2004) Personalized abstraction of broadcasted american football video by highlight selection. IEEE Trans Multimedia 6(4):575–586 (August)
Boreczky J, Girgensohn A, Golovchinsky G, Uchihashi S (2000) An interactive comic book presentation for exploring video. In: Proc. ACM CHI’00, pp 185–192, The Hague, 1–6 April 2000
Chang S-F, Sundaram H (2000) Structural and semantic analysis of video. In: Proc. IEEE ICME 2000, New York, 30 July–2 August 2000
Chiu P, Girgensohn A, Liu Q (2004) Stained-glass visualization for highly condensed video summaries. In: Proc. IEEE ICME 2004, Taipei, 27–30 June 2004
Ekin A, Tekalp AM, Mehrotra R (2003) Automatic soccer video analysis and summarization. IEEE Trans Image Process 12(7):796–807 (July)
Ferman AM, Tekalp AM (2003) Two-stage hierarchical video summary extraction to match low-level user browsing preferences. IEEE Trans Multimed 5(2):244–256 (June)
Hanjalic A (2003) Generic approach to highlights extraction from a sport video. In: Proc. IEEE ICIP 2003, vol 1. IEEE, Piscataway, pp 1–4 (September)
He L, Sanocki E, Gupta A, Grudin J (1999) Auto-summarization of audio-video presentations. In: Proc. ACM Multimedia’99. ACM, New York, pp 489–498 (November)
Jaimes A, Echigo T, Teraguchi M, Satoh F (2002) Learning personalized video highlights from detailed MPEG-7 metadata. In: Proc. IEEE ICIP 2002, I. IEEE, Piscataway, pp 133–136 (September)
Komlodi A, Marchionini G (1998) Key frame preview techniques for video browsing. In: Proc. ACM international conference on digital libraries. ACM, New York, pp 118–125 (June)
Lee J-H, Lee G-G, Kim W-Y (2003) Automatic video summarizing tool using MPEG-7 descriptors for personal video recorder. IEEE Trans Consum Electron 49(3):742–749 (August)
Lienhart R (2000) Dynamic video summarization of home video. SPIE 3972:378–389 (January)
Ma Y-F, Zhang H-J (2002) A model of motion attention for video skimming. In: Proc. ICIP, vol 1. IEEE, Piscataway, pp 129–132
Martinez JM (2001) Overview of the MPEG-7 standard (version 6.0). ISO/IEC JTC1/SC29/WG11 N4509 (December)
Masumitsu K, Echigo T (2001) Meta-data framework for constructing individualized video digest. In: Proc. IEEE ICIP 2001, vol 3. IEEE, Piscataway, pp 390–393 (October)
Nam J, Tewfik AT (1999) Dynamic video summarization and visualization. In: Proc. ACM international conference on Multimedia. ACM, New York, pp 53–56
Ngo C-W, Ma Y-F, Zhang H-J (2005) Video summarization and scene detection by graph modeling. IEEE Trans Circuits Syst Video Technol 15(2):296–305 (February)
Oh JH, Hua KA (2000) An efficient technique for summarizing videos using visual contents. In: Proc. IEEE ICME 2000. IEEE, Piscataway, pp 1167–1170 (July)
Peker KA, Divakaran A (2004) Adaptive fast playback-based video skimming using a compressed-domain visual complexity measure. In: Proc. IEEE ICME 2004, Taipei, 27–30 June 2004
Rui Y, Gupta A, Acero A (2000) Automatically extracting highlights for TV baseball programs. In: Proc. ACM Multimedia 2000. ACM, New York, pp 105–115 (October)
Smith MA, Kanade T (1997) Video skimming and characterization through the combination of image and language understanding techniques. In: Proc. IEEE CVPR’97. IEEE, Piscataway, pp 775–781 (June)
Sundaram H, Xie L, Chang S-F (2002) A utility framework for the automatic generation of audio-visual skims. In: Proc. ACM international conference on Multimedia. ACM, New York, pp 189–198
Takahashi Y, Nitta N, Babaguchi N (2004) Automatic video summarization of sports videos using metadata. In: Proc. IEEE PCM 2004, vol 2. IEEE, Piscataway, pp 272–280 (December)
Takahashi Y, Nitta N, Babaguchi N (2005) Video summarization for large sports video archives. In: Proc. IEEE ICME 2005, Amsterdam, 6–8 July 2005
Tjondronegoro D, Chen Y-PP, Pham B (2004) Integrating highlights for more complete sports video summarization. IEEE Multimed 11(4):22–37 (October)
Tse T, Marchionini G, Ding W, Slaughter L, Komlodi A (1998) Dynamic key frame presentation techniques for augmenting video browsing. In: Proc. advanced visual interfaces. ACM, New York, pp 185–194
Tseng BL, Lin C-Y, Smith JR (2004a) Using MPEG-7 and MPEG-21 for personalizing video. IEEE Multimed 11(1):42–52 (January–March)
Tseng BL, Lin C-Y, Smith JR (2004b) Video personalization and summarization system for usage environment. J Vis Commun Image Represent 15:370–392 (May)
Uchihashi S, Foote J (1999) Summarizing video using a shot importance measure and a frame-packing algorithm. In: Proc. ICASSP’99, vol 6. IEEE, Piscataway, pp 3041–3044 (March)
Uchihashi S, Foote J, Girgensohn A, Boreczky J (1999) Video manga: generating semantically meaningful video summaries. In: Proc. ACM Multimedia’99. ACM, New York, pp 383–392 (October)
Xiong Z, Radhakrishnan R, Divakaran A, Huang TS (2005) Highlights extraction from sports video based on an audio-visual marker detection framework. In: Proc. IEEE ICME 2005, Amsterdam, 6–8 July 2005
Xu C, Wang J, Wan K, Li Y, Duan L (2006) Live sports event detection based on broadcast video and web-casting text. In: Proc. ACM Multimedia. ACM, New York, pp 221–230 (October)
Xu H, Chua T-S (2006) Fusion of AV features and external information sources for event detection in team sports video. ACM Trans Multimed Comput Commun Appl 2(1):44–67 (February)
Zhou W, Vellaikal A, Kuo CCJ (2000) Rule-based video classification system for basketball video indexing. In: Proc. ACM workshops on Multimedia. ACM, New York, pp 213–216 (November)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Nitta, N., Takahashi, Y. & Babaguchi, N. Automatic personalized video abstraction for sports videos using metadata. Multimed Tools Appl 41, 1–25 (2009). https://doi.org/10.1007/s11042-008-0217-0
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-008-0217-0