Automatic personalized video abstraction for sports videos using metadata

Nitta, Naoko; Takahashi, Yoshimasa; Babaguchi, Noboru

doi:10.1007/s11042-008-0217-0

Automatic personalized video abstraction for sports videos using metadata

Published: 27 August 2008

Volume 41, pages 1–25, (2009)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

Naoko Nitta¹,
Yoshimasa Takahashi¹ &
Noboru Babaguchi¹

362 Accesses
Explore all metrics

Abstract

Video abstraction is defined as creating a video abstract which includes only important information in the original video streams. There are two general types of video abstracts, namely the dynamic and static ones. The dynamic video abstract is a 3-dimensional representation created by temporally arranging important scenes while the static video abstract is a 2-dimensional representation created by spatially arranging only keyframes of important scenes. In this paper, we propose a unified method of automatically creating these two types of video abstracts considering the semantic content targeting especially on broadcasted sports videos. For both types of video abstracts, the proposed method firstly determines the significance of scenes. A play scene, which corresponds to a play, is considered as a scene unit of sports videos, and the significance of every play scene is determined based on the play ranks, the time the play occurred, and the number of replays. This information is extracted from the metadata, which describes the semantic content of videos and enables us to consider not only the types of plays but also their influence on the game. In addition, user’s preferences are considered to personalize the video abstracts. For dynamic video abstracts, we propose three approaches for selecting the play scenes of the highest significance: the basic criterion, the greedy criterion, and the play-cut criterion. For static video abstracts, we also propose an effective display style where a user can easily access target scenes from a list of keyframes by tracing the tree structures of sports games. We experimentally verified the effectiveness of our method by comparing our results with man-made video abstracts as well as by conducting questionnaires.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Babaguchi N, Kawai Y, Ogura T, Kitahashi T (2004) Personalized abstraction of broadcasted american football video by highlight selection. IEEE Trans Multimedia 6(4):575–586 (August)
Article Google Scholar
Boreczky J, Girgensohn A, Golovchinsky G, Uchihashi S (2000) An interactive comic book presentation for exploring video. In: Proc. ACM CHI’00, pp 185–192, The Hague, 1–6 April 2000
Chang S-F, Sundaram H (2000) Structural and semantic analysis of video. In: Proc. IEEE ICME 2000, New York, 30 July–2 August 2000
Chiu P, Girgensohn A, Liu Q (2004) Stained-glass visualization for highly condensed video summaries. In: Proc. IEEE ICME 2004, Taipei, 27–30 June 2004
Ekin A, Tekalp AM, Mehrotra R (2003) Automatic soccer video analysis and summarization. IEEE Trans Image Process 12(7):796–807 (July)
Article Google Scholar
Ferman AM, Tekalp AM (2003) Two-stage hierarchical video summary extraction to match low-level user browsing preferences. IEEE Trans Multimed 5(2):244–256 (June)
Article Google Scholar
Hanjalic A (2003) Generic approach to highlights extraction from a sport video. In: Proc. IEEE ICIP 2003, vol 1. IEEE, Piscataway, pp 1–4 (September)
Google Scholar
He L, Sanocki E, Gupta A, Grudin J (1999) Auto-summarization of audio-video presentations. In: Proc. ACM Multimedia’99. ACM, New York, pp 489–498 (November)
Google Scholar
Jaimes A, Echigo T, Teraguchi M, Satoh F (2002) Learning personalized video highlights from detailed MPEG-7 metadata. In: Proc. IEEE ICIP 2002, I. IEEE, Piscataway, pp 133–136 (September)
Google Scholar
Komlodi A, Marchionini G (1998) Key frame preview techniques for video browsing. In: Proc. ACM international conference on digital libraries. ACM, New York, pp 118–125 (June)
Chapter Google Scholar
Lee J-H, Lee G-G, Kim W-Y (2003) Automatic video summarizing tool using MPEG-7 descriptors for personal video recorder. IEEE Trans Consum Electron 49(3):742–749 (August)
Article Google Scholar
Lienhart R (2000) Dynamic video summarization of home video. SPIE 3972:378–389 (January)
Article Google Scholar
Ma Y-F, Zhang H-J (2002) A model of motion attention for video skimming. In: Proc. ICIP, vol 1. IEEE, Piscataway, pp 129–132
Google Scholar
Martinez JM (2001) Overview of the MPEG-7 standard (version 6.0). ISO/IEC JTC1/SC29/WG11 N4509 (December)
Masumitsu K, Echigo T (2001) Meta-data framework for constructing individualized video digest. In: Proc. IEEE ICIP 2001, vol 3. IEEE, Piscataway, pp 390–393 (October)
Google Scholar
Nam J, Tewfik AT (1999) Dynamic video summarization and visualization. In: Proc. ACM international conference on Multimedia. ACM, New York, pp 53–56
Google Scholar
Ngo C-W, Ma Y-F, Zhang H-J (2005) Video summarization and scene detection by graph modeling. IEEE Trans Circuits Syst Video Technol 15(2):296–305 (February)
Article Google Scholar
Oh JH, Hua KA (2000) An efficient technique for summarizing videos using visual contents. In: Proc. IEEE ICME 2000. IEEE, Piscataway, pp 1167–1170 (July)
Google Scholar
Peker KA, Divakaran A (2004) Adaptive fast playback-based video skimming using a compressed-domain visual complexity measure. In: Proc. IEEE ICME 2004, Taipei, 27–30 June 2004
Rui Y, Gupta A, Acero A (2000) Automatically extracting highlights for TV baseball programs. In: Proc. ACM Multimedia 2000. ACM, New York, pp 105–115 (October)
Chapter Google Scholar
Smith MA, Kanade T (1997) Video skimming and characterization through the combination of image and language understanding techniques. In: Proc. IEEE CVPR’97. IEEE, Piscataway, pp 775–781 (June)
Google Scholar
Sundaram H, Xie L, Chang S-F (2002) A utility framework for the automatic generation of audio-visual skims. In: Proc. ACM international conference on Multimedia. ACM, New York, pp 189–198
Google Scholar
Takahashi Y, Nitta N, Babaguchi N (2004) Automatic video summarization of sports videos using metadata. In: Proc. IEEE PCM 2004, vol 2. IEEE, Piscataway, pp 272–280 (December)
Google Scholar
Takahashi Y, Nitta N, Babaguchi N (2005) Video summarization for large sports video archives. In: Proc. IEEE ICME 2005, Amsterdam, 6–8 July 2005
Tjondronegoro D, Chen Y-PP, Pham B (2004) Integrating highlights for more complete sports video summarization. IEEE Multimed 11(4):22–37 (October)
Article Google Scholar
Tse T, Marchionini G, Ding W, Slaughter L, Komlodi A (1998) Dynamic key frame presentation techniques for augmenting video browsing. In: Proc. advanced visual interfaces. ACM, New York, pp 185–194
Google Scholar
Tseng BL, Lin C-Y, Smith JR (2004a) Using MPEG-7 and MPEG-21 for personalizing video. IEEE Multimed 11(1):42–52 (January–March)
Article Google Scholar
Tseng BL, Lin C-Y, Smith JR (2004b) Video personalization and summarization system for usage environment. J Vis Commun Image Represent 15:370–392 (May)
Article Google Scholar
Uchihashi S, Foote J (1999) Summarizing video using a shot importance measure and a frame-packing algorithm. In: Proc. ICASSP’99, vol 6. IEEE, Piscataway, pp 3041–3044 (March)
Google Scholar
Uchihashi S, Foote J, Girgensohn A, Boreczky J (1999) Video manga: generating semantically meaningful video summaries. In: Proc. ACM Multimedia’99. ACM, New York, pp 383–392 (October)
Google Scholar
Xiong Z, Radhakrishnan R, Divakaran A, Huang TS (2005) Highlights extraction from sports video based on an audio-visual marker detection framework. In: Proc. IEEE ICME 2005, Amsterdam, 6–8 July 2005
Xu C, Wang J, Wan K, Li Y, Duan L (2006) Live sports event detection based on broadcast video and web-casting text. In: Proc. ACM Multimedia. ACM, New York, pp 221–230 (October)
Google Scholar
Xu H, Chua T-S (2006) Fusion of AV features and external information sources for event detection in team sports video. ACM Trans Multimed Comput Commun Appl 2(1):44–67 (February)
Article Google Scholar
Zhou W, Vellaikal A, Kuo CCJ (2000) Rule-based video classification system for basketball video indexing. In: Proc. ACM workshops on Multimedia. ACM, New York, pp 213–216 (November)
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

Graduate School of Engineering, Osaka University, 2-1 Yamada-oka, Suita, Osaka, 565-0871, Japan
Naoko Nitta, Yoshimasa Takahashi & Noboru Babaguchi

Authors

Naoko Nitta
View author publications
You can also search for this author inPubMed Google Scholar
Yoshimasa Takahashi
View author publications
You can also search for this author inPubMed Google Scholar
Noboru Babaguchi
View author publications
You can also search for this author inPubMed Google Scholar

Corresponding author

Correspondence to Naoko Nitta.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Nitta, N., Takahashi, Y. & Babaguchi, N. Automatic personalized video abstraction for sports videos using metadata. Multimed Tools Appl 41, 1–25 (2009). https://doi.org/10.1007/s11042-008-0217-0

Download citation

Published: 27 August 2008
Issue Date: January 2009
DOI: https://doi.org/10.1007/s11042-008-0217-0

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Automatic personalized video abstraction for sports videos using metadata

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

User-centred personalised video abstraction approach adopting SIFT features

A Domain Independent Approach to Video Summarization

Generating Summary Videos Based on Visual and Sound Information from Movies

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

Automatic personalized video abstraction for sports videos using metadata

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

User-centred personalised video abstraction approach adopting SIFT features

A Domain Independent Approach to Video Summarization

Generating Summary Videos Based on Visual and Sound Information from Movies

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now