Abstract
Automatic annotation of semantic events allows effective retrieval of video content. In this work, we present solutions for highlights detection in sports videos. This application is particularly interesting for broadcasters, since they extensively use manual annotation to select interesting highlights that are edited to create new programmes. The proposed approach exploits the typical structure of a wide class of sports videos, namely, those related to sports which are played in delimited venues with playfields of well known geometry, like soccer, basketball, swimming, track and field disciplines, and so on. For this class of sports, a modeling scheme based on a limited set of visual cues and on finite state machines (FSM) that encode the temporal evolution of highlights is presented. Algorithms for model checking and for visual cues estimation are discussed, as well as applications of the representation to different sport domains.
Similar content being viewed by others
References
Assfalg J, Bertini M, Del Bimbo A, Nunziati W, Pala P (2002) Soccer highlights detection and recognition using HMMs. In: Proceedings of the IEEE international conference on multimedia and expo (ICME 2002), Lausanne, Switzerland, August 2002
Baldi G, Colombo C, Del Bimbo A (1999) A compact and retrieval-oriented video representation using mosaics. In: Proceedings of the 3rd international conference on visual information systems (VISUAL’99), Amsterdam, The Netherlands, June 1999, pp 171–178
Bengio Y (1998) Markovian models for sequential data. Neural Comp Surv 2:129–162
Bertini M, Del Bimbo A, Pala P (2001) Content-based indexing and retrieval of TV news. Pattern Recogn Lett 22(5):503–516
Brand M, Oliver N, Pentland A (1997) Coupled hidden Markov models for complex action recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR’97), San Juan, Puerto Rico, June 1997
Hampapur A (1999) Semantic video indexing: approach and issues. SIGMOD Record 28(1):32–39
Hartley R, Zisserman A (2000) Multiple view geometry in computer vision. Cambridge University Press, Cambridge
Ekin A, Murat Tekalp A, Mehrotra R (2003) Automatic soccer video analysis and summarization. IEEE Trans Image Process 12(7):796–807
Intille SS, Bobick AF (2001) Recognizing planned, multi-person action. Comput Vis Image Und 81(3):414–445
Jordan MI (1999) Learning in graphical models. MIT Press, Cambridge, Massachusetts
Kittler JV, Messer K, Christmas W, Levienaise-Obadia B, Koubaroulis D (2001) Generation of semantic cues for sports video annotation. In: Proceedings of the IEEE international conference on image processing (ICIP 2001), Thessaloniki, Greece, October 2001, vol 3, pp 26–29
Leonardi R, Migliorati P (2002) Semantic indexing of multimedia documents. IEEE Multimedia 9(2):44–51
Mottaleb M, Ravitz G (2003) Detection of plays and breaks in football games using audiovisual features and HMM. In: Proceedings of the 9th international conference on distributed multimedia systems (DMS 2003), Miami, Florida, September 2003, pp 154–160
Pavlovic V, Sharma R, Huang T (1997) Visual interpretation of hand gestures for human–computer interaction: a review. IEEE Trans Pattern Anal Mach Intell 19(7):677–695
Rabiner LR (1989) A tutorial on HMM and selected applications in speech recognition. Proc IEEE 77(2):257–286
Russell S, Norvig P (1995) Artificial intelligence: a modern approach. Prentice Hall, Englewood Cliffs, New Jersey
Sudhir G, Lee JCM, Jain AK (1998) Automatic classification of tennis video for high-level content-based retrieval. In: Proceedings of the international workshop on content-based access of image and video databases (CAIVD’98), Bombay, India, January 1998, pp 81–90
Tovinkere V, Qian RJ (2001) Detecting semantic events in soccer games: towards a complete solution. In: Proceedings of the international conference on multimedia and expo (ICME 2001), Tokyo, Japan, August 2001, pp 1040–1043
Xie L, Xu P, Chang S-F, Divakaran A, Sun H (2002) Structure analysis of soccer video with domain knowledge and hidden Markov models. In: Proceedings of the IEEE international conference on acoustics, speech, and signal processing (ICASSP 2002), Orlando, Florida, May 2002, pp 4096–4099
Xiong Z, Radhakrishnan R, Divakaran A, Huang TS (2003) Audio events detection based highlights extraction from baseball, golf and soccer games in a unified framework. In: Proceedings of the IEEE international conference on multimedia and expo (ICME 2003), Baltimore, Maryland, July 2003, pp 401–404
Yu X, Xu C, Leong HW, Tian Q, Tang Q, Wan KW (2003) Trajectory-based ball detection and tracking with applications to semantic analysis of broadcast soccer video. In: Proceedings of the 11th ACM international conference on multimedia (MM 2003), Berkeley, California, November 2003
Zhou W, Vellaikal A, Kuo CCJ (2000) Rule-based video classification system for basketball video indexing. In: Proceedings of the 8th ACM international conference on multimedia (MM 2000), Los Angeles, California, October/November 2000pp 213–216
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Bertini, M., Del Bimbo, A. & Nunziati, W. Highlights modeling and detection in sports videos. Pattern Anal Applic 7, 411–421 (2004). https://doi.org/10.1007/s10044-004-0234-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10044-004-0234-1