Abstract
Evolution of spatial relationships between objects often provides important clues for semantic video analysis. We present a symbolic representation that describes spatiotemporal characteristics and facilitates tactics detection based on string matching. To find typical spatiotemporal patterns of a targeted tactic, we organize training sequences as a tree, and effectively discover frequent patterns from the structure. Tactics detection is conducted by comparing a given test sequence with these frequent patterns. To realize the proposed idea, we develop elaborate audio/video processes to transform broadcasting tennis videos into symbolic sequences, and comprehensively tackle event detection and tactics analysis. We experiment on ten most important tennis championships in the year 2008, and report promising detection results on seven events/tactics. We demonstrate not only the effectiveness of the proposed methods, but also study the impacts brought by the results of tactics analysis.










Similar content being viewed by others
References
Cai R, Lu L, Zhang H-J, Cai LH (2003) Highlight sound effects detection in audio stream. In Proceedings of IEEE International Conference on Multimedia & Expo, vol. 3, 37–40
Chang C-C, Lin C-J (2001) LIBSVM: a library for support vector machine. Software available at http://www.csie.ntu.edu.tw/~cjlin/libsvm
Chang S-F, Chen W, Meng HJ, Sundaram H, Zong D (1998) A fully automated content-based video search engine supporting spatiotemporal queries. IEEE Trans Circuits Syst Video Technol 8(5):602–615
Cheng W-H, Chu W-T, Wu J-L (2003) Semantic context detection based on hierarchical audio models. In Proceedings of the ACM SIGMM International Workshop on Multimedia Information Retrieval, pp 109–115
Choi J, Joen WJ, Lee S-C (2008) Spatio-temporal pyramid matching for sports videos. In Proceedings of ACM International Conference on Multimedia Information Retrieval, pp. 291–297
Chu W-T, Wu J-L (2008) Explicit semantic events detection and development of realistic applications for broadcasting baseball videos. Multimedia Tools and Applications 38(1):27–50
Chu W-T, Tien M-C, Wang Y-T, Chou C-W, Hsieh K-Y, Wu J-L (2007) Event detection in tennis matches based on real-world audiovisual cues. In Proceedings of the 20th Computer Vision, Graphics, and Image Processing Conference, pp 541-548
Duan L-Y, Xu M, Chua T-S, Tian Q, Xu C-S (2003) A mid-level representation framework for semantic video analysis. In Proceedings of ACM Multimedia, pp. 33–44
Duan L-Y, Xu M, Tian Q, Xu C-S, Jin JS (2005) A unified framework for semantic shot classification in sports video. IEEE Trans Multimedia 7(6):1066–1083
Ekin A, Tekalp AM, Mehrota R (2003) Automatic soccer video analysis and summarization. IEEE Trans Image Process 12(7):796–807
Farin D, Krabbe S, de With PHN, Effelsberg W (2004) Robust camera calibration for sport videos using court models. In Proceedings of SPIE Storage and Retrieval Methods and Applications for Multimedia, vol. 5307, 80–91
Galmar E, Huet B (2008) Spatiotemporal modeling and matching of video shots. In Proceedings of 1st ICIP workshop on Multimedia Information Retrieval: New Trends and Challenges
Han M, Hua W, Xu W, Gong Y (2002) An integrated baseball digest system using maximum entropy method. In Proceedings of ACM Multimedia, pp 347–350
Han J, Farin D, de With PHN (2006) Multi-level analysis of sports video sequences. In Proceedings of SPIE Conference on Multimedia Content Analysis, Management, and Retrieval
Hanjalic A (2002) Shot-boundary detection: unraveled and resolved? IEEE Trans Circuits Syst Video Technol 12(2):90–105
Hartley R, Zisserman A (2000) Multiple view geometry in computer vision. Cambridge University Press
Huang Y-P, Chiou C-L, Sandnes FE (2009) An intelligent strategy for the automatic detection of highlights in tennis video recordings. Expert Syst Appl 36:9907–9918
Kijak E, Gravier G, Oisel L, Gros P (2006) Audiovisual integration for tennis broadcast structuring. Multimedia Tools and Applications 30:289–311
Kolonias I, Christmas W, Kittler J (2004) Automatic evolution tracking for tennis matches using an HMM-based architecture. In Proceedings of IEEE Workshop on Machine Learning for Signal Processing, pp 615–624
Lan D-J, Ma Y-F, Ma W-Y, Zhang H-J (2004) Spatio-temporal pattern mining in sports video. In Proceedings of Pacific-Rim Conference on Multimedia, pp 306–313
Leonardi R, Migliorati P, Prandini M (2004) Semantic indexing of soccer audio-visual sequences: a multimodal approach based on controlled Markov chains. IEEE Trans Circuits Syst Video Technol 14(5):634–643
Liu Y, Jiang S, Ye Q, Gao W, Huang Q (2005) Playfield detection using adaptive GMM and its applications. In Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 2, 421–424
Pallavi V, Mukherjee J, Majumdar AK, Sural S (2008) Graph-based multiplayer detection and tracking in broadcast soccer videos. IEEE Trans Multimedia 10(5):794–805
Pantic M, Patras I, Valstar MF (2005) Learning spatio-temporal models of facial expressions. In Proceedings of International Conference on Measuring Behaviour
Rea N, Dahyot R, Kokaram A (2005) Classification and representation of semantic content in broadcast tennis videos. In Proceedings of IEEE International Conference on Image Processing, vol. 3, pp. 1204–1207
Rui Y, Gupta A, Acero A (2000) Automatically extracting highlights for TV baseball programs. In Proceedings of ACM Multimedia, pp 105–115
Sadlier DA, O’connor NE (2005) Event detection in field sports videos using audio-visual features and a support vector machine. IEEE Trans Circuits Syst Video Technol 15(10):1225–1233
Snoek CGM, Worring M, Geusebroek J-M, Koelma DC, Seinstra FJ, Smeulders AWM (2006) The semantic pathfinder: using an authoring metaphor for generic multimedia indexing. IEEE Trans Pattern Anal Mach Intell 28(10):1678–1689
Song X, Fan G (2007) Selecting salient frames for spatiotemporal video modeling and segmentation. IEEE Trans Image Process 16(2):3035–3046
Sudhir G, Lee JC, Jain AK (1998) Automatic classification of tennis videos for high-level content-based retrieval. In Proceedings of International Workshop on Content-Based Access of Image and Video Databases, pp. 81–90
Tagagi S, Hattori S, Yokoyama K, Kodate A, Tominaga H (2003) Sports video categorizing method using camera motion parameters. In Proceedings of IEEE International Conference on Multimedia & Expo, vol. 2, pp 461–464
United States Tennis Association (1996) Tennis tactics—winning patterns of play. Human Kinetics Publishers
Wang JR, Parameswaran N (2005) Analyzing tennis tactics from broadcasting tennis video clips. In Proceedings of International Multimedia Modelling Conference, pp 102–106
Wang Y, Liu Z, Huang JC (2000) Multimedia content analysis using both audio and visual cues. IEEE Signal Process Mag 17(6):12–36
Wang P, Cui R, Yang S-Q (2004) A tennis video indexing approach through pattern discovery in interactive process. In Proceedings of Pacific-Rim Conference on Multimedia, pp 49–56
Xie L, Xu P, Chang S-F, Divakaran A, Sun H (2004) Structure analysis of soccer video with domain knowledge and hidden Markov models. Pattern Recogn Lett 26(7):767–775
Xu H, Chua T-S (2006) Fusion of AV features and external information sources for event detection in team sports video. ACM Transactions on Multimedia Computing, Communications, and Applications 2(1):44–67
Xu M, Duan L-Y, Xu C-S, Tian Q (2003) A fusion scheme of visual and auditory modalities for event detection in sports video. In Proceedings IEEE International Conference on Acoustics, Speech, & Signal Processing, vol. 3, pp. 189–192
Yu X, Xu C, Leong HW, Tian Q, Tang Q, Wan KW (2003) Trajectory-based ball detection and tracking with applications to semantic analysis of broadcast soccer video. In Proceedings of ACM Multimedia, pp 11–20
Zhang D, Chang S-F (2002) Event detection in baseball video using superimposed caption recognition. In Proceedings of ACM Multimedia, pp. 315–318
Zhu G, Xu C, Huang Q, Gao W (2006) Automatic multi-player detection and tracking in broadcast sports video using support vector machine and particle filter. In Proceedings of IEEE International Conference on Multimedia & Expo, pp 1629–1632
Zhu, G., Huang, Q., Xu, C., Rui, Y., Jiang, S., Gao, W., and Yao, H. (2007) Trajectory based event tactics analysis in broadcast sports video. In Proceedings of ACM Multimedia, pp. 58–67
Zhu G, Huang Q, Xu C, Xing L, Gao W, Yao H (2007) Human behavior analysis for highlight ranking in broadcast racket sports video. IEEE Trans Multimedia 9(6):1167–1182
Zhu G, Xu C, Huang Q, Liu H (2008) Event tactic analysis based on player and ball trajectory in broadcast video. In Proceedings of ACM International Conference on Image and Video Retrieval, pp 515–523
Acknowledgements
This work was partially supported by the National Science Council of ROC under NSC 96-2218-E-194-005 and 97-2221-E-194-050. The authors would like to thank anonymous reviewers for giving valuable comments, and thank Ming-Chun Tien, Yi-Tang Wang, Chen-Wei Chou, Kuei-Yi Hsieh, and Ja-Ling Wu for co-developing former version of the system.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Chu, WT., Tsai, WH. Modeling spatiotemporal relationships between moving objects for event tactics analysis in tennis videos. Multimed Tools Appl 50, 149–171 (2010). https://doi.org/10.1007/s11042-009-0363-z
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-009-0363-z