Abstract
Automatic content analysis of sports videos is a valuable and challenging task. Motivated by analogies between a class of sports videos and languages, the authors propose a novel approach for sports video analysis based on compiler principles. It integrates both semantic analysis and syntactic analysis to automatically create an index and a table of contents for a sports video. Each shot of the video sequence is first annotated and indexed with semantic labels through detection of events using domain knowledge. A grammar-based parser is then constructed to identify the tree structure of the video content based on the labels. Meanwhile, the grammar can be used to detect and recover errors during the analysis. As a case study, a sports video parsing system is presented in the particular domain of diving. Experimental results indicate the proposed approach is effective.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Dimitrova N, Zhang H J, Shahraray B, Sezan I, Huang T, Zakhor A. Applications of video-content analysis and retrieval.IEEE Multimedia, 2002, 9(3): 42–55.
Li F C, Gupta A, Sanccki E, He L, Rui Y. Browsing digital video. InProc. ACM Conference on Human Factors in Computing Systems, Hague, Netherlands, Apr. 2000, pp.169–176.
Zhang H J, Gong Y H, Smoliar S W, Tan S Y. Automatic parsing of news video. InProc. IEEE International Conference on Multimedia Computing and Systems, Boston, MA, USA, May 1994, pp.45–54.
Wang W, Gao W. Automatic segmentation of news items based on video and audio features.Journal of Computer Science and Technology, Mar. 2002, 17(2): 189–195.
Babaguchi N, Kawai Y, Kitahashi T. Event based indexing of broadcasted sports video by intermodal collaboration.IEEE Trans. Multimedia, Mar. 2002, 4(1): 68–75.
Duan L Y, Xu M, Chua T S, Tian Q, Xu C S. A midlevel representation framework for semantic sports video analysis. InProc. ACM Multimedia, Berkeley, CA, USA, Nov. 2003, pp.33–44.
Rui Y, Gupta A, Acero A. Automatically extracting highlights for TV baseball programs. InProc. ACM Multimedia, Marina del Rey, CA, USA, Oct. 2000, pp.105–115.
Hanjalic A. Generic approach to highlights extraction from a sports video. InProc. IEEE International Conference on Image Processing, Barcelona, Spain, Sep. 2003, 1: 1–4.
Assfalg J, Bertini M, Colombo C, Bimbo A D. Semantic annotation of sports videos.IEEE Multimedia, 2002, 9(2): 52–60.
Duan L Y, Xu M, Yu X D, Tian Q. A unified framework for semantic shot classification in sports videos. InProc. ACM Multimedia, Juan-les-Pins, France, Dec. 2002, pp.419–420.
Zhong D, Chang S F. Structure analysis of sports video using domain models. InProc. IEEE International Conference on Multimedia and Expo, Tokyo, Japan, Aug. 2001, pp.713–716.
Xie L, Chang S F, Divakaran A, Sun H. Structure analysis of soccer video with hidden Markov models. InProc. International Conference on Acoustic, Speech, and Signal Processing, Orlando, FL, USA, May 2002, 4: 4096–4099.
Rui Y, Huang T S, Mehrotra S. Constructing table-of-content for videos.Multimedia Systems, 1999, 7(5): 359–368.
Lienhart R. Comparison of automatic shot boundary detection algorithms. InProc. SPIE Storage and Retrieval for Imagee and Video Databases, San Jose, CA, USA, Jan. 1999, pp.291–301.
Zhong Y, Zhang H J, Jain A K. Automatic caption localization in compressed video.IEEE Trans. Pattern Analysis and Machine Intelligence, Apr. 2002, 22(4): 385–392.
Wang F, Li J T, Zhang Y D, Lin S X. Automatically extracting highlights for diving video. InProc. China National Computer Conference, Beijing, China, Nov. 2003, 1: 471–475.
Author information
Authors and Affiliations
Corresponding author
Additional information
This work was supported in part by the State Physical Culture Administration of China under Grant No.02005.
Fei Wang was born in 1977. He is a Ph.D. candidate at Institute of Computing Technology (ICT), the Chinese Academy of Sciences (CAS). He received the B.S. degree in electrical engineering from Zhejiang University in 1999 and the M.S degree in computer science from Graduate School of the Chinese Academy of Sciences in 2001. His current research interests include content-based video analysis and retrieval.
Jin-Tao Li was born in 1962. He is a professor and Ph.D. supervisor at ICT, CAS. His main research areas include multimedia data compression, virtual reality, and home network.
Yong-Dong Zhang was born in 1973. He is an associate professor at ICT, CAS. His main research areas include multimedia data compression and multimedia information retrieval.
Shou-Xun Lin was born in 1948. He is a professor and Ph.D. supervisor at ICT, CAS. His main research areas include multimedia technology and systems.
Rights and permissions
About this article
Cite this article
Wang, F., Li, JT., Zhang, YD. et al. Semantic and structural analysis of TV diving programs. J. Comput. Sci. & Technol. 19, 928–935 (2004). https://doi.org/10.1007/BF02973456
Received:
Revised:
Published:
Issue Date:
DOI: https://doi.org/10.1007/BF02973456