Abstract
Text-facilitated sports video analysis has been widely successful in video indexing, retrieval, and summarization. Previous work commonly relies on aligning timestamps between the sports video and the game text individually, which is not a robust basis for generic cross-media analysis. In this paper, we propose a hierarchical semantics-matching approach to annotating sports video. Our key idea is to link video and text through high-level semantics rather than low-level features, and to find the optimal video-text alignment from the overall structure of the two sequences rather than from individual conditions. To locate events accurately, the algorithm is implemented hierarchically, producing annotation results that are both fine-grained and accurate. Experiments on basketball and football matches demonstrate that the proposed approach is effective for text-facilitated sports video annotation.
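The idea of aligning video and text on overall structure rather than on individual conditions can be illustrated with a small sketch. The abstract does not name the alignment algorithm, so the code below assumes a Needleman-Wunsch-style global alignment over two sequences of event labels. The function name `align_events`, the labels such as "shot" and "foul", and the scoring values are all hypothetical choices made for illustration, and the hierarchical refinement step is not covered.

```python
from typing import List, Optional, Tuple

Pair = Tuple[Optional[int], Optional[int]]


def align_events(video: List[str], text: List[str],
                 match: int = 2, mismatch: int = -1, gap: int = -1) -> List[Pair]:
    """Globally align two event-label sequences by dynamic programming.

    Returns (video_index, text_index) pairs; None marks an event that has
    no counterpart in the other sequence. Scoring values are assumptions.
    """
    n, m = len(video), len(text)
    # score[i][j] = best score for aligning video[:i] with text[:j]
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            s = match if video[i - 1] == text[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + s,   # pair the two events
                              score[i - 1][j] + gap,     # video event unmatched
                              score[i][j - 1] + gap)     # text event unmatched

    # Trace back from the bottom-right cell to recover one optimal alignment.
    pairs: List[Pair] = []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0:
            s = match if video[i - 1] == text[j - 1] else mismatch
            if score[i][j] == score[i - 1][j - 1] + s:
                pairs.append((i - 1, j - 1))
                i, j = i - 1, j - 1
                continue
        if i > 0 and score[i][j] == score[i - 1][j] + gap:
            pairs.append((i - 1, None))
            i -= 1
        else:
            pairs.append((None, j - 1))
            j -= 1
    pairs.reverse()
    return pairs


# Hypothetical example: the video detector reports a "foul" that the text omits.
video_events = ["shot", "foul", "shot", "rebound"]
text_events = ["shot", "shot", "rebound"]
alignment = align_events(video_events, text_events)
```

Because the score is computed over both whole sequences, a single spurious or missing event shifts only its own position in the alignment, while the remaining events still pair up in order. Matching each event independently by timestamp lacks this property.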
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Liang, C., Zhang, Y., Xu, C., Wang, J., Lu, H. (2009). A Hierarchical Semantics-Matching Approach for Sports Video Annotation. In: Muneesawang, P., Wu, F., Kumazawa, I., Roeksabutr, A., Liao, M., Tang, X. (eds) Advances in Multimedia Information Processing - PCM 2009. PCM 2009. Lecture Notes in Computer Science, vol 5879. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-10467-1_60
DOI: https://doi.org/10.1007/978-3-642-10467-1_60
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-10466-4
Online ISBN: 978-3-642-10467-1
eBook Packages: Computer Science, Computer Science (R0)