Abstract
This paper proposes a new method for improving the performance of anchorperson shot extraction from news programs. Anchorperson voice information is used to verify the anchorperson shot candidates extracted from visual information. The algorithm starts by extracting anchorperson voice shots using time and silence conditions. The anchorperson voice models are created after anchorperson voice shots containing two or more voices have been segregated. These voice models then verify the anchorperson shot candidates obtained from visual information. The method is tested on 720 minutes of news programs, and experimental results are presented.
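The pipeline outlined in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not specify the features, the exact time and silence conditions, or the form of the voice model. The `Shot` fields, the thresholds `min_duration`, `min_silence`, and `threshold`, and the single diagonal Gaussian used as the voice model are all assumptions made for illustration. The step that segregates shots containing two or more voices is omitted.

```python
import math
from dataclasses import dataclass
from typing import List, Tuple

@dataclass
class Shot:
    start: float                  # shot start time in seconds
    end: float                    # shot end time in seconds
    leading_silence: float        # silence before the shot, in seconds (hypothetical field)
    features: List[List[float]]   # per-frame audio feature vectors (hypothetical)

Model = Tuple[List[float], List[float]]  # (per-dimension mean, per-dimension variance)

def select_voice_shots(shots: List[Shot],
                       min_duration: float = 10.0,
                       min_silence: float = 0.3) -> List[Shot]:
    """Keep shots that satisfy an assumed time condition (long enough)
    and an assumed silence condition (preceded by a pause)."""
    return [s for s in shots
            if (s.end - s.start) >= min_duration
            and s.leading_silence >= min_silence]

def fit_voice_model(shots: List[Shot]) -> Model:
    """Fit one diagonal Gaussian over all frames of the selected shots.
    This is a stand-in for whatever voice model the paper actually uses."""
    frames = [f for s in shots for f in s.features]
    if not frames:
        raise ValueError("no frames available to fit a voice model")
    n, dim = len(frames), len(frames[0])
    mean = [sum(f[d] for f in frames) / n for d in range(dim)]
    # Floor the variance so the log-likelihood stays finite.
    var = [max(sum((f[d] - mean[d]) ** 2 for f in frames) / n, 1e-6)
           for d in range(dim)]
    return mean, var

def avg_log_likelihood(shot: Shot, model: Model) -> float:
    """Average per-frame log-likelihood of a shot under the voice model."""
    mean, var = model
    total = 0.0
    for frame in shot.features:
        for x, m, v in zip(frame, mean, var):
            total += -0.5 * (math.log(2 * math.pi * v) + (x - m) ** 2 / v)
    return total / len(shot.features)

def verify_candidates(candidates: List[Shot], model: Model,
                      threshold: float) -> List[Shot]:
    """Keep only the visually detected candidates whose audio matches the model."""
    return [c for c in candidates if avg_log_likelihood(c, model) >= threshold]
```

In this sketch, a visually detected candidate whose audio does not resemble the modeled anchorperson voice is rejected as a false alarm, which mirrors the verification role the abstract assigns to the voice model.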
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kim, SK., Hwang, D.S., Kim, JY., Seo, YS. (2004). An Effective Anchorperson Shot Extraction Method Robust to False Alarms. In: Aizawa, K., Nakamura, Y., Satoh, S. (eds) Advances in Multimedia Information Processing - PCM 2004. PCM 2004. Lecture Notes in Computer Science, vol 3331. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30541-5_32
DOI: https://doi.org/10.1007/978-3-540-30541-5_32
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23974-1
Online ISBN: 978-3-540-30541-5
eBook Packages: Computer Science, Computer Science (R0)