Abstract
In this paper, we propose an efficient one-pass algorithm for shot boundary detection and a cost-effective anchor shot detection method with search space reduction, which are unified scheme in news video story parsing. First, we present the desired requirements for shot boundary detection from the perspective of news video story parsing, and propose a new shot boundary detection method, based on singular value decomposition, and a newly developed algorithm, viz., Kernel-ART, which meets all of these requirements. Second, we propose a new anchor shot detection system, viz., MASD, which is able to detect anchor person cost-effectively by reducing the search space. It consists of skin color detector, face detector, and support vector data descriptions with non-negative matrix factorization sequentially. The experimental results with the qualitative analysis illustrate the efficiency of the proposed method.






Similar content being viewed by others
References
Baraldi EC (1998) Simplified ART: a new class of ART algorithms. International Computer Science Institute, TR 98-004
Cernekova Z, Kotropoulos C, Pitas I (2003) Video shot segmentation using singular value decomposition. In: Procs of international conference on acoustics, speech, and signal processing, vol 3, pp 181–184
Cernekova Z, Pitas I, Nikou C (2006) Information theory-based shot cut/fade detection and video summarization. IEEE Trans Circuits Syst Video Technol 16(1):82–91
Chaisorn L, Chua T, Lee C (2003) A multi-modal approach to story segmentation for news video. In: World Wide Web: internet and web information systems, vol 6, pp 187–208
Colace F, Foggia P, Percannella G (2005) A probabilistic framework for TV-news stories detection and classification. In: Procs of international conference on multimedia and expo, pp 1350–1353
Cooper M, Liu T, Rieffel E (2007) Video segmentation via temporal pattern classification. IEEE Trans Multimedia 9(3):610–618
Cristianini N, Shawe-Taylor J (2000) An introduction to support vector machines and other kernel-based learning methods. Cambridge University Press, UK
Fang H, Jiang J, Feng Y (2006) A fuzzy logic approach for detection of video shot boundaries. Pattern Recogn 39:2092–2100
Fang Y, Zhai X, Fan J (2006) News video story segmentation. In: Procs of the international conference on multi-media modeling, pp 397–400
Feng H, Fang W, Liu S, Fang Y (2005) A new general framework for shot boundary detection based on SVM. In: Procs of international conference on neural networks and brain, vol 2, pp 1112–1117
Gao X, Tang X (2002) Unsupervised video shot segmentation and model free anchor person detection for news video story parsing. IEEE Trans Circuits Syst Video Technol 12(9): 765–776
Gao X, Li J, Yang B (2003) A graph-theoretical clustering based anchorperson shot detection for news video indexing. In: Procs of international conference on computational intelligence and multimedia applications, Washington, DC, USA, pp 108–113
Golub G, Van Loan C (1996) Matrix computations, 3rd edn. The Johns Hopkins University Press, USA
Gong Y, Liu X (2000) Video summarization using singular value decomposition. In: Procs of international conference on computer vision and pattern recognition, vol 2, pp 174–180
Hanjalic A, Lagendijk R, Biemond J (1998) Template-based detection of anchorperson shots in news programs. In: Procs of IEEE international conference on image processing, pp 148–152
Ko C, Xie W (2008) News video segmentation and categorization techniques for content-demand browsing. In: Procs of congress on image and signal processing, vol 2, pp 530–534
Lan D, Ma Y, Zhang H (2004) Multi-level anchorperson detection using multimodal association. In: Procs of 17th international conference on pattern recognition, vol 3, pp 890–893
Lee D, Seung H (1999) Learning the parts of objects by non-negative matrix factorization. Nature 401:788–791
Ling X, Yuanxin Q, Huan L, Zhang X (2008) A method for fast shot boundary detection based on SVM. In: Procs of congress on image and signal processing, vol 2, pp 445–449
Luan X, Xie Y, Wu L, Wen J, Lao S (2005) AnchorClu: an anchorperson shot detection method based on clustering. In: Procs of 6th international conference on parallel and distributed computing, applications and technologies, pp 840–844
Santo M, Foggia P, Sansone C, Percannella G, Vento M (2006) An unsupervised algorithm for anchor shot detection. In: Procs of 18th international conference on pattern recognition, vol 2, pp 1238–1241
Solina F, Peer P, Batagelj B, Juvan S, Kovac J (2003) Color-based face detection in the 15 seconds of fame art installation. In: Procs of mirage 2003. INRIA Rocquencourt, France, pp 10–11
Tax D, Duin R (2004) Support vector data description. Mach Learn 54(1):45–66
Viola P, Jones M (2004) Robust real-time face detection. Int J Comput Vis 7(2):137–154
Yoo HW, Cho SB (2007) Video scene retrieval with interactive genetic algorithm. Multimed Tools Appl 34:317–336. doi:10.1007/s11042-007-0109-8
Yuan J, Wang H, Xiao L, Zheng W, Li J, Lin F, Zhang B (2007) A formal study of shot boundary detection. IEEE Trans Circuits Syst Video Technol 17(2):168–186
Zurada JM (1992) Introduction to artificial neural systems. Info Access Distribution, Singapore
Acknowledgements
This research was supported by a Korea University Grant; This research was financially supported by the Ministry of Education, Science Technology (MEST) and Korea Industrial Technology Foundation (KOTEF) through the Human Resource Training Project for Regional Innovation.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Lee, H., Yu, J., Im, Y. et al. A unified scheme of shot boundary detection and anchor shot detection in news video story parsing. Multimed Tools Appl 51, 1127–1145 (2011). https://doi.org/10.1007/s11042-010-0462-x
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-010-0462-x