Abstract
To make the automatic person indexing of interview video in the TV news program, this paper proposes the method to detect the overlay name text line among the whole overlay texts in one frame. The proposed method is based on the identification of the beginning frame and the edge using Canny edge detector. The experimental results on Korean television news videos show that the proposed method efficiently detects and localizes the overlaid name text line.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Hua, X.-S., Liu, W., Zhang, H.-J.: An automatic performance evaluation protocol for video text detection algorithms. IEEE Trans. Circ. Syst. Video Technol. 14(4), 498–507 (2004)
Lee, S.H., Ahn, J.I., Jo, K.H.: Automatic name line detection for person indexing based on overlay text. J. Multimedia Inf. Syst. 2(1), 163–170 (2015)
Ye, Q., Doermann, D.: Text detection and recognition in imagery: a survey. IEEE Trans. Pattern Anal. Mach. Intell. 37(7), 1480–1500 (2015)
Wang, Z., Wu, X., Yang, L., Zhang, Y.: A survey on video caption extraction technology. In: The 4th International Conference on Multimedia Information Networking and Security, pp. 713–716 (2012)
Zhang, J., Kasturi, R.: Extraction of text objects in video documents: recent progress. In: The 8th IAPR Workshop on Document Analysis Systems, pp. 5–17 (2008)
Xu, J., Shivakumara, P., Lu, T., Phan, T.Q., Tan, C.L.: Graphics and scene text classification in video. In: The 22nd International Conference on Pattern Recognition, pp. 4714–4719 (2014)
Aradhye, H.B., Myers, G.K.: Exploiting videotext “Events” for improved videotext detection. In: The 9th International Conference on Document Analysis and Recognition, pp. 894–898 (2007)
Shivakumara, P., Phan, T.Q., Tan Hong, C.L., Lim, K.J.: A gradient difference based technique for video text detection. In: ICDAR, pp. 156–160 (2009)
Gargi, U., Antani, S., Woods, R.E.: Indexing text events in digital video database. Pattern Recogn. 1, 1481–1483 (1998)
Shivakumara, P., Huang, W., Tan, C.L.: An efficient edge based technique for text detection in video frames. In: DAS, pp. 307–314 (2008)
Fu, X., Gao, H.: Gray-based news video text extraction approach. In: 5th International Conference on Computer Science and Convergence Information Technology (2010)
Yang, Z., Shi, P.: Caption detection and text recognition in news video. In: 5th International Congress on Image and Signal Processing (2012)
Yen, S.-H., Chang, H.-W., Wang, C.-J., Wang, C.-W.: Robust news video text detection based on edges and line-deletion. WSEAS Trans. Sig. Process. 6(4), 186–195 (2010)
Sato, T., Kanade, T., Huges, E.K., Smith, M.A., Sato, S.: Video OCR: Indexing digital news libraries by recognition of superimposed caption. Multimedia Syst. Issue 5(7), 385–395 (1999)
Poignant, J., Besacier, L., Quenot, G., Thollard, F.: From text detection in videos to person identification. In: International Conference on Multimedia and Expo (2012)
Satoh, S., Nakamura, Y., Kande, T.: Name-It: naming and detecting faces in news videos. In: Proceedings of IEEE Multimedia (1999)
Gay, P., Dupuy, G., Lailler, C., Odobez, J.-M., Meignier, S., Deleglise, P.: Comparison of two methods for unsupervised person identification in TV shows. In: 12th International Workshop on Content Based Multimedia Indexing (2014)
Pham, P.T., Tuytelaars, T., Mones, M.-F.: Naming people in news videos with label propagation. In: Proceedings of ICME (2010)
Jou, B., Li, H., Ellis, G., Morozoff-Abegauz, D., Chang, S.-F.: Structured exploration of who, what, when, and where in heterogeneous multimedia news source. In: Proceedings of ACM Multimedia (2013)
Poignant, J., Besacier, L., Le, V.B., Rosset, S., Quenot, G.: Unsupervised speaker identification in TV broadcast based on written names. In: Proceedings of Interspeech (2013)
Poignant, J., Bredin, H., Le, V.B., Besacier, L., Barras, C., Quenot, G.: Unsupervised speaker identification using overlay texts in TV broadcast. In: Proceedings of Interspeech (2012)
Bendris, M., Favre, B., Charlet, D., Damnati, G., Senay, G., Auguste, R., Martinet, J.: Unsupervised face identification in TV content using audio-visual sources. In: Proceedings of CBMI (2013)
Lee, C.-C., Chiang, Y.-C., Huang, H.-M., Tsai, C.-L.: A fast caption localization and detection for news videos. In: The 2nd International Conference on Innovative Computing Information and Control, pp. 226–229 (2007)
Lee, S.H., Ahn, J.I., Jo, K.H.: Comparison of text beginning frame detection methods for robust overlay text recognition. In: IWAIT 2016 (2016)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Lee, S., Ahn, J., Lee, Y., Jo, K. (2016). Beginning Frame and Edge Based Name Text Localization in News Interview Videos. In: Huang, DS., Han, K., Hussain, A. (eds) Intelligent Computing Methodologies. ICIC 2016. Lecture Notes in Computer Science(), vol 9773. Springer, Cham. https://doi.org/10.1007/978-3-319-42297-8_54
Download citation
DOI: https://doi.org/10.1007/978-3-319-42297-8_54
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-42296-1
Online ISBN: 978-3-319-42297-8
eBook Packages: Computer ScienceComputer Science (R0)