Beginning Frame and Edge Based Name Text Localization in News Interview Videos

Lee, Sanghee; Ahn, Jungil; Lee, Youlkyeoung; Jo, Kanghyun

doi:10.1007/978-3-319-42297-8_54

Sanghee Lee¹⁶,
Jungil Ahn¹⁷,
Youlkyeoung Lee¹⁶ &
…
Kanghyun Jo¹⁶

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9773))

Included in the following conference series:

International Conference on Intelligent Computing

2942 Accesses
4 Citations

Abstract

To make the automatic person indexing of interview video in the TV news program, this paper proposes the method to detect the overlay name text line among the whole overlay texts in one frame. The proposed method is based on the identification of the beginning frame and the edge using Canny edge detector. The experimental results on Korean television news videos show that the proposed method efficiently detects and localizes the overlaid name text line.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Hua, X.-S., Liu, W., Zhang, H.-J.: An automatic performance evaluation protocol for video text detection algorithms. IEEE Trans. Circ. Syst. Video Technol. 14(4), 498–507 (2004)
Article Google Scholar
Lee, S.H., Ahn, J.I., Jo, K.H.: Automatic name line detection for person indexing based on overlay text. J. Multimedia Inf. Syst. 2(1), 163–170 (2015)
Google Scholar
Ye, Q., Doermann, D.: Text detection and recognition in imagery: a survey. IEEE Trans. Pattern Anal. Mach. Intell. 37(7), 1480–1500 (2015)
Article Google Scholar
Wang, Z., Wu, X., Yang, L., Zhang, Y.: A survey on video caption extraction technology. In: The 4th International Conference on Multimedia Information Networking and Security, pp. 713–716 (2012)
Google Scholar
Zhang, J., Kasturi, R.: Extraction of text objects in video documents: recent progress. In: The 8th IAPR Workshop on Document Analysis Systems, pp. 5–17 (2008)
Google Scholar
Xu, J., Shivakumara, P., Lu, T., Phan, T.Q., Tan, C.L.: Graphics and scene text classification in video. In: The 22nd International Conference on Pattern Recognition, pp. 4714–4719 (2014)
Google Scholar
Aradhye, H.B., Myers, G.K.: Exploiting videotext “Events” for improved videotext detection. In: The 9th International Conference on Document Analysis and Recognition, pp. 894–898 (2007)
Google Scholar
Shivakumara, P., Phan, T.Q., Tan Hong, C.L., Lim, K.J.: A gradient difference based technique for video text detection. In: ICDAR, pp. 156–160 (2009)
Google Scholar
Gargi, U., Antani, S., Woods, R.E.: Indexing text events in digital video database. Pattern Recogn. 1, 1481–1483 (1998)
Google Scholar
Shivakumara, P., Huang, W., Tan, C.L.: An efficient edge based technique for text detection in video frames. In: DAS, pp. 307–314 (2008)
Google Scholar
Fu, X., Gao, H.: Gray-based news video text extraction approach. In: 5th International Conference on Computer Science and Convergence Information Technology (2010)
Google Scholar
Yang, Z., Shi, P.: Caption detection and text recognition in news video. In: 5th International Congress on Image and Signal Processing (2012)
Google Scholar
Yen, S.-H., Chang, H.-W., Wang, C.-J., Wang, C.-W.: Robust news video text detection based on edges and line-deletion. WSEAS Trans. Sig. Process. 6(4), 186–195 (2010)
Google Scholar
Sato, T., Kanade, T., Huges, E.K., Smith, M.A., Sato, S.: Video OCR: Indexing digital news libraries by recognition of superimposed caption. Multimedia Syst. Issue 5(7), 385–395 (1999)
Article Google Scholar
Poignant, J., Besacier, L., Quenot, G., Thollard, F.: From text detection in videos to person identification. In: International Conference on Multimedia and Expo (2012)
Google Scholar
Satoh, S., Nakamura, Y., Kande, T.: Name-It: naming and detecting faces in news videos. In: Proceedings of IEEE Multimedia (1999)
Google Scholar
Gay, P., Dupuy, G., Lailler, C., Odobez, J.-M., Meignier, S., Deleglise, P.: Comparison of two methods for unsupervised person identification in TV shows. In: 12th International Workshop on Content Based Multimedia Indexing (2014)
Google Scholar
Pham, P.T., Tuytelaars, T., Mones, M.-F.: Naming people in news videos with label propagation. In: Proceedings of ICME (2010)
Google Scholar
Jou, B., Li, H., Ellis, G., Morozoff-Abegauz, D., Chang, S.-F.: Structured exploration of who, what, when, and where in heterogeneous multimedia news source. In: Proceedings of ACM Multimedia (2013)
Google Scholar
Poignant, J., Besacier, L., Le, V.B., Rosset, S., Quenot, G.: Unsupervised speaker identification in TV broadcast based on written names. In: Proceedings of Interspeech (2013)
Google Scholar
Poignant, J., Bredin, H., Le, V.B., Besacier, L., Barras, C., Quenot, G.: Unsupervised speaker identification using overlay texts in TV broadcast. In: Proceedings of Interspeech (2012)
Google Scholar
Bendris, M., Favre, B., Charlet, D., Damnati, G., Senay, G., Auguste, R., Martinet, J.: Unsupervised face identification in TV content using audio-visual sources. In: Proceedings of CBMI (2013)
Google Scholar
Lee, C.-C., Chiang, Y.-C., Huang, H.-M., Tsai, C.-L.: A fast caption localization and detection for news videos. In: The 2nd International Conference on Innovative Computing Information and Control, pp. 226–229 (2007)
Google Scholar
Lee, S.H., Ahn, J.I., Jo, K.H.: Comparison of text beginning frame detection methods for robust overlay text recognition. In: IWAIT 2016 (2016)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Electrical Engineering, University of Ulsan, Daehak Rd. 93, Nam-gu, Ulsan, Korea
Sanghee Lee, Youlkyeoung Lee & Kanghyun Jo
Department of Technology, Ulsan Broadcasting Corporation, Gukyo Rd. 41, Jung-gu, Ulsan, Korea
Jungil Ahn

Authors

Sanghee Lee
View author publications
You can also search for this author in PubMed Google Scholar
Jungil Ahn
View author publications
You can also search for this author in PubMed Google Scholar
Youlkyeoung Lee
View author publications
You can also search for this author in PubMed Google Scholar
Kanghyun Jo
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Kanghyun Jo .

Editor information

Editors and Affiliations

Tongji University , Shanghai, China
De-Shuang Huang
Inha University , Incheon, Korea (Republic of)
Kyungsook Han
Liverpool John Moores University , Liverpool, United Kingdom
Abir Hussain

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Lee, S., Ahn, J., Lee, Y., Jo, K. (2016). Beginning Frame and Edge Based Name Text Localization in News Interview Videos. In: Huang, DS., Han, K., Hussain, A. (eds) Intelligent Computing Methodologies. ICIC 2016. Lecture Notes in Computer Science(), vol 9773. Springer, Cham. https://doi.org/10.1007/978-3-319-42297-8_54

Download citation

DOI: https://doi.org/10.1007/978-3-319-42297-8_54
Published: 12 July 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-42296-1
Online ISBN: 978-3-319-42297-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics