Skip to main content

Beginning Frame and Edge Based Name Text Localization in News Interview Videos

  • Conference paper
  • First Online:
Intelligent Computing Methodologies (ICIC 2016)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9773))

Included in the following conference series:

Abstract

To make the automatic person indexing of interview video in the TV news program, this paper proposes the method to detect the overlay name text line among the whole overlay texts in one frame. The proposed method is based on the identification of the beginning frame and the edge using Canny edge detector. The experimental results on Korean television news videos show that the proposed method efficiently detects and localizes the overlaid name text line.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Hua, X.-S., Liu, W., Zhang, H.-J.: An automatic performance evaluation protocol for video text detection algorithms. IEEE Trans. Circ. Syst. Video Technol. 14(4), 498–507 (2004)

    Article  Google Scholar 

  2. Lee, S.H., Ahn, J.I., Jo, K.H.: Automatic name line detection for person indexing based on overlay text. J. Multimedia Inf. Syst. 2(1), 163–170 (2015)

    Google Scholar 

  3. Ye, Q., Doermann, D.: Text detection and recognition in imagery: a survey. IEEE Trans. Pattern Anal. Mach. Intell. 37(7), 1480–1500 (2015)

    Article  Google Scholar 

  4. Wang, Z., Wu, X., Yang, L., Zhang, Y.: A survey on video caption extraction technology. In: The 4th International Conference on Multimedia Information Networking and Security, pp. 713–716 (2012)

    Google Scholar 

  5. Zhang, J., Kasturi, R.: Extraction of text objects in video documents: recent progress. In: The 8th IAPR Workshop on Document Analysis Systems, pp. 5–17 (2008)

    Google Scholar 

  6. Xu, J., Shivakumara, P., Lu, T., Phan, T.Q., Tan, C.L.: Graphics and scene text classification in video. In: The 22nd International Conference on Pattern Recognition, pp. 4714–4719 (2014)

    Google Scholar 

  7. Aradhye, H.B., Myers, G.K.: Exploiting videotext “Events” for improved videotext detection. In: The 9th International Conference on Document Analysis and Recognition, pp. 894–898 (2007)

    Google Scholar 

  8. Shivakumara, P., Phan, T.Q., Tan Hong, C.L., Lim, K.J.: A gradient difference based technique for video text detection. In: ICDAR, pp. 156–160 (2009)

    Google Scholar 

  9. Gargi, U., Antani, S., Woods, R.E.: Indexing text events in digital video database. Pattern Recogn. 1, 1481–1483 (1998)

    Google Scholar 

  10. Shivakumara, P., Huang, W., Tan, C.L.: An efficient edge based technique for text detection in video frames. In: DAS, pp. 307–314 (2008)

    Google Scholar 

  11. Fu, X., Gao, H.: Gray-based news video text extraction approach. In: 5th International Conference on Computer Science and Convergence Information Technology (2010)

    Google Scholar 

  12. Yang, Z., Shi, P.: Caption detection and text recognition in news video. In: 5th International Congress on Image and Signal Processing (2012)

    Google Scholar 

  13. Yen, S.-H., Chang, H.-W., Wang, C.-J., Wang, C.-W.: Robust news video text detection based on edges and line-deletion. WSEAS Trans. Sig. Process. 6(4), 186–195 (2010)

    Google Scholar 

  14. Sato, T., Kanade, T., Huges, E.K., Smith, M.A., Sato, S.: Video OCR: Indexing digital news libraries by recognition of superimposed caption. Multimedia Syst. Issue 5(7), 385–395 (1999)

    Article  Google Scholar 

  15. Poignant, J., Besacier, L., Quenot, G., Thollard, F.: From text detection in videos to person identification. In: International Conference on Multimedia and Expo (2012)

    Google Scholar 

  16. Satoh, S., Nakamura, Y., Kande, T.: Name-It: naming and detecting faces in news videos. In: Proceedings of IEEE Multimedia (1999)

    Google Scholar 

  17. Gay, P., Dupuy, G., Lailler, C., Odobez, J.-M., Meignier, S., Deleglise, P.: Comparison of two methods for unsupervised person identification in TV shows. In: 12th International Workshop on Content Based Multimedia Indexing (2014)

    Google Scholar 

  18. Pham, P.T., Tuytelaars, T., Mones, M.-F.: Naming people in news videos with label propagation. In: Proceedings of ICME (2010)

    Google Scholar 

  19. Jou, B., Li, H., Ellis, G., Morozoff-Abegauz, D., Chang, S.-F.: Structured exploration of who, what, when, and where in heterogeneous multimedia news source. In: Proceedings of ACM Multimedia (2013)

    Google Scholar 

  20. Poignant, J., Besacier, L., Le, V.B., Rosset, S., Quenot, G.: Unsupervised speaker identification in TV broadcast based on written names. In: Proceedings of Interspeech (2013)

    Google Scholar 

  21. Poignant, J., Bredin, H., Le, V.B., Besacier, L., Barras, C., Quenot, G.: Unsupervised speaker identification using overlay texts in TV broadcast. In: Proceedings of Interspeech (2012)

    Google Scholar 

  22. Bendris, M., Favre, B., Charlet, D., Damnati, G., Senay, G., Auguste, R., Martinet, J.: Unsupervised face identification in TV content using audio-visual sources. In: Proceedings of CBMI (2013)

    Google Scholar 

  23. Lee, C.-C., Chiang, Y.-C., Huang, H.-M., Tsai, C.-L.: A fast caption localization and detection for news videos. In: The 2nd International Conference on Innovative Computing Information and Control, pp. 226–229 (2007)

    Google Scholar 

  24. Lee, S.H., Ahn, J.I., Jo, K.H.: Comparison of text beginning frame detection methods for robust overlay text recognition. In: IWAIT 2016 (2016)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Kanghyun Jo .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Lee, S., Ahn, J., Lee, Y., Jo, K. (2016). Beginning Frame and Edge Based Name Text Localization in News Interview Videos. In: Huang, DS., Han, K., Hussain, A. (eds) Intelligent Computing Methodologies. ICIC 2016. Lecture Notes in Computer Science(), vol 9773. Springer, Cham. https://doi.org/10.1007/978-3-319-42297-8_54

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-42297-8_54

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-42296-1

  • Online ISBN: 978-3-319-42297-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics