An automatic video text detection method based on BP-adaboost

Wu, Hui; Zou, Bei-ji; Zhao, Yu-qian; Fu, Hong-pu

doi:10.1007/s11042-015-2690-6

An automatic video text detection method based on BP-adaboost

Published: 14 August 2015

Volume 75, pages 7715–7738, (2016)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

Hui Wu^1,2,
Bei-ji Zou^1,2,
Yu-qian Zhao^1,2,3 &
…
Hong-pu Fu^1,2

545 Accesses
2 Citations
Explore all metrics

Abstract

Video text usually provides us a lot of useful information that is important for video analysis, indexing and retrieval. However, it is still a challenging work to detect text from video images due to variation of text patterns and complexity of background. In this paper, an automatic video text detection method is proposed. Firstly, K-means is utilized to classify pixels in gradient images into text and non-text regions. Subsequently, morphological operations are performed on text regions to form connected candidate text components, followed by projection profile boundary refinement. Finally, the detection results are verified by geometry and BP-Adaboost identifications. The experimental results on our manually selected dataset and the publicly available Microsoft Asia dataset show the effectiveness and feasibility of the proposed method.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Decade research on text detection in images/videos: a review

Article 06 June 2019

Multi-oriented Text Detection from Video Using Sub-pixel Mapping

Automatic video superimposed text detection based on Nonsubsampled Contourlet Transform

Article 25 March 2017

References

Cai M, Song J, Lyu MR (2002) A new approach for video text detection. In: Proceedings of IEEE International Conference on Image Processing, pp I-117
Gui W, Liu J, Yang C, Chen N, Liao X (2013) Color co-occurrence matrix based froth image texture extraction for mineral flotation. Miner Eng 46:60–67
Article Google Scholar
Haralick RM, Shanmugam K, Dinstein IH (1973) Textural features for image classification. IEEE Trans Syst Man Cybern 6:610–621
Article Google Scholar
He M, Yang C, Wang X, Gui W, Wei L (2013) Nonparametric density estimation of froth colour texture distribution for monitoring sulphur flotation process. Miner Eng 53:203–212
Article Google Scholar
Hua XS, Wenyin L, Zhang HJ (2004) An automatic performance evaluation protocol for video text detection algorithms. IEEE Trans Circ Syst Vid 14(4):498–507
Article Google Scholar
Kim W, Kim C (2009) A new approach for overlay text detection and extraction from complex video scene. IEEE Trans Image Process 18(2):401–411
Article MathSciNet Google Scholar
Li Z, Liu G, Qian X, Guo D, Jiang H (2011) Effective and efficient video text extraction using key text points. IET Image Process 5(8):671–683
Article MathSciNet Google Scholar
Liu X, Wang W (2012) Robustly extracting captions in videos based on stroke-like edges and spatio-temporal analysis. IEEE Trans Multimed 14(2):482–489
Article Google Scholar
Liu C, Wang C, Dai R (2005) Text detection in images based on unsupervised classification of edge-based features. In: Proceedings of IEEE International Conference on Document Analysis and Recognition, pp 610–614
Mariano VY, Kasturi R (2000) Locating uniform-colored text in video frames. In: Proceedings of IEEE International Conference on Pattern Recognition, pp 539–542
Phan TQ, Shivakumara P, Tan CL (2009) A Laplacian method for video text detection. In: Proceedings of IEEE International Conference on Document Analysis and Recognition, pp 66–70
Qian X, Wang H, Hou X (2014) Video text detection and localization in intra-frames of H. 264/AVC compressed video[J]. Multimed Tools Appl 70(3):1487–1502
Article Google Scholar
Shivakumara P, Phan TQ, Tan CL (2011) A Laplacian approach to multi-oriented text detection in video. IEEE Trans Pattern Anal Mach Intell 33(2):412–419
Article Google Scholar
Shivakumara P, Sreedhar RP, Phan TQ, Lu S, Tan CL (2012) Multioriented video scene text detection through Bayesian classification and boundary growing. IEEE Trans Circ Syst Vid 22(8):1227–1235
Article Google Scholar
Suzuki K, Horiba I, Sugie N (2003) Linear-time connected-component labeling based on sequential local operations. Comput Vis Image Und 89(1):1–23
Article MATH Google Scholar
Wei YC, Lin CH (2012) A robust video text detection approach using SVM. Expert Syst Appl 39(12):10832–10840
Article Google Scholar
Wong EK, Chen M (2003) A new robust algorithm for video text extraction. Pattern Recognit 36(6):1397–1406
Article MATH Google Scholar
Wu Y, Shivakumara P, Wei W, et al. A new ring radius transform-based thinning method for multi-oriented video characters [J]. Int J Doc Anal Recog (IJDAR), 2015: 1–15
Yang H, Quehl B, Sack H (2014) A framework for improved video text detection and recognition[J]. Multimed Tools Appl 69(1):217–245
Article Google Scholar
Zhao M, Li S, Kwok J (2010) Text detection in images using sparse representation with discriminative dictionaries. Image Vision Comput 28(12):1590–1599
Article Google Scholar

Download references

Acknowledgments

This work is partly supported by the National Natural Science Foundation of China (Grant Nos. 61172184, 61173122, 61174210, 61379107, and 61402539), Key Project of Hunan Provincial Natural Science Foundation of China (Grant No. 12JJ2038), Program for New Century Excellent Talents in University of Education Ministry in China (Grant No. NCET-13-0603), Specialized Research Fund for the Doctoral Program of Higher Education in China (Grant No. 20130162110016), Program for Hunan Province Science and Technology Basic Construction (Grant No. 20131199), and China Postdoctoral Science Foundation (Grant No. 2012 M521554), the Fundamental Research Funds for the Central Universities of Central South University (Grant No. 2015zzts052).

Author information

Authors and Affiliations

School of Information Science and Engineering, Central South University, Changsha, 410083, China
Hui Wu, Bei-ji Zou, Yu-qian Zhao & Hong-pu Fu
Mobile Health Ministry of Education, China Mobile Joint Laboratory, Changsha, Hunan, 410012, China
Hui Wu, Bei-ji Zou, Yu-qian Zhao & Hong-pu Fu
School of Geosciences and Info-Physics, Central South University, Changsha 410083, China
Yu-qian Zhao

Authors

Hui Wu
View author publications
You can also search for this author in PubMed Google Scholar
Bei-ji Zou
View author publications
You can also search for this author in PubMed Google Scholar
Yu-qian Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Hong-pu Fu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yu-qian Zhao.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Wu, H., Zou, Bj., Zhao, Yq. et al. An automatic video text detection method based on BP-adaboost. Multimed Tools Appl 75, 7715–7738 (2016). https://doi.org/10.1007/s11042-015-2690-6

Download citation

Received: 21 October 2014
Revised: 20 March 2015
Accepted: 12 May 2015
Published: 14 August 2015
Issue Date: July 2016
DOI: https://doi.org/10.1007/s11042-015-2690-6

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

An automatic video text detection method based on BP-adaboost

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Decade research on text detection in images/videos: a review

Multi-oriented Text Detection from Video Using Sub-pixel Mapping

Automatic video superimposed text detection based on Nonsubsampled Contourlet Transform

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

Navigation

An automatic video text detection method based on BP-adaboost

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Decade research on text detection in images/videos: a review

Multi-oriented Text Detection from Video Using Sub-pixel Mapping

Automatic video superimposed text detection based on Nonsubsampled Contourlet Transform

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now

Search

Navigation