Skip to main content
Log in

An automatic video text detection method based on BP-adaboost

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Video text usually provides us a lot of useful information that is important for video analysis, indexing and retrieval. However, it is still a challenging work to detect text from video images due to variation of text patterns and complexity of background. In this paper, an automatic video text detection method is proposed. Firstly, K-means is utilized to classify pixels in gradient images into text and non-text regions. Subsequently, morphological operations are performed on text regions to form connected candidate text components, followed by projection profile boundary refinement. Finally, the detection results are verified by geometry and BP-Adaboost identifications. The experimental results on our manually selected dataset and the publicly available Microsoft Asia dataset show the effectiveness and feasibility of the proposed method.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16

Similar content being viewed by others

References

  1. Cai M, Song J, Lyu MR (2002) A new approach for video text detection. In: Proceedings of IEEE International Conference on Image Processing, pp I-117

  2. Gui W, Liu J, Yang C, Chen N, Liao X (2013) Color co-occurrence matrix based froth image texture extraction for mineral flotation. Miner Eng 46:60–67

    Article  Google Scholar 

  3. Haralick RM, Shanmugam K, Dinstein IH (1973) Textural features for image classification. IEEE Trans Syst Man Cybern 6:610–621

    Article  Google Scholar 

  4. He M, Yang C, Wang X, Gui W, Wei L (2013) Nonparametric density estimation of froth colour texture distribution for monitoring sulphur flotation process. Miner Eng 53:203–212

    Article  Google Scholar 

  5. Hua XS, Wenyin L, Zhang HJ (2004) An automatic performance evaluation protocol for video text detection algorithms. IEEE Trans Circ Syst Vid 14(4):498–507

    Article  Google Scholar 

  6. Kim W, Kim C (2009) A new approach for overlay text detection and extraction from complex video scene. IEEE Trans Image Process 18(2):401–411

    Article  MathSciNet  Google Scholar 

  7. Li Z, Liu G, Qian X, Guo D, Jiang H (2011) Effective and efficient video text extraction using key text points. IET Image Process 5(8):671–683

    Article  MathSciNet  Google Scholar 

  8. Liu X, Wang W (2012) Robustly extracting captions in videos based on stroke-like edges and spatio-temporal analysis. IEEE Trans Multimed 14(2):482–489

    Article  Google Scholar 

  9. Liu C, Wang C, Dai R (2005) Text detection in images based on unsupervised classification of edge-based features. In: Proceedings of IEEE International Conference on Document Analysis and Recognition, pp 610–614

  10. Mariano VY, Kasturi R (2000) Locating uniform-colored text in video frames. In: Proceedings of IEEE International Conference on Pattern Recognition, pp 539–542

  11. Phan TQ, Shivakumara P, Tan CL (2009) A Laplacian method for video text detection. In: Proceedings of IEEE International Conference on Document Analysis and Recognition, pp 66–70

  12. Qian X, Wang H, Hou X (2014) Video text detection and localization in intra-frames of H. 264/AVC compressed video[J]. Multimed Tools Appl 70(3):1487–1502

    Article  Google Scholar 

  13. Shivakumara P, Phan TQ, Tan CL (2011) A Laplacian approach to multi-oriented text detection in video. IEEE Trans Pattern Anal Mach Intell 33(2):412–419

    Article  Google Scholar 

  14. Shivakumara P, Sreedhar RP, Phan TQ, Lu S, Tan CL (2012) Multioriented video scene text detection through Bayesian classification and boundary growing. IEEE Trans Circ Syst Vid 22(8):1227–1235

    Article  Google Scholar 

  15. Suzuki K, Horiba I, Sugie N (2003) Linear-time connected-component labeling based on sequential local operations. Comput Vis Image Und 89(1):1–23

    Article  MATH  Google Scholar 

  16. Wei YC, Lin CH (2012) A robust video text detection approach using SVM. Expert Syst Appl 39(12):10832–10840

    Article  Google Scholar 

  17. Wong EK, Chen M (2003) A new robust algorithm for video text extraction. Pattern Recognit 36(6):1397–1406

    Article  MATH  Google Scholar 

  18. Wu Y, Shivakumara P, Wei W, et al. A new ring radius transform-based thinning method for multi-oriented video characters [J]. Int J Doc Anal Recog (IJDAR), 2015: 1–15

  19. Yang H, Quehl B, Sack H (2014) A framework for improved video text detection and recognition[J]. Multimed Tools Appl 69(1):217–245

    Article  Google Scholar 

  20. Zhao M, Li S, Kwok J (2010) Text detection in images using sparse representation with discriminative dictionaries. Image Vision Comput 28(12):1590–1599

    Article  Google Scholar 

Download references

Acknowledgments

This work is partly supported by the National Natural Science Foundation of China (Grant Nos. 61172184, 61173122, 61174210, 61379107, and 61402539), Key Project of Hunan Provincial Natural Science Foundation of China (Grant No. 12JJ2038), Program for New Century Excellent Talents in University of Education Ministry in China (Grant No. NCET-13-0603), Specialized Research Fund for the Doctoral Program of Higher Education in China (Grant No. 20130162110016), Program for Hunan Province Science and Technology Basic Construction (Grant No. 20131199), and China Postdoctoral Science Foundation (Grant No. 2012 M521554), the Fundamental Research Funds for the Central Universities of Central South University (Grant No. 2015zzts052).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yu-qian Zhao.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wu, H., Zou, Bj., Zhao, Yq. et al. An automatic video text detection method based on BP-adaboost. Multimed Tools Appl 75, 7715–7738 (2016). https://doi.org/10.1007/s11042-015-2690-6

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-015-2690-6

Keywords

Navigation