Abstract
Text detection in video/images is challenging due to the presence of multiple blur caused by defocus and motion. In this paper, we present a new method for detecting texts in blurred/non-blurred images. Unlike the existing methods that use deblurring or classifiers, the proposed method estimates degree of blur in images based on contrast variations in neighbor pixels and a low pass filter, which results in candidate pixels for deblurring. We consider gradient values of each pixel as the weight for the degree of blur. The proposed method then performs K-means clustering on weighted values of candidate pixels to get text candidates irrespective of blur types. Next, Bhattacharyya distance is used to extract symmetry property of texts to remove false text candidates, which provides text components. Further, the proposed method fixes bounding box for each text component based on the nearest neighbor criteria and direction of the text component. Experimental results on defocus, motion, non-blurred images and standard datasets of curved text show that the proposed method outperforms the existing methods.
Similar content being viewed by others
References
Cao S, Ren W, Zuo W, Guo X, Foroosh H (2015) Scene text deblurring using text specific multiscale dicionaries. IEEE Trans Image Processing 24:1302–1314
M. G. Chun and S. G. Kong, Focusing in thermal imagery using morphological gradient operator, Pattern Recogn Lett, Vo. 38, 2014, pp 20–25.
Huang GB, Zhu QY, Siew CK (2006) Extreme learning machine: theory and applications. Neurocomputing:489–501
Keerthi S, Shevade SK, Bhattacharyya C, Murthy KRK (2000) A fast iterative nearest point algorithm for support vector machine classifier design. IEEE Trans. NN, pp124–136
Khare V, Shivakumara P, Raveendran P, Blumenstein M (2016) A blind deconvolutional model for scene text detection and recognition. Pattern Recogn 54:128–148
Khare V, Shivakumara P, Kumar A, Chan CS, Lu T, Blumenstien M (2016) A quad tree based method for blurred and non-blurred video text frames classification through quality metrics, In Proc ICPR, pp 4023–4028
Khare V, Shivakumara P, Paramesran R, Blumenstein M (2017) Arbitrarily-oriented multi-lingual text detection in video. Mutlimedia Tools and Applications 76:16625–16655
Lee H, Kim C (2014) Blurred image region detection and segmentation, In Proc ICIP, pp 4427–4431
Liao M, Shi B, Bai X, Wang X, Liu W (2017) Textboxes: a fast text detector with a single deep neural network. In Proc, AAAI
Liu Y, Jin L (2017) Deep matching prior network: toward tighter multi-oriented text detection. In Proc. CVPR:3454–3461
Liu J, Su H, Yi Y, Hu W (2016) Robust text detection via multi-degree of sharpening and blurring. Signal Process 124:259–265
Nwe TL, Hieu NT, Limbu DK (2013) Bhattacharyya distance based emotional dissimilarity measurs for emotion classification, In Proc. ICASSP, pp7512–7516
Risnumawan A, Shivakumara P, Chan CS, Tan CL (2014) A robust arbitrary text detection system for natural scene images. Expert Syst Appl 41:8027–8048
Semwal VB, Mondal K, Nandi GC (2017) Robust and accurate feature selection for humanoid push recovery and classification: deep learning approach. Neural Comput & Applic, pp 565–574
Semwal VB, Gaud N, Nandi GC (2019) Human gait state prediction using cellular automata and classification using ELM. In Proc MISP, pp 135–145
Shi B, Bai X, Belongie S (2017) Detecting oriented text in natural images by linking segments, In Proc. CVPR, pp 3482–3490
Tian Z, Huang W, He T, He P, Qiao Y (2016) Detecting text in natural image with connectionist text proposal network, In Proc. ECCV, pp 56–72
Tian Z, Huang W, He T, He P, Qiao Y (2016) Detecting text in natural image with connectionist text proposal network, In Proc. ECCV, pp 56–72
Veit A, Matera T, Neumann L, Matas J, Belongie S (2017) COCO-Text: Dataset and Benchmark for text detection and recognition in natural scene images, arXiv:1601.07140v2
Wang X, Song Y, Zhang Y, Xin J (2017) A hierachical recursive method for text detection in natural scene images. Multimed Tools Appl 76:26201–26223
Wei Y, Zhang Z, Shen W, Zeng D, Fang M, Zhou S (2017) Text detection in scene images based on exhaustive segmentation. Signal Processing: Communication 50:1–8
Yao C, Bai X, Liu W, Ma Y, Tu Z (2012) Detecting texts of arbitrary orientations in natural images. In Proc. CVPR, pp. 1083–1090
Zhang Z, Zhang C, Shen W, Yao C, Liu W, Bai X (2016) Multi-oriented text detection with fully convolutional networks, In Proc CVPR, pp 4159–4167
Zhang X, Gao X, Tian C (2018) Text detection in natural scene images based on color prior guided MSER. Neurocomputing 307:61–71
Zhao F, Yang Y, Zhang HY, Yang LL, Zhang L (2018) Sign text detection in street view images using an integrated features. Multimed Tools Appl:1–28
Zhou X, Yao C, Wen H, Wang Y, Zhou S, He W (2017) EAST: an efficient and accurate scene text detector, In Proc. CVPR, pp 2645–2651
Acknowledgments
The work described in this paper was supported by the Natural Science Foundation of China under Grant No. 61672273 and No. 61272218, and the Science Foundation for Distinguished Young Scholars of Jiangsu under Grant No. BK20160021. This work is also partly supported by University of Malaya under Grant No: UM.0000520/HRU.BK (BKS003-2018).
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Xue, M., Shivakumara, P., Zhang, C. et al. Curved text detection in blurred/non-blurred video/scene images. Multimed Tools Appl 78, 25629–25653 (2019). https://doi.org/10.1007/s11042-019-7721-2
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-019-7721-2