Abstract
Text information in natural scene images is important for various kinds of applications. In this paper a novel method based on stroke width to detect text in unconstrained natural scene images is proposed. Firstly, we use the stroke width transform to generate a rough estimation of stroke width map, then use K-Means clustering and the elbow method to find some specific stroke width values that are both dominant and consistent. Secondly, in order to generate better edge detection and gradient direction results we use these specific stroke width values as the size parameters in the superpixel algorithm to generate smooth and uniform region boundaries. Finally, we try to refine the stroke width map and recover valid edge pixels by applying stroke width regularized constraints on the improved edge detection and gradient direction results computed from these region boundaries. Our method was evaluated on three benchmark datasets: ICDAR 2005, 2011 and 2013, and the experimental results show that it achieves state-of-the-art performance.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Achanta, R., Shaji, A., Smith, K., Lucchi, A., Fua, P., Süsstrunk, S.: SLIC superpixels. École Polytechnique Fédéral de Lausssanne (EPFL), Tech. Rep. 149300 (2010)
Chen, H., Tsai, S.S., Schroth, G., Chen, D.M., Grzeszczuk, R., Girod, B.: Robust text detection in natural images with edge-enhanced maximally stable extremal regions. In: 2011 18th IEEE International Conference on Image Processing (ICIP), pp. 2609–2612. IEEE (2011)
Epshtein, B., Ofek, E., Wexler, Y.: Detecting text in natural scenes with stroke width transform. In: 2010 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2963–2970. IEEE (2010)
Fabrizio, J., Cord, M., Marcotegui, B.: Text extraction from street level images. City Models, Roads and Traffic (CMRT) 3 (2009)
Huang, W., Lin, Z., Yang, J., Wang, J.: Text localization in natural images using stroke feature transform and text covariance descriptors. In: Proc. IEEE Int. Conf. Comp. Vis., pp. 1241–1248 (2013)
Karatzas, D., Shafait, F., Uchida, S., Iwamura, M., Mestre, S.R., Mas, J., Mota, D.F., Almazan, J.A., de las Heras, L.P., et al.: ICDAR 2013 robust reading competition. In: 2013 12th International Conference on Document Analysis and Recognition (ICDAR), pp. 1484–1493. IEEE (2013)
Lucas, S.M.: ICDAR 2005 text locating competition results. In: Proceedings of the Eighth International Conference on Document Analysis and Recognition 2005, pp. 80–84. IEEE (2005)
Neumann, L., Matas, J.: Text localization in real-world images using efficiently pruned exhaustive search. In: 2011 International Conference on Document Analysis and Recognition (ICDAR), pp. 687–691. IEEE (2011)
Neumann, L., Matas, J.: Real-time scene text localization and recognition. In: 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3538–3545. IEEE (2012)
Shahab, A., Shafait, F., Dengel, A.: ICDAR 2011 robust reading competition challenge 2: Reading text in scene images. In: 2011 International Conference on Document Analysis and Recognition (ICDAR), pp. 1491–1496. IEEE (2011)
Shi, C., Wang, C., Xiao, B., Zhang, Y., Gao, S.: Scene text detection using graph model built upon maximally stable extremal regions. Pattern Recognition Letters 34(2), 107–116 (2013)
Shi, C., Wang, C., Xiao, B., Zhang, Y., Gao, S., Zhang, Z.: Scene text recognition using part-based tree-structured character detection. In: 2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2961–2968. IEEE (2013)
Yao, C., Bai, X., Liu, W., Ma, Y., Tu, Z.: Detecting texts of arbitrary orientations in natural images. In: 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1083–1090. IEEE (2012)
Yi, C., Tian, Y., et al.: Text extraction from scene images by character appearance and structure modeling. Computer Vision and Image Understanding 117(2), 182–194 (2013)
Yin, X., Huang, K., Hao, H.: Robust text detection in natural scene images. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(5), 970–983 (2013)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Liu, S., Zhou, Y., Zhang, Y., Wang, Y., Lin, W. (2014). Text Detection in Natural Scene Images with Stroke Width Clustering and Superpixel. In: Ooi, W.T., Snoek, C.G.M., Tan, H.K., Ho, CK., Huet, B., Ngo, CW. (eds) Advances in Multimedia Information Processing – PCM 2014. PCM 2014. Lecture Notes in Computer Science, vol 8879. Springer, Cham. https://doi.org/10.1007/978-3-319-13168-9_13
Download citation
DOI: https://doi.org/10.1007/978-3-319-13168-9_13
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-13167-2
Online ISBN: 978-3-319-13168-9
eBook Packages: Computer ScienceComputer Science (R0)