Skip to main content
Log in

Robust and parallel Uyghur text localization in complex background images

  • Special Issue Paper
  • Published:
Machine Vision and Applications Aims and scope Submit manuscript

Abstract

Uyghur text localization in complex background images is a significant research for Uyghur image content analysis. In this paper, we propose a robust Uyghur text localization method in complex background images and provide a CPU–GPU heterogeneous parallelization scheme. Firstly, a multi-color-channel enhanced maximally stable extremal region is used to extract components in images, which is robust to blur and low contrast. Secondly, a two-stage component classification system is used to filter out non-text components. Finally, a component connected graph algorithm is proposed to construct text lines. Experiments on the proposed dataset demonstrate that our algorithm compares favorably with the state-of-the-art algorithms when handling Uyghur texts. Besides, the heterogeneous parallel implementation achieves 12.5 times speedup.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16

Similar content being viewed by others

References

  1. Xie, H., Gao, K., Zhang, Y., Li, J., Liu, Y.: Pairwise weak geometric consistency for large scale image search. In: Proceedings of the 1st ACM International Conference on Multimedia Retrieval, p. 42. ACM (2011)

  2. Xie, H., Gao, K., Zhang, Y., Li, J., Ren, H.: Common visual pattern discovery via graph matching. In: Proceedings of the 19th ACM International Conference on Multimedia, pp. 1385–1388. ACM (2011)

  3. Huang, W., Qiao, Y., Tang, X.: Robust scene text detection with convolution neural network induced mser trees. In: European Conference on Computer Vision, pp. 497–511. Springer (2014)

  4. Yin, X.-C., Pei, W.-Y., Zhang, J., Hao, H.-W.: Multi-orientation scene text detection with adaptive clustering. IEEE Trans. Pattern Anal. Mach. Intell. 37(9), 1930–1937 (2015)

    Article  Google Scholar 

  5. Neumann, L., Matas, J.: Real-time scene text localization and recognition. In: 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3538–3545. IEEE (2012)

  6. Yin, X.-C., Yin, X., Huang, K., Hao, H.-W.: Robust text detection in natural scene images. IEEE Trans. Pattern Anal. Mach. Intell. 36(5), 970–983 (2014)

    Article  Google Scholar 

  7. Xie, H., Gao, K., Zhang, Y., Tang, S., Li, J., Liu, Y.: Efficient feature detection and effective post-verification for large scale near-duplicate image search. IEEE Trans. Multimed. 13(6), 1319–1332 (2011)

    Article  Google Scholar 

  8. Xie, H., Zhang, Y., Gao, K., Tang, S., Kefu, X., Guo, L., Li, J.: Robust common visual pattern discovery using graph matching. J. Vis. Commun. Image Represent. 24(5), 635–646 (2013)

    Article  Google Scholar 

  9. Xie, H., Zhang, Y., Tan, J., Guo, L., Li, J.: Contextual query expansion for image retrieval. IEEE Trans. Multimed. 16(4), 1104–1114 (2014)

    Article  Google Scholar 

  10. Liu, W., Mei, T., Zhang, Y., Che, C., Luo, J.: Multi-task deep visual-semantic embedding for video thumbnail selection. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 3707–3715 (2015)

  11. Liu, W., Mei, T., Zhang, Y.: Instant mobile video search with layered audio–video indexing and progressive transmission. IEEE Trans. Multimed. 16(8), 2242–2255 (2014)

    Article  Google Scholar 

  12. Liu, W., Zhang, Y., Tang, S., Tang, J., Hong, R., Li, J.: Accurate estimation of human body orientation from rgb-d sensors. IEEE Trans. Cybern. 43(5), 1442 (2013)

    Article  Google Scholar 

  13. Liu, W., Ma, H., Qi, H., Zhao, D., Chen, Z.: Deep learning hashing for mobile visual search. EURASIP J. Image Video Process. 2017(1), 17 (2017)

    Article  Google Scholar 

  14. Ye, Q., Doermann, D.: Text detection and recognition in imagery: a survey. IEEE Trans. Pattern Anal. Mach. Intell 37(7), 1480–1500 (2015)

    Article  Google Scholar 

  15. Kim, K.I., Jung, K., Kim, J.H.: Texture-based approach for text detection in images using support vector machines and continuously adaptive mean shift algorithm. IEEE Trans. Pattern Anal. Mach. Intell. 25(12), 1631–1639 (2003)

    Article  Google Scholar 

  16. Chen, X., Yuille, A.L.: Detecting and reading text in natural scenes. In: Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004. vol. 2, pp. II–366. IEEE (2004)

  17. Hanif, S.M., Prevost, L.: Text detection and localization in complex scene images using constrained adaboost algorithm. In: 2009 10th International Conference on Document Analysis and Recognition, pp. 1–5. IEEE (2009)

  18. Lee, J.-J., Lee, P.-H., Lee, S.-W., Yuille, A.L., Koch, C.: Adaboost for text detection in natural scene. In: ICDAR, pp. 429–434 (2011)

  19. Epshtein, B., Ofek, E., Wexler, Y.: Detecting text in natural scenes with stroke width transform. In: 2010 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2963–2970. IEEE (2010)

  20. Yao, C.: Detecting texts of arbitrary orientations in natural images. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1083–1090 (2012)

  21. Cho, H., Sung, M., Jun, B.: Canny text detector: fast and robust scene text localization algorithm. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3566–3573 (2016)

  22. Huang, W., Lin, Z., Yang, J., Wang, J.: Text localization in natural images using stroke feature transform and text covariance descriptors. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 1241–1248 (2013)

  23. Chen, H., Tsai, S.S., Schroth, G., Chen, D.M., Grzeszczuk, R., Girod, B.: Robust text detection in natural images with edge-enhanced maximally stable extremal regions. In: 2011 18th IEEE International Conference on Image Processing, pp. 2609–2612. IEEE (2011)

  24. Shi, C., Wang, C., Xiao, B., Zhang, Y., Gao, S.: Scene text detection using graph model built upon maximally stable extremal regions. Pattern Recognit. Lett. 34(2), 107–116 (2013)

    Article  Google Scholar 

  25. Zamberletti, A., Noce, L., Gallo, I.: Text localization based on fast feature pyramids and multi-resolution maximally stable extremal regions. In: Asian Conference on Computer Vision, pp. 91–105. Springer (2014)

  26. Neumann, L., Matas, J.: Text localization in real-world images using efficiently pruned exhaustive search. In: 2011 International Conference on Document Analysis and Recognition, pp. 687–691. IEEE (2011)

  27. Neumann, L., Matas, J.: A method for text localization and recognition in real-world images. In: Asian Conference on Computer Vision, pp. 770–783. Springer, Berlin (2010)

  28. Sun, L., Huo, Q., Jia, W., Chen, K.: A robust approach for text detection from natural scene images. Pattern Recognit. 48(9), 2906–2920 (2015)

    Article  Google Scholar 

  29. Shahab, A., Shafait, F., Dengel, A.: ICDAR 2011 robust reading competition challenge 2: reading text in scene images. In: International conference on document analysis and recognition (ICDAR), pp. 1491–1496. doi:10.1109/ICDAR.2011.296

  30. Jaderberg, M., Vedaldi, A., Zisserman, A.: Deep Features for Text Spotting. Springer International Publishing, Berlin (2014)

    Book  Google Scholar 

  31. Jaderberg, M., Simonyan, K., Vedaldi, A., Zisserman, A.: Reading text in the wild with convolutional neural networks. Int. J. Comput. Vis. 116(1), 1–20 (2016)

    Article  MathSciNet  Google Scholar 

  32. Zhang, Z., Shen, W., Yao, C., Bai, X.: Symmetry-based text line detection in natural scenes. In: Computer Vision and Pattern Recognition, pp. 2558–2567 (2015)

  33. He, T., Huang, W., Qiao, Y., Yao, J.: Text-attentional convolutional neural network for scene text detection. IEEE Trans. Image Process. 25(6), 2529–2541 (2016)

    Article  MathSciNet  Google Scholar 

  34. Bai, J., Chen, Z., Feng, B., Xu, B.: Chinese image text recognition on grayscale pixels. In: 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 1380–1384. IEEE (2014)

  35. Bai, J., Chen, Z., Feng, B., Xu, B.: Image character recognition using deep convolutional neural network learned from different languages. In: 2014 IEEE International Conference on Image Processing (ICIP), pp. 2560–2564. IEEE (2014)

  36. Moradi, M., Mozaffari, S., Orouji, A.A.: Farsi/arabic text extraction from video images by corner detection. In: 2010 6th Iranian Conference on Machine Vision and Image Processing, pp. 1–6. IEEE (2010)

  37. Zayene, O., Hennebert, J., Touj, S.M., Ingold, R., Ben, A., Najoua, E.: A dataset for arabic text detection, tracking and recognition in news videos-activ. In: 2015 13th International Conference on Document Analysis and Recognition (ICDAR), pp. 996–1000. IEEE (2015)

  38. Matas, J., Chum, O., Urban, M., Pajdla, T.: Robust wide-baseline stereo from maximally stable extremal regions. Image Vis. Comput. 22(10), 761–767 (2004)

    Article  Google Scholar 

  39. Donoser, M., Bischof, H.: Efficient maximally stable extremal region (mser) tracking. In: 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 553–560 (2006)

  40. Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’05), vol. 1, pp. 886–893. IEEE (2005)

  41. Wolf, C., Jolion, J.-M.: Object count/area graphs for the evaluation of object detection and segmentation algorithms. Int. J. Doc. Anal. Recognit. (IJDAR) 8(4), 280–296 (2006)

    Article  Google Scholar 

Download references

Acknowledgements

This work is supported by the National Nature Science Foundation of China (61303171, 61303175), the “Strategic Priority Research Program” of the Chinese Academy of Sciences (XDA06031000) and Natural Science Foundation of Hunan Province (2016JJ2005).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hongtao Xie.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Song, Y., Chen, J., Xie, H. et al. Robust and parallel Uyghur text localization in complex background images. Machine Vision and Applications 28, 755–769 (2017). https://doi.org/10.1007/s00138-017-0837-3

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00138-017-0837-3

Keywords

Navigation