Skip to main content
Log in

Detonator coded character spotting based on convolutional neural networks

  • Original Paper
  • Published:
Signal, Image and Video Processing Aims and scope Submit manuscript

Abstract

To facilitate the management of detonators, an automatic spotting system is proposed for detonator coded characters based on convolutional neural networks. The system contains a multi-scale detection network, a multi-label recognition network and a post-processing layer. An improved fully convolution network (FCN) is designed as the multi-scale detection network to obtain the accurate response map of the detonator image. Two subnetworks are parallel integrated into the FCN to perform a coarse-to-fine detection. An improved Jaccard loss function with a regularization term is defined to train the FCN. The region of interest (ROI) of the detonator image is achieved when the response map is post-processed by the post-processing layer. Finally, a modified multi-label network is used to recognize the detonator coded characters in the ROI. The experimental results indicate that the proposed system achieves a better spotting performance for detonator coded characters than the state-of-the-art text spotting methods in terms of accuracy and efficiency.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

References

  1. Sun, M., Zhang, L., Gao, H.: The design of recognition system for numbers of detonators based on support vector machine. J. Wuhan Univ. 59(3), 245–248 (2013)

    Google Scholar 

  2. Zhu, A., Wang, G., Dong, Y.: Robust text segmentation in low quality images via adaptive stroke width estimation and stroke based superpixel grouping. In: 12th Asian Conference on Computer Vision, pp. 119–133. Springer (2014)

  3. Gopalan, C., Manjula, D.: Statistical modeling for the detection, localization and extraction of text from heterogeneous textual images using combined feature scheme. SIViP 5(2), 165–183 (2011)

    Article  Google Scholar 

  4. Nirmala, S., Nagabhushan, P.: Foreground text segmentation in complex color document images using Gabor filters. SIViP 6(4), 669–678 (2012)

    Article  Google Scholar 

  5. Hadjadj, Z., Cheriet, M., Meziane, A., Cherfa, Y.: A new efficient binarization method: application to degraded historical document images. SIViP 11(6), 1155–1162 (2017)

    Article  Google Scholar 

  6. Wang, T., Wu, D.J., Coates, A., Ng, A.Y.: End-to-end text recognition with convolutional neural networks. In: 21st International Conference on Pattern Recognition (ICPR2012), pp. 3304–3308. IEEE (2012)

  7. Jaderberg, M., Vedaldi, A., Zisserman, A.: Deep features for text spotting. In: 13th European Conference on Computer Vision, pp. 512–528. Springer (2014)

  8. He, T., Huang, W., Qiao, Y., Yao, J.: Text-attentional convolutional neural network for scene text detection. IEEE Trans. Image Process. 25(6), 2529–2541 (2016)

    Article  MathSciNet  Google Scholar 

  9. Liao, M., Shi, B., Bai, X., Wang, X., Liu, W.: Textboxes: A fast text detector with a single deep neural network. In: 31st AAAI Conference on Artificial Intelligence, pp. 4161–4167. AAAI (2017)

  10. Tian, Z., Huang, W., He, T., He P., Qiao, Y.: Detecting text in natural image with connectionist text proposal network. In: 15th European Conference on Computer Vision, pp. 56–72. Springer (2016)

  11. Zhang, Z., Zhang, C., Shen, W., Yao, C., Liu, W., Bai, X.: Multi-oriented text detection with fully convolutional networks. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 4159–4167. IEEE (2016)

  12. Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.Y., Berg, A.C.: SSD: Single shot multibox detector. In: 15th European Conference on Computer Vision, pp. 21–37. Springer (2016)

  13. Cai, J., Lu, L., Xie, Y., Xing, F., Yang, L.: Improving deep pancreas segmentation in CT and MRI images via recurrent neural contextual learning and direct loss function. In: International Conference on Medical Image Computing and Computer-Assisted Intervention, pp. 674–682. Springer (2017)

  14. Otsu, N.: A threshold selection method from gray-level histograms. IEEE Trans. Syst. Man Cybern. 9(1), 62–66 (2007)

    Article  MathSciNet  Google Scholar 

  15. Jaderberg, M., Simonyan, K., Vedaldi, A., Zisserman, A.: Synthetic data and artificial neural networks for natural scene text recognition. arXiv: 1406.2227 (2014)

  16. Jia, Y., Shelhamer, E., Donahue, J., Karayev, S., Long, J., Girshick, R., Darrell, T.: Caffe: convolutional architecture for fast feature embedding. In: 22nd ACM International Conference on Multimedia, pp. 675–678. ACM (2014)

  17. Long, J., Shelhamer, E., Darrell, T.: Fully convolutional networks for semantic segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 39(4), 640–651 (2014)

    Google Scholar 

  18. Xie, S., Tu, Z.: Holistically-nested edge detection. In: IEEE International Conference on Computer Vision, pp. 1395–1403. IEEE (2015)

Download references

Acknowledgements

This work was in part supported by the National Natural Science Foundation of China (Grant No. 61001179) and the Key Project of Industry-University-Research Collaborative Innovation in Guangzhou, China (No. 201802020010).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Nian Cai.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Cen, G., Cai, N., Wu, J. et al. Detonator coded character spotting based on convolutional neural networks. SIViP 14, 67–75 (2020). https://doi.org/10.1007/s11760-019-01525-1

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11760-019-01525-1

Keywords

Navigation