Automatic Watermeter Digit Recognition on Mobile Devices

Gao, Yunze; Zhao, Chaoyang; Wang, Jinqiao; Lu, Hanqing

doi:10.1007/978-981-10-8530-7_9

Yunze Gao^12,13,
Chaoyang Zhao^12,13,
Jinqiao Wang^12,13 &
…
Hanqing Lu^12,13

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 819))

Included in the following conference series:

International Conference on Internet Multimedia Computing and Service

1505 Accesses
9 Citations

Abstract

Automatic watermeter digit recognition in the wild is a challenging task, which is an application of scene text recognition in the field of computer vision. In this paper, we propose an automatic watermeter digit recognition approach on mobile devices which consists of digit detection and recognition. Specifically, we adopt Adaboost with aggregated channel features (ACF) to detect watermeter digital regions, where the computation is accelerated by the fast feature pyramid technology. Then a small attention bidirectional long short-term memory (BLSTM) is designed for end-to-end digit sequence recognition. Convolutional Neural network (CNN) is exploited to extract discriminative feature and BLSTM is able to capture the rich context in both directions within sequence data. Moreover, an attention mechanism is added to weight the most important part of incoming image features. We validate the performace of our approach on the collected complex dataset. It contains various watermeter images in real scenario which has illumination changes, messy environment, half-digit and blurring. It is observed that the proposed algorithm outperforms existing methods. Our approach runs 10 fps with 96.1% accuracy on HUAWEI Mate 8.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 107.00; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Liu, Y., Han, Y.-B., Zhang, Y.-L.: Image type water meter character recognition based on Embedded DSP. arXiv preprint arXiv:1508.06725 (2015)
Xiao-ping, R., Xian-feng, S.: A character recognition algorithm adapt to a specific kind of water meter. In: 2009 WRI World Congress on Computer Science and Information Engineering, vol. 5, pp. 632–636 (2009)
Google Scholar
Neumann, L., Matas, J.: Real-time scene text localization and recognition. In: 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3538–3545. IEEE (2012)
Google Scholar
Jaderberg, M., Simonyan, K., Vedaldi, A., Zisserman, A.: Synthetic data and artificial neural networks for natural scene text recognition. arXiv preprint arXiv:1406.2227 (2014)
Wang, T., Wu, D.J., Coates, A., Ng, A.Y.: End-to-end text recognition with convolutional neural networks. In: 2012 21st International Conference on, Pattern Recognition (ICPR), pp. 3304–3308 (2012)
Google Scholar
Bissacco, A., Cummins, M., Netzer, Y., Neven, H.: Photoocr: reading text in uncontrolled conditions. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 785–792 (2013)
Google Scholar
Shi, B., Bai, X., Yao, C.: An end-to-end trainable neural network for image-based sequence recognition and its application to scene text recognition. IEEE Trans. Pattern Anal. Mach. Intell. 39, 2298–2304 (2016)
Article Google Scholar
Lee, C.-Y., Osindero, S.: Recursive recurrent nets with attention modeling for OCR in the wild. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2231–2239 (2016)
Google Scholar
Dollár, P., Appel, R., Belongie, S., Perona, P.: Fast feature pyramids for object detection. IEEE Trans. Pattern Anal. Mach. Intell. 36(8), 1532–1545 (2014)
Article Google Scholar
Friedman, J., Hastie, T., Tibshirani, R.: Additive logistic regression: a statistical view of boosting. Ann. Stat. 28(2), 337–407 (2000)
Article MathSciNet MATH Google Scholar
Graves, A., Mohamed, A.-R., Hinton, G.: Speech recognition with deep recurrent neural networks. In: 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 6645–6649 (2013)
Google Scholar
Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)
Article Google Scholar
Shi, B., Wang, X., Lyu, P., Yao, C., Bai, X.: Robust scene text recognition with automatic rectification. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4168–4176 (2016)
Google Scholar
Chorowski, J.K., Bahdanau, D., Serdyuk, D., Cho, K., Bengio, Y.: Attention-based models for speech recognition. In: Advances in Neural Information Processing Systems, pp. 577–585 (2015)
Google Scholar
Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, pp. 1097–1105 (2012)
Google Scholar
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2005, vol. 1, pp. 886–893 (2005)
Google Scholar
Ahonen, T., Hadid, A., Pietikäinen, M.: Face recognition with local binary patterns. In: European Conference on Computer Vision, pp. 469–481 (2004)
Google Scholar

Download references

Author information

Authors and Affiliations

National Lab of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing, China
Yunze Gao, Chaoyang Zhao, Jinqiao Wang & Hanqing Lu
University of Chinese Academy of Sciences, Beijing, 100190, China
Yunze Gao, Chaoyang Zhao, Jinqiao Wang & Hanqing Lu

Authors

Yunze Gao
View author publications
You can also search for this author in PubMed Google Scholar
Chaoyang Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Jinqiao Wang
View author publications
You can also search for this author in PubMed Google Scholar
Hanqing Lu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yunze Gao .

Editor information

Editors and Affiliations

Multimedia Communications Department, EURECOM, Sophia Antipolis, France
Benoit Huet
Shandong University , Qingdao, China
Liqiang Nie
Hefei University of Technology , Hefei, China
Richang Hong

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Gao, Y., Zhao, C., Wang, J., Lu, H. (2018). Automatic Watermeter Digit Recognition on Mobile Devices. In: Huet, B., Nie, L., Hong, R. (eds) Internet Multimedia Computing and Service. ICIMCS 2017. Communications in Computer and Information Science, vol 819. Springer, Singapore. https://doi.org/10.1007/978-981-10-8530-7_9

Download citation

DOI: https://doi.org/10.1007/978-981-10-8530-7_9
Published: 01 March 2018
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-8529-1
Online ISBN: 978-981-10-8530-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics