research-article

MHDT: A Deep-Learning-Based Text Detection Algorithm for Unstructured Data in Banking

Authors:
Shenglan Ma

Division of Science and Technology, Fujian Rural Credit Union, Fujian, China

Division of Science and Technology, Fujian Rural Credit Union, Fujian, China
View Profile

,
Lingling Yang

Division of Innovation, China UnionPay Co., Ltd, Fujian, China

Division of Innovation, China UnionPay Co., Ltd, Fujian, China
View Profile

,
Hao Wang

Department of Computer Science, Norwegian University of Sci. & Tech., Gjøvik, Norway

Department of Computer Science, Norwegian University of Sci. & Tech., Gjøvik, Norway
View Profile

,
Hong Xiao

College of Computer, Guangdong University of Technology, Guangzhou, China

College of Computer, Guangdong University of Technology, Guangzhou, China
View Profile

,
Hong-Ning Dai

Macau University of Sci. and Tech., Fujian, China

Macau University of Sci. and Tech., Fujian, China
View Profile

,
Shuhan Cheng

Division of Science and Technology, Fujian Rural Credit Union, Fujian, China

Division of Science and Technology, Fujian Rural Credit Union, Fujian, China
View Profile

,
Tongsen Wang

Division of Science and Technology, Fujian Rural Credit Union, Fujian, China

Division of Science and Technology, Fujian Rural Credit Union, Fujian, China
View Profile

ICMLC '19: Proceedings of the 2019 11th International Conference on Machine Learning and ComputingFebruary 2019Pages 295–300https://doi.org/10.1145/3318299.3318327

Published:22 February 2019Publication History

ICMLC '19: Proceedings of the 2019 11th International Conference on Machine Learning and Computing

Pages 295–300

ABSTRACT

Text detection in natural scene images becomes highly demanded for unstructured data in banking. In this paper, we propose a new deep learning algorithm called MSER, Hu-moment and Deep learning for Text detection (MHDT) based on Maximum Stable Extremal Regions (MSER) and Hu-moment features. Firstly, we extract MSERs as candidate characters. Secondly, a character classifier is introduced with Hu-moment features to reduce the number of input for clustering. After single linkage clustering, a text classifier trained from a Deep Brief Network is used to delete non-text. The proposed algorithm is evaluated on the ICDAR database, and the experimental results show that the proposed algorithm yields high precision and recall rate.

References

Balducci, Bitty, and Detelina Marinova. Unstructured data in marketing. Journal of the Academy of Marketing Science (2018): 1--34.Google Scholar
Edge, Darren, Jonathan Larson, and Christopher White. 2018. Bringing AI to BI: Enabling Visual Analytics of Unstructured Data in a Modern Business Intelligence Platform. Extended Abstracts of the 2018 CHI Conference on Human Factors in Computing Systems. ACM, 2018. Google ScholarDigital Library
Yin, Xu-Cheng, Xuwang Yin, Kaizhu Huang, and Hong-Wei Hao. 2014. Robust text detection in natural scene images. IEEE transactions on pattern analysis and machine intelligence 36, 5 (2014): 970--983.Google Scholar
Chen, Xiangrong, and Alan L. Yuille. 2004. Detecting and reading text in natural scenes. 2004. CVPR 2004. Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Vol. 2. IEEE, 2004. Google ScholarDigital Library
Lee, Jung-Jin, et al. 2011. Adaboost for text detection in natural scene. 2011 International Conference on Document Analysis and Recognition (ICDAR). IEEE, 2011. Google ScholarDigital Library
Yin, Xuwang, et al. 2012. Effective text localization in natural scene images with MSER, geometry-based grouping and AdaBoost. 2012 21st International Conference on Pattern Recognition (ICPR). IEEE, 2012.Google Scholar
Epshtein, Boris, Eyal Ofek, and Yonatan Wexler. 2010. Detecting text in natural scenes with stroke width transform. 2010 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, 2010.Google ScholarCross Ref
Yi, Chucai, and Yingli Tian. 2012. Localizing text in scene images by boundary clustering, stroke segmentation, and string fragment classification. IEEE Transactions on Image Processing 21, 9 (2012): 4256--4268. Google ScholarDigital Library
Yi, Chucai, and Yingli Tian. 2013. Text extraction from scene images by character appearance and structure modeling. Computer Vision and Image Understanding 117, 2 (2013): 182--194. Google ScholarDigital Library
Chen, Huizhong, et al. 2011. Robust text detection in natural images with edge-enhanced maximally stable extremal regions. 2011 18th IEEE International Conference on Image Processing (ICIP). IEEE, 2011.Google ScholarCross Ref
Liu, Weibo, et al. 2017. A survey of deep neural network architectures and their applications. Neurocomputing 234 (2017): 11--26.Google ScholarCross Ref
Kim, Yelin, Honglak Lee, and Emily Mower Provost. 2013. Deep learning for robust feature generation in audiovisual emotion recognition. 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2013.Google ScholarCross Ref
Zhou, Shusen, Qingcai Chen, and Xiaolong Wang. 2010. Discriminative deep belief networks for image classification. 2010 17th IEEE International Conference on Image Processing (ICIP), IEEE, 2010.Google ScholarCross Ref
Huang, Chenchen, et al. 2014. A research of speech emotion recognition based on deep belief network and SVM. Mathematical Problems in Engineering 2014 (2014).Google Scholar
Wang, Hai, Yingfeng Cai, and Long Chen. "A vehicle detection algorithm based on deep belief network." The scientific world journal 2014 (2014).Google Scholar
Matas, Jiri, et al. 2004. Robust wide-baseline stereo from maximally stable extremal regions. Image and vision computing 22, 10 (2004): 761--767.Google Scholar
Mikolajczyk, Krystian, et al. 2005. A comparison of affine region detectors. International journal of computer vision 65, 1--2 (2005): 43--72. Google ScholarDigital Library
Hu, Ming-Kuei. 1962. Visual pattern recognition by moment invariants. IRE transactions on information theory 8, 2 (1962): 179--187.Google Scholar
"Peak Noise to Signal Ratio". {online}. Available: http://en.wikipedia.org/wiki/Peak_signal-to-noise_ratioGoogle Scholar
Hinton, Geoffrey E., Simon Osindero, and Yee-Whye Teh. 2006. A fast learning algorithm for deep belief nets. Neural computation 18, 7 (2006): 1527--1554. Google ScholarDigital Library
Hinton, Geoffrey E. 2012. A practical guide to training restricted Boltzmann machines. Neural networks: Tricks of the trade. Springer, Berlin, Heidelberg, 2012. 599--619.Google Scholar
Shahab, Asif, Faisal Shafait, and Andreas Dengel. 2011. ICDAR 2011 robust reading competition challenge 2: Reading text in scene images. 2011 International Conference on Document Analysis and Recognition (ICDAR). IEEE, 2011. Google ScholarDigital Library
Breiman, Leo. Classification and regression trees. Routledge, 2017.Google ScholarCross Ref
Koo, Hyung Il, and Duck Hoon Kim. 2013. Scene text detection via connected component clustering and nontext filtering. IEEE transactions on image processing 22, 6 (2013): 2296--2305. Google ScholarDigital Library

Index Terms

MHDT: A Deep-Learning-Based Text Detection Algorithm for Unstructured Data in Banking
1. Computer systems organization
  1. Architectures
    1. Other architectures
      1. Neural networks

Recommendations

An enhanced text detection technique for the visually impaired to read text

An enhanced text detection technique (ETDT) is proposed, which is expected to aid the visually impaired to overcome their reading challenges. This work enhances the edge-preserving maximally stable extremal regions (eMSER) algorithm using the pyramid ...
Read More
Multi-Lingual Scene Text Detection Using One-Class Classifier

The main purpose of scene text recognition is to detect texts in a given image. The problem of text detection and recognition in such images has gained great attention in recent years due to rising demand of several applications like visual based ...
Read More
Text detection in natural scene images based on color prior guided MSER
Abstract
In this paper, we focus on text detection in natural scene images which is conducive to content-based wild image analysis and understanding. This task is still an open problem and usually includes two key issues: text candidate ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in

ICMLC '19: Proceedings of the 2019 11th International Conference on Machine Learning and Computing
February 2019
563 pages
ISBN:9781450366007
DOI:10.1145/3318299

Copyright © 2019 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 22 February 2019
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
Text detection
deep learning
unstructured data
Qualifiers
- research-article
- Research
- Refereed limited
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 0
  Total Citations
  View Citations
- 100
  Total Downloads
- Downloads (Last 12 months)7
- Downloads (Last 6 weeks)2
Other Metrics
View Author Metrics
Cited By
This publication has not been cited yet

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

MHDT: A Deep-Learning-Based Text Detection Algorithm for Unstructured Data in Banking

ICMLC '19: Proceedings of the 2019 11th International Conference on Machine Learning and Computing

ABSTRACT

References

Cited By

Index Terms

Recommendations

An enhanced text detection technique for the visually impaired to read text

Multi-Lingual Scene Text Detection Using One-Class Classifier

Text detection in natural scene images based on color prior guided MSER

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

MHDT: A Deep-Learning-Based Text Detection Algorithm for Unstructured Data in Banking

ICMLC '19: Proceedings of the 2019 11th International Conference on Machine Learning and Computing

ABSTRACT

References

Cited By

Index Terms

Recommendations

An enhanced text detection technique for the visually impaired to read text

Multi-Lingual Scene Text Detection Using One-Class Classifier

Text detection in natural scene images based on color prior guided MSER

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media