ABSTRACT
Text detection in natural scene images becomes highly demanded for unstructured data in banking. In this paper, we propose a new deep learning algorithm called MSER, Hu-moment and Deep learning for Text detection (MHDT) based on Maximum Stable Extremal Regions (MSER) and Hu-moment features. Firstly, we extract MSERs as candidate characters. Secondly, a character classifier is introduced with Hu-moment features to reduce the number of input for clustering. After single linkage clustering, a text classifier trained from a Deep Brief Network is used to delete non-text. The proposed algorithm is evaluated on the ICDAR database, and the experimental results show that the proposed algorithm yields high precision and recall rate.
- Balducci, Bitty, and Detelina Marinova. Unstructured data in marketing. Journal of the Academy of Marketing Science (2018): 1--34.Google Scholar
- Edge, Darren, Jonathan Larson, and Christopher White. 2018. Bringing AI to BI: Enabling Visual Analytics of Unstructured Data in a Modern Business Intelligence Platform. Extended Abstracts of the 2018 CHI Conference on Human Factors in Computing Systems. ACM, 2018. Google ScholarDigital Library
- Yin, Xu-Cheng, Xuwang Yin, Kaizhu Huang, and Hong-Wei Hao. 2014. Robust text detection in natural scene images. IEEE transactions on pattern analysis and machine intelligence 36, 5 (2014): 970--983.Google Scholar
- Chen, Xiangrong, and Alan L. Yuille. 2004. Detecting and reading text in natural scenes. 2004. CVPR 2004. Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Vol. 2. IEEE, 2004. Google ScholarDigital Library
- Lee, Jung-Jin, et al. 2011. Adaboost for text detection in natural scene. 2011 International Conference on Document Analysis and Recognition (ICDAR). IEEE, 2011. Google ScholarDigital Library
- Yin, Xuwang, et al. 2012. Effective text localization in natural scene images with MSER, geometry-based grouping and AdaBoost. 2012 21st International Conference on Pattern Recognition (ICPR). IEEE, 2012.Google Scholar
- Epshtein, Boris, Eyal Ofek, and Yonatan Wexler. 2010. Detecting text in natural scenes with stroke width transform. 2010 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, 2010.Google ScholarCross Ref
- Yi, Chucai, and Yingli Tian. 2012. Localizing text in scene images by boundary clustering, stroke segmentation, and string fragment classification. IEEE Transactions on Image Processing 21, 9 (2012): 4256--4268. Google ScholarDigital Library
- Yi, Chucai, and Yingli Tian. 2013. Text extraction from scene images by character appearance and structure modeling. Computer Vision and Image Understanding 117, 2 (2013): 182--194. Google ScholarDigital Library
- Chen, Huizhong, et al. 2011. Robust text detection in natural images with edge-enhanced maximally stable extremal regions. 2011 18th IEEE International Conference on Image Processing (ICIP). IEEE, 2011.Google ScholarCross Ref
- Liu, Weibo, et al. 2017. A survey of deep neural network architectures and their applications. Neurocomputing 234 (2017): 11--26.Google ScholarCross Ref
- Kim, Yelin, Honglak Lee, and Emily Mower Provost. 2013. Deep learning for robust feature generation in audiovisual emotion recognition. 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2013.Google ScholarCross Ref
- Zhou, Shusen, Qingcai Chen, and Xiaolong Wang. 2010. Discriminative deep belief networks for image classification. 2010 17th IEEE International Conference on Image Processing (ICIP), IEEE, 2010.Google ScholarCross Ref
- Huang, Chenchen, et al. 2014. A research of speech emotion recognition based on deep belief network and SVM. Mathematical Problems in Engineering 2014 (2014).Google Scholar
- Wang, Hai, Yingfeng Cai, and Long Chen. "A vehicle detection algorithm based on deep belief network." The scientific world journal 2014 (2014).Google Scholar
- Matas, Jiri, et al. 2004. Robust wide-baseline stereo from maximally stable extremal regions. Image and vision computing 22, 10 (2004): 761--767.Google Scholar
- Mikolajczyk, Krystian, et al. 2005. A comparison of affine region detectors. International journal of computer vision 65, 1--2 (2005): 43--72. Google ScholarDigital Library
- Hu, Ming-Kuei. 1962. Visual pattern recognition by moment invariants. IRE transactions on information theory 8, 2 (1962): 179--187.Google Scholar
- "Peak Noise to Signal Ratio". {online}. Available: http://en.wikipedia.org/wiki/Peak_signal-to-noise_ratioGoogle Scholar
- Hinton, Geoffrey E., Simon Osindero, and Yee-Whye Teh. 2006. A fast learning algorithm for deep belief nets. Neural computation 18, 7 (2006): 1527--1554. Google ScholarDigital Library
- Hinton, Geoffrey E. 2012. A practical guide to training restricted Boltzmann machines. Neural networks: Tricks of the trade. Springer, Berlin, Heidelberg, 2012. 599--619.Google Scholar
- Shahab, Asif, Faisal Shafait, and Andreas Dengel. 2011. ICDAR 2011 robust reading competition challenge 2: Reading text in scene images. 2011 International Conference on Document Analysis and Recognition (ICDAR). IEEE, 2011. Google ScholarDigital Library
- Breiman, Leo. Classification and regression trees. Routledge, 2017.Google ScholarCross Ref
- Koo, Hyung Il, and Duck Hoon Kim. 2013. Scene text detection via connected component clustering and nontext filtering. IEEE transactions on image processing 22, 6 (2013): 2296--2305. Google ScholarDigital Library
Index Terms
- MHDT: A Deep-Learning-Based Text Detection Algorithm for Unstructured Data in Banking
Recommendations
An enhanced text detection technique for the visually impaired to read text
An enhanced text detection technique (ETDT) is proposed, which is expected to aid the visually impaired to overcome their reading challenges. This work enhances the edge-preserving maximally stable extremal regions (eMSER) algorithm using the pyramid ...
Multi-Lingual Scene Text Detection Using One-Class Classifier
The main purpose of scene text recognition is to detect texts in a given image. The problem of text detection and recognition in such images has gained great attention in recent years due to rising demand of several applications like visual based ...
Text detection in natural scene images based on color prior guided MSER
AbstractIn this paper, we focus on text detection in natural scene images which is conducive to content-based wild image analysis and understanding. This task is still an open problem and usually includes two key issues: text candidate ...
Comments