Novel Text Recognition Based on Modified K-Clustering and Hidden Markov Models

Shen, Victor R. L.; Chiou, Gwo-Jen; Lin, Yi-Nan; Jhan, Jhao-Yuan

doi:10.1007/s11277-019-06926-6

Novel Text Recognition Based on Modified K-Clustering and Hidden Markov Models

Published: 23 November 2019

Volume 111, pages 1453–1474, (2020)
Cite this article

Wireless Personal Communications Aims and scope Submit manuscript

Victor R. L. Shen^1,2,
Gwo-Jen Chiou³,
Yi-Nan Lin⁴ &
…
Jhao-Yuan Jhan^1,2

133 Accesses
Explore all metrics

Abstract

Currently, many researchers have paid more attention to identifying scene texts from the image with background interferences. This study aims to develop an App software system with text recognition on smartphones. Otsu edge detection is applied to binarize the image and to find the parameters (i.e. weights) in a K-cluster. The modified K-cluster algorithm is used to detect the text from an image. The noise in complex background is also filtered out. The detected text gradients are evaluated by histogram of gradient. Accordingly, the distribution of the detected text gradients is generated. Finally, the gradient distribution is utilized by hidden Markov models to recognize the text. The experimental results have shown that the proposed approach can successfully outperform other methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 4

Fig. 5

Research on image text recognition based on canny edge detection algorithm and k-means algorithm

Article 22 August 2021

Unsupervised Text Binarization in Handwritten Historical Documents Using k-Means Clustering

A Framework for Multi-lingual Scene Text Detection Using K-means++ and Memetic Algorithms

References

Liang, J., Doermann, D., & Li, H. (2005). Camera-based analysis of text and documents: A survey. International Journal on Document Analysis and Recognition,7(2–3), 84–104.
Article Google Scholar
Jung, K., Kim, K., & Jain, A. (2004). Text information extraction in images and video: A survey. Pattern Recognition,37(5), 977–997.
Article Google Scholar
Judd, T., Ehinger, K., Durand, F., & Torralba, A. (2009). Learning to predict where humans look. In Proceedings of IEEE 12th ICCV (pp. 2106–2113).
Chen, X., & Yuille, A. (2004). Detecting and reading text in natural scenes. Proceedings of IEEE CVPR,2, 366–373.
Google Scholar
Neumann, L., & Matas, J. (2012). Real-time scene text localization and recognition. In Proceedings of IEEE CVPR (pp. 3538–3545).
Neuman, L., & Matas, J. (2010). A method for text localization and recognition in real world images. In Proceedings of ACCV (pp. 770–783).
Odobez, J. M., & Chen, D. (2002). Robust video text segmentation and recognition with multiple hypotheses. In Proceedings of ICIP (pp. 433–436).
Huang, R., Oba, S., Shivakumara, P., & Uchida, S. (2012). Scene character detection and recognition based on multiple hypotheses framework. In Proceedings of ICPR (pp. 717–720).
Jetley, S., Behlhe, S., Koppula, V. K., & Nagi, A. (2012). Two-stage hybrid binarization around fringe map based text line segmentation for document images. In Proceedings of ICPR (pp. 343–346).
Zhang, D., & Chang, S. (2003). A bayesian framework for fusing multiple word knowledge models in videotext recognition. In Proceedings of CVPR (pp. 528–533).
Lucas, S. M. (2005). Text locating competition results. In Proceedings of third international conference on document analysis and recognition (pp. 80–85).
Gao, Song, Wang, Chunheng, Xiao, Baihua, Shi, Cunzhao, Zhou, Wen, & Zhang, Zhong. (2015). Scene text recognition by learning co-occurrence of strokes based on spatiality embedded dictionary. IET Computer Vision,9, 138–148.
Article Google Scholar
Koerich, L., Sabourin, R., & Suen, Y. (2005). Recognition and verification of unconstrained handwritten words. IEEE Transactions on Pattern Analysis and Machine Intelligence,27(10), 1509–1522.
Article Google Scholar
Pedro Felipe Felzenszwalb. Introduction to computer vision edge detection [Online]. https://www.classes.cs.uchicago.edu/archive/2008/spring/35040-1/edges.pdf. Accessed 2 June 2017.
Utrecht University. Chapter 10 segmentation [Online]. http://www.cs.uu.nl/docs/vakken/ibv/reader/chapter10.pdf. Accessed 11 July 2017.
Seo, Joung-Hae, & Park, Eun-Mi. (2018). A study on financing security for smartphones using text mining. Wireless Personal Communications,98(4), 3109–3127.
Article Google Scholar
Wikipedia. Histogram of oriented gradients [Online]. https://en.wiki-pedia.org/wiki/Histogram_of_oriented_gradients. Accessed 11 July 2017.
Dietterich, Thomas, Bishop, Christopher, Heckerman, David, Jordan, Michael, & Kearns, Michael. (2010). Introduction to machine learning (2nd ed.). London: The MIT Press.
Google Scholar
Cheng, F., Zhang, H., Fan, W., & Harris, B. (2018). Image recognition technology based on deep learning. Wireless Personal Communications,102(2), 1917–1933.
Article Google Scholar
Rabiner, L. R. (1989). A tutorial on hidden Markov models and selected applications in speech recognition. Proceedings of IEEE,77(2), 257–285.
Article Google Scholar
Young Jung Kim and Jong Yun Lee. (2016). Algorithm of a perspective transform-based PDF₄₁₇ barcode recognition. Wireless Personal Communications,89(3), 893–911.
Article Google Scholar
Davis, R. I. A., Lovell, B. C., & Caelli, T. (2002). Improved estimation of hidden Markov model parameters from multiple observation sequences. Proceedings International Conference on Pattern Recognition,2, 168–171.
Google Scholar
Baggenstoss, P. M. (2001). A modified Baum–Welch algorithm for hidden Markov models with multiple observation spaces. IEEE Transactions on Speech and Audio Processing,9, 411–416.
Article Google Scholar
Wang, K., Babenko, B., & Belongie, S. (2011). End-to-end scene text recognition. In Proceedings ICCV (pp. 1457–1464).
Otsu, N. A. (1979). Threshold selection method from gray-level histograms. IEEE Transactions on Systems, Man, and Cybernetics,9(1), 919–926.
Article Google Scholar
Abbyyfinereader 9.0. http://www.abbyy.com. Accessed 11 July 2017.

Download references

Acknowledgements

The authors are very grateful to the anonymous reviewers for their constructive comments which have improved the quality of this paper. Also, this work was supported by the Ministry of Science and Technology, Taiwan, under grant MOST 106- 2221- E-845- 001.

Author information

Authors and Affiliations

Department of Computer Science and Information Engineering, National Taipei University, 151, University Rd, Sanhsia, New Taipei City, 237, Taiwan
Victor R. L. Shen & Jhao-Yuan Jhan
Department of Information Management, Chaoyang University of Technology, Taichung City, Taiwan
Victor R. L. Shen & Jhao-Yuan Jhan
Department of Electrical Engineering, National Formosa University, Huwei Township, 632, Yunlin County, Taiwan
Gwo-Jen Chiou
Department of Electronic Engineering, Ming Chi University of Technology, New Taipei City, 243, Taiwan
Yi-Nan Lin

Authors

Victor R. L. Shen
View author publications
You can also search for this author in PubMed Google Scholar
Gwo-Jen Chiou
View author publications
You can also search for this author in PubMed Google Scholar
Yi-Nan Lin
View author publications
You can also search for this author in PubMed Google Scholar
Jhao-Yuan Jhan
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Victor R. L. Shen.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Shen, V.R.L., Chiou, GJ., Lin, YN. et al. Novel Text Recognition Based on Modified K-Clustering and Hidden Markov Models. Wireless Pers Commun 111, 1453–1474 (2020). https://doi.org/10.1007/s11277-019-06926-6

Download citation

Published: 23 November 2019
Issue Date: April 2020
DOI: https://doi.org/10.1007/s11277-019-06926-6

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Novel Text Recognition Based on Modified K-Clustering and Hidden Markov Models

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Research on image text recognition based on canny edge detection algorithm and k-means algorithm

Unsupervised Text Binarization in Handwritten Historical Documents Using k-Means Clustering

A Framework for Multi-lingual Scene Text Detection Using K-means++ and Memetic Algorithms

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

Navigation

Novel Text Recognition Based on Modified K-Clustering and Hidden Markov Models

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Research on image text recognition based on canny edge detection algorithm and k-means algorithm

Unsupervised Text Binarization in Handwritten Historical Documents Using k-Means Clustering

A Framework for Multi-lingual Scene Text Detection Using K-means++ and Memetic Algorithms

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now

Search

Navigation