Sequence Recognition of Scene Text Based on CRNN and CTPN Models

EITCE '22: Proceedings of the 2022 6th International Conference on Electronic Information Technology and Computer Engineering

Pages 196 - 200

https://doi.org/10.1145/3573428.3573462

Published: 15 March 2023

Abstract

Image-based sequence recognition has recently become a prominent topic in computer vision, and text detection and recognition in natural scenes is an active research field. Using scene text data, this paper reviews the theory of the deep-learning-based CRNN and CTPN models and their text-processing pipelines. CRNN turns text recognition into a time-dependent sequence learning problem, an approach commonly used for text sequences of indeterminate length. It learns contextual relationships within the text image using BLSTM and CTC, which improves recognition accuracy and makes the model more robust. Because it is not constrained by any predefined lexicon, it also performs well on both lexicon-free and lexicon-based scene text recognition tasks. It yields a smaller yet more efficient model that is better suited to real-world settings. However, CRNN recognition accuracy is lower for short texts with large morphological changes, such as artistic lettering, and for texts whose appearance varies strongly in natural scenes. Because of its anchor setting, CTPN can detect only horizontally distributed text, although a small modification, adding horizontal anchors, allows it to detect vertical text. Owing to the limitations of the framework, irregularly inclined text can be detected only coarsely.
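
The abstract names CTC as the component that lets CRNN handle text of indeterminate length. As an illustration only (it is not taken from the paper), the following minimal Python sketch shows greedy, best-path CTC decoding: pick the most likely class for each frame, merge consecutive repeats, then drop the blank symbol. The two-letter alphabet, the blank index 0, the per-frame scores, and the function name `ctc_greedy_decode` are all assumptions made for this example.

```python
from typing import List, Sequence

BLANK = 0  # assumed index of the CTC blank symbol


def ctc_greedy_decode(frame_scores: Sequence[Sequence[float]], alphabet: str) -> str:
    """Greedy (best-path) CTC decoding.

    frame_scores: one score vector per time step; index 0 is the blank,
    index i (i >= 1) corresponds to alphabet[i - 1].
    """
    # 1. Pick the highest-scoring class at every time step.
    best_path: List[int] = [
        max(range(len(scores)), key=scores.__getitem__) for scores in frame_scores
    ]

    decoded = []
    previous = BLANK
    for label in best_path:
        # 2. Skip blanks and consecutive repeats of the same label.
        if label != BLANK and label != previous:
            decoded.append(alphabet[label - 1])
        previous = label
    return "".join(decoded)


# Hypothetical per-frame scores over the classes (blank, 'a', 'b').
scores = [
    [0.10, 0.80, 0.10],  # 'a'
    [0.10, 0.70, 0.20],  # 'a' again: a consecutive repeat, merged
    [0.90, 0.05, 0.05],  # blank
    [0.20, 0.60, 0.20],  # 'a': kept, because a blank separates it
    [0.10, 0.20, 0.70],  # 'b'
]
print(ctc_greedy_decode(scores, "ab"))  # aab
```

Five input frames yield a three-character string, which shows how the blank symbol lets the output length differ from the number of frames. A trained recognizer would typically use a beam search or a lexicon-constrained decoder rather than this greedy rule.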

Index Terms

Software and its engineering > Software creation and management > Software development process management > Software development methods > Agile software development


Published In

EITCE '22: Proceedings of the 2022 6th International Conference on Electronic Information Technology and Computer Engineering

October 2022

1999 pages

ISBN: 9781450397148

DOI: 10.1145/3573428

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

Research-article
Research
Refereed limited

Conference

EITCE 2022: 2022 6th International Conference on Electronic Information Technology and Computer Engineering

October 21 - 23, 2022

Xiamen, China

Acceptance Rates

Overall Acceptance Rate 508 of 972 submissions, 52%
