research-article

Text Recognition Based on Weakly Supervised Learning

Authors:
Ai Guo Chen

School of Artificial Intelligence and Compute, Jiangnan University, Wuxi, China, China

School of Artificial Intelligence and Compute, Jiangnan University, Wuxi, China, China

0000-0001-5391-9174
View Profile

,
Ming Jie Zou

School of Artificial Intelligence and Compute, Jiangnan University, Wuxi, China, China

School of Artificial Intelligence and Compute, Jiangnan University, Wuxi, China, China

0009-0001-5019-8102
View Profile

,
Xiang Yu Zhang

School of Artificial Intelligence and Compute, Jiangnan University, Wuxi, China, China

School of Artificial Intelligence and Compute, Jiangnan University, Wuxi, China, China

0009-0000-4605-5208
View Profile

FAIML '23: Proceedings of the 2023 International Conference on Frontiers of Artificial Intelligence and Machine LearningApril 2023Pages 8–13https://doi.org/10.1145/3616901.3616904

Published:05 March 2024Publication History

FAIML '23: Proceedings of the 2023 International Conference on Frontiers of Artificial Intelligence and Machine Learning

Pages 8–13

ABSTRACT

Aiming at the problems of irregular text region and fuzzy text in picture, this paper proposes a text recognition method based on weakly supervised learning. The method is based on explicit rectify module, vision module, language module and fusion module. The vision module corrects the irregular text region through the correction module; the vision module extracts features and recognizes them through the convolution neural network and the location attention mechanism, and outputs the predicted strings; the language module learns sequence information through the attention mechanism and corrects the predicted strings by the vision module; finally, the output results of the vision module and the language module are combined according to the weight in the fusion module. Get the final prediction. The language module in this method prevents the direct interference of image blur and enhances the accuracy of text recognition, experiments on several common data sets demonstrate the effectiveness of the proposed method.

References

Long, Jonathan, Shelhamer, Fully Convolutional Networks for Semantic Segmentation[J]. IEEE Transactions on Pattern Analysis & Machine Intelligence, 2017Google Scholar
Hochreiter, Sepp, and Jürgen Schmidhuber. "Bridging Long Time Lags by Weight Guessing and "Long Short Term Memory"." spatiotemporal models in biological & artificial systems (1996)Google Scholar
Shi B, Xiang B, Cong Y. An End-to-End Trainable Neural Network for Image-Based Sequence Recognition and Its Application to Scene Text Recognition[J]. IEEE Transactions on Pattern Analysis & Machine Intelligence, 2016, 39(11):2298-2304Google ScholarDigital Library
Fang S, Xie H, Wang Y, Read Like Humans: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Recognition[J]. 2021Google Scholar
Shi B, Yang M, Wang X, ASTER: An Attentional Scene Text Recognizer with Flexible Rectification[J]. IEEE Transactions on Pattern Analysis & Machine Intelligence, 2018, PP:1-1Google Scholar
Vaswani A, Shazeer N, Parmar N, Attention is all you need [C]//Advances in Neural Information Processing Systems. 2017: 5998-6008Google Scholar
Merity S, Xiong C, Bradbury J, Pointer Sentinel Mixture Models[C]// ICLR. 2017Google Scholar
Yosinski J, Clune J, Bengio Y, How transferable are features in deep neural networks? [J]. MIT Press, 2014Google Scholar
Long, Jonathan, Shelhamer, Fully Convolutional Networks for Semantic Segmentation[J]. IEEE Transactions on Pattern Analysis & Machine Intelligence, 2017Google Scholar
Zhou Z H. A Brief Introduction to Weakly Supervised Learning[J]. National Science Review, 2017(1):1Google ScholarCross Ref
Luo C, Jin L, Sun Z. MORAN: A Multi-Object Rectified Attention Network for scene text recognition[J]. Pattern Recognition, 2019, 90Google Scholar
Chung J, Gulcehre C, Cho K H, Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling[J]. Eprint Arxiv, 2014Google Scholar
Mnih V, Heess N, Graves A, Recurrent Models of Visual Attention[J]. Advances in Neural Information Processing Systems, 2014, 3Google Scholar
Karatzas D, Shafait F, Uchida S, ICDAR 2013 robust reading competition[C]// Document Analysis and Recognition (ICDAR), 2013 12th International Conference on. IEEE Computer Society, 2013Google Scholar
Mishra A, Alahari K, Jawahar C V. Scene Text Recognition using Higher Order Language Priors. 2012Google Scholar
Kai W, Babenko B, Belongie S. End-to-end scene text recognition[C]// IEEE International Conference on Computer Vision. IEEE, 2012Google Scholar
Risnumawan A, Shivakumara P, Chan C S, A robust arbitrary text detection system for natural scene images[J]. Expert Systems with Applications, 2014, 41(18):8027-8048Google ScholarCross Ref
Karatzas D, Gomez-Bigorda L, Nicolaou A, ICDAR 2015 competition on Robust Reading[C]// International Conference on Document Analysis & Recognition. IEEE Computer Society, 2015Google Scholar
Phan T Q, Shivakumara P, Tian S, Recognizing Text with Perspective Distortion in Natural Scenes[C]// IEEE International Conference on Computer Vision. IEEE, 2014Google Scholar

Index Terms

Text Recognition Based on Weakly Supervised Learning
1. Applied computing
  1. Document management and text processing
2. Computing methodologies
  1. Artificial intelligence
  2. Machine learning

Index terms have been assigned to the content through auto-classification.

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in

FAIML '23: Proceedings of the 2023 International Conference on Frontiers of Artificial Intelligence and Machine Learning
April 2023
296 pages
ISBN:9798400707544
DOI:10.1145/3616901

Copyright © 2023 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 5 March 2024
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Qualifiers
- research-article
- Research
- Refereed limited
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 0
  Total Citations
  View Citations
- 5
  Total Downloads
- Downloads (Last 12 months)5
- Downloads (Last 6 weeks)2
Other Metrics
View Author Metrics
Cited By
This publication has not been cited yet

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format .

View HTML Format

Text Recognition Based on Weakly Supervised Learning

FAIML '23: Proceedings of the 2023 International Conference on Frontiers of Artificial Intelligence and Machine Learning

ABSTRACT

References

Cited By

Index Terms

Recommendations

A Text-Book of Practical Therapeutics: With Especial Reference to the Application of Remedial Measures to Disease and Their Employment Upon a Rational Basis

A Text-Book of Practical Therapeutics: With Especial Reference to the Application of Remedial Measures to Disease and Their Employment Upon a Rational Basis

A Text-Book of Practical Therapeutics: With Special Reference to the Application of Remedial Measures to Disease and Their Employment Upon a Rational Basis

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Qualifiers

Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

HTML Format

Caption

Text Recognition Based on Weakly Supervised Learning

FAIML '23: Proceedings of the 2023 International Conference on Frontiers of Artificial Intelligence and Machine Learning

ABSTRACT

References

Cited By

Index Terms

Recommendations

A Text-Book of Practical Therapeutics: With Especial Reference to the Application of Remedial Measures to Disease and Their Employment Upon a Rational Basis

A Text-Book of Practical Therapeutics: With Especial Reference to the Application of Remedial Measures to Disease and Their Employment Upon a Rational Basis

A Text-Book of Practical Therapeutics: With Special Reference to the Application of Remedial Measures to Disease and Their Employment Upon a Rational Basis

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Qualifiers

Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

HTML Format

Share this Publication link

Share on Social Media