skip to main content
10.1145/3616901.3616904acmotherconferencesArticle/Chapter ViewAbstractPublication PagesfaimlConference Proceedingsconference-collections
research-article

Text Recognition Based on Weakly Supervised Learning

Authors Info & Claims
Published:05 March 2024Publication History

ABSTRACT

Aiming at the problems of irregular text region and fuzzy text in picture, this paper proposes a text recognition method based on weakly supervised learning. The method is based on explicit rectify module, vision module, language module and fusion module. The vision module corrects the irregular text region through the correction module; the vision module extracts features and recognizes them through the convolution neural network and the location attention mechanism, and outputs the predicted strings; the language module learns sequence information through the attention mechanism and corrects the predicted strings by the vision module; finally, the output results of the vision module and the language module are combined according to the weight in the fusion module. Get the final prediction. The language module in this method prevents the direct interference of image blur and enhances the accuracy of text recognition, experiments on several common data sets demonstrate the effectiveness of the proposed method.

References

  1. Long, Jonathan, Shelhamer, Fully Convolutional Networks for Semantic Segmentation[J]. IEEE Transactions on Pattern Analysis & Machine Intelligence, 2017Google ScholarGoogle Scholar
  2. Hochreiter, Sepp, and Jürgen Schmidhuber. "Bridging Long Time Lags by Weight Guessing and "Long Short Term Memory"." spatiotemporal models in biological & artificial systems (1996)Google ScholarGoogle Scholar
  3. Shi B, Xiang B, Cong Y. An End-to-End Trainable Neural Network for Image-Based Sequence Recognition and Its Application to Scene Text Recognition[J]. IEEE Transactions on Pattern Analysis & Machine Intelligence, 2016, 39(11):2298-2304Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Fang S, Xie H, Wang Y, Read Like Humans: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Recognition[J]. 2021Google ScholarGoogle Scholar
  5. Shi B, Yang M, Wang X, ASTER: An Attentional Scene Text Recognizer with Flexible Rectification[J]. IEEE Transactions on Pattern Analysis & Machine Intelligence, 2018, PP:1-1Google ScholarGoogle Scholar
  6. Vaswani A, Shazeer N, Parmar N, Attention is all you need [C]//Advances in Neural Information Processing Systems. 2017: 5998-6008Google ScholarGoogle Scholar
  7. Merity S, Xiong C, Bradbury J, Pointer Sentinel Mixture Models[C]// ICLR. 2017Google ScholarGoogle Scholar
  8. Yosinski J, Clune J, Bengio Y, How transferable are features in deep neural networks? [J]. MIT Press, 2014Google ScholarGoogle Scholar
  9. Long, Jonathan, Shelhamer, Fully Convolutional Networks for Semantic Segmentation[J]. IEEE Transactions on Pattern Analysis & Machine Intelligence, 2017Google ScholarGoogle Scholar
  10. Zhou Z H. A Brief Introduction to Weakly Supervised Learning[J]. National Science Review, 2017(1):1Google ScholarGoogle ScholarCross RefCross Ref
  11. Luo C, Jin L, Sun Z. MORAN: A Multi-Object Rectified Attention Network for scene text recognition[J]. Pattern Recognition, 2019, 90Google ScholarGoogle Scholar
  12. Chung J, Gulcehre C, Cho K H, Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling[J]. Eprint Arxiv, 2014Google ScholarGoogle Scholar
  13. Mnih V, Heess N, Graves A, Recurrent Models of Visual Attention[J]. Advances in Neural Information Processing Systems, 2014, 3Google ScholarGoogle Scholar
  14. Karatzas D, Shafait F, Uchida S, ICDAR 2013 robust reading competition[C]// Document Analysis and Recognition (ICDAR), 2013 12th International Conference on. IEEE Computer Society, 2013Google ScholarGoogle Scholar
  15. Mishra A, Alahari K, Jawahar C V. Scene Text Recognition using Higher Order Language Priors. 2012Google ScholarGoogle Scholar
  16. Kai W, Babenko B, Belongie S. End-to-end scene text recognition[C]// IEEE International Conference on Computer Vision. IEEE, 2012Google ScholarGoogle Scholar
  17. Risnumawan A, Shivakumara P, Chan C S, A robust arbitrary text detection system for natural scene images[J]. Expert Systems with Applications, 2014, 41(18):8027-8048Google ScholarGoogle ScholarCross RefCross Ref
  18. Karatzas D, Gomez-Bigorda L, Nicolaou A, ICDAR 2015 competition on Robust Reading[C]// International Conference on Document Analysis & Recognition. IEEE Computer Society, 2015Google ScholarGoogle Scholar
  19. Phan T Q, Shivakumara P, Tian S, Recognizing Text with Perspective Distortion in Natural Scenes[C]// IEEE International Conference on Computer Vision. IEEE, 2014Google ScholarGoogle Scholar

Index Terms

  1. Text Recognition Based on Weakly Supervised Learning
        Index terms have been assigned to the content through auto-classification.

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Other conferences
          FAIML '23: Proceedings of the 2023 International Conference on Frontiers of Artificial Intelligence and Machine Learning
          April 2023
          296 pages
          ISBN:9798400707544
          DOI:10.1145/3616901

          Copyright © 2023 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 5 March 2024

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article
          • Research
          • Refereed limited
        • Article Metrics

          • Downloads (Last 12 months)5
          • Downloads (Last 6 weeks)2

          Other Metrics

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        HTML Format

        View this article in HTML Format .

        View HTML Format