skip to main content
10.1145/3297067.3297073acmotherconferencesArticle/Chapter ViewAbstractPublication PagesspmlConference Proceedingsconference-collections
research-article

Improvement Research and Application of Text Recognition Algorithm Based on CRNN

Published: 28 November 2018 Publication History

Abstract

This paper is based on CRNN model to recognize the text in the images of football matches scene, and two improvements are proposed. Considering the edge feature of text is strong, this paper adds MFM layers into CRNN model aiming to enhance the contrast. In order to solve the problem of losing details of image static features in the process of getting contextual features, this paper fuses up these two kinds of features. The training and testing experiments carried out on public dataset and manual dataset respectively verify the validity of the improvements, and the recognition accurate rate is higher than original model.

References

[1]
Bissacco A, Cummins M, Netzer Y, et al. PhotoOCR: Reading Text in Uncontrolled Conditions{C}// IEEE International Conference on Computer Vision. IEEE, 2014:785--792.
[2]
Jaderberg M, Simonyan K, Vedaldi A, et al. Reading Text in the Wild with Convolutional Neural Networks{J}. International Journal of Computer Vision, 2016, 116(1):1--20.
[3]
Li S, Tang M, Guo Q, et al. Deep neural network with attention model for scene text recognition{J}. Iet Computer Vision, 2017, 11(7):605--612.
[4]
Wang F, Guo Q, Lei J, et al. Convolutional recurrent neural networks with hidden Markov model bootstrap for scene text recognition{J}. Iet Computer Vision, 2017, 11(6):497--504.
[5]
Shi B, Bai X, Yao C. An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition{J}. IEEE Transactions on Pattern Analysis & Machine Intelligence, 2016, PP(99):1--1.
[6]
Simonyan K, Zisserman A. Very Deep Convolutional Networks for Large-Scale Image Recognition{J}. Computer Science, 2014.5
[7]
Pitts W. A logical calculus of the ideas immanent in nervous activity{J}. Bulletin of Mathematical Biology, 1943, 5(4):115--133.
[8]
Werbos P. Beyond Regression: New Tools for Prediction and Analysis in the Behavioral Science{J}. Ph.d.dissertation Harvard University, 1974, 29(18):65--78.
[9]
Sundermeyer M, Schlüter R, Ney H. LSTM Neural Networks for Language Modeling{C}// Interspeech. 2012:601--608.
[10]
Graves A, Gomez F. Connectionist temporal classification:labelling unsegmented sequence data with recurrent neural networks{C}// International Conference on Machine Learning. ACM, 2006:369--376.
[11]
Ranling Peng. General Psychology. Third Edition{M}. Beijing Normal University press, 2004.
[12]
Wen Zhou. Boundary Recognition Using Side Suppressed Neural Networks {J}. Journal of Nanjing University of Posts and Telecommunications (NATURAL SCIENCE EDITION), 2003, 23(1):73--76.

Cited By

View all
  • (2023)Medical Prescription Label Reading Using Computer Vision and Deep LearningSoft Computing for Problem Solving10.1007/978-981-19-6525-8_9(97-108)Online publication date: 2-Mar-2023
  • (2021)A method of detecting defects of smart meter LCD screen based on LSD and deep learningMultimedia Tools and Applications10.1007/s11042-020-10481-9Online publication date: 13-Feb-2021
  • (2020)Temperature Forecasting via Convolutional Recurrent Neural Networks Based on Time-Series DataComplexity10.1155/2020/35365722020Online publication date: 20-Mar-2020
  • Show More Cited By

Index Terms

  1. Improvement Research and Application of Text Recognition Algorithm Based on CRNN

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    SPML '18: Proceedings of the 2018 International Conference on Signal Processing and Machine Learning
    November 2018
    177 pages
    ISBN:9781450366052
    DOI:10.1145/3297067
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 28 November 2018

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. CNN
    2. Feature Fusion
    3. MFM
    4. RNN
    5. Text Recognition

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    SPML '18

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)5
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 02 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)Medical Prescription Label Reading Using Computer Vision and Deep LearningSoft Computing for Problem Solving10.1007/978-981-19-6525-8_9(97-108)Online publication date: 2-Mar-2023
    • (2021)A method of detecting defects of smart meter LCD screen based on LSD and deep learningMultimedia Tools and Applications10.1007/s11042-020-10481-9Online publication date: 13-Feb-2021
    • (2020)Temperature Forecasting via Convolutional Recurrent Neural Networks Based on Time-Series DataComplexity10.1155/2020/35365722020Online publication date: 20-Mar-2020
    • (2020)Prediction of Future Appearances via Convolutional Recurrent Neural Networks Based on Image Time Series in Cloud ComputingCloud Computing, Smart Grid and Innovative Frontiers in Telecommunications10.1007/978-3-030-48513-9_27(328-340)Online publication date: 23-May-2020

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media