Authors: Heba Hassan (1), Marwan Torki (2) and Mohamed E. Hussein (2, 3)

Affiliations: (1) Dept. of Computer Science and Engineering, Egypt-Japan University of Science and Technology, Egypt; (2) Dept. of Computer and Systems Engineering, Alexandria University, Egypt; (3) Information Sciences Institute, University of Southern California, U.S.A.

Keyword(s): Text Recognition, Multi-task Learning.

Abstract: Text recognition continues to be a challenging problem in the context of text reading in natural scenes. Bearing in mind the sequential nature of text, the problem is usually posed as a sequence prediction problem from a whole-word image. Alternatively, it can be posed as a character prediction problem. The latter approach is typically more robust to challenging word shapes. Attempting to find the sweet spot that attains the best of the two approaches, we propose the Sequence-Character Aware Network (SCAN). SCAN starts by locating and recognizing the characters, and then generates the word using a sequence-based approach. It comprises two modules: a semantic-segmentation-based character prediction module and an encoder-decoder network for word generation. Training is done in two stages. In the first stage, we adopt a multi-task training technique with both character-level and word-level losses and trainable loss weighting. In the second stage, the character-level loss is removed, enabling the use of data with only word-level annotations. Experiments are conducted on several datasets for both regular and irregular text, showing state-of-the-art performance of the proposed approach. They also show that the proposed approach is robust against noisy word detection.
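
The two-stage objective described in the abstract can be illustrated with a minimal sketch. The abstract only states that the loss weights are trainable; the specific weighting form below (one learnable log-variance per task, in the style of uncertainty-based multi-task weighting) is an assumption, as are the function and parameter names. It is not the authors' implementation, and the choice to use the unweighted word loss in the second stage is likewise an assumption.

```python
import math


def combined_loss(char_loss, word_loss, log_var_char, log_var_word, stage=1):
    """Combine character-level and word-level losses (hypothetical sketch).

    log_var_char and log_var_word stand in for trainable scalars; in a real
    training loop they would be updated by the optimizer together with the
    network weights. The weighting form is an assumption, not taken from
    the paper.
    """
    if stage == 2:
        # Stage 2: the character-level loss is removed, so samples that
        # carry only word-level annotations can be used.
        return word_loss

    # Stage 1: each task loss is scaled by exp(-log_var), and log_var is
    # added as a regularizer so the weights cannot collapse to zero.
    char_term = math.exp(-log_var_char) * char_loss + log_var_char
    word_term = math.exp(-log_var_word) * word_loss + log_var_word
    return char_term + word_term


if __name__ == "__main__":
    # With both log-variances at 0 the weights are 1, so the losses just add.
    print(combined_loss(2.0, 3.0, 0.0, 0.0))           # stage 1
    print(combined_loss(2.0, 3.0, 0.0, 0.0, stage=2))  # stage 2
```

With both log-variance parameters at zero, the first-stage value reduces to the plain sum of the two losses; raising a task's log-variance lowers that task's weight.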

CC BY-NC-ND 4.0


Paper citation in several formats:
Hassan, H.; Torki, M. and Hussein, M. (2021). SCAN: Sequence-character Aware Network for Text Recognition. In Proceedings of the 16th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2021) - Volume 5: VISAPP; ISBN 978-989-758-488-6; ISSN 2184-4321, SciTePress, pages 602-609. DOI: 10.5220/0010321106020609

@conference{visapp21,
author={Heba Hassan and Marwan Torki and Mohamed E. Hussein},
title={SCAN: Sequence-character Aware Network for Text Recognition},
booktitle={Proceedings of the 16th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2021) - Volume 5: VISAPP},
year={2021},
pages={602--609},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0010321106020609},
isbn={978-989-758-488-6},
issn={2184-4321},
}

TY  - CONF
JO  - Proceedings of the 16th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2021) - Volume 5: VISAPP
TI  - SCAN: Sequence-character Aware Network for Text Recognition
SN  - 978-989-758-488-6
IS  - 2184-4321
AU  - Hassan, H.
AU  - Torki, M.
AU  - Hussein, M.
PY  - 2021
SP  - 602
EP  - 609
DO  - 10.5220/0010321106020609
PB  - SciTePress