short-paper

Automatic Text Recognition in Web Images

Authors:

Rodolfo Valiente,

José C. Gutiérrez,

Marcelo T. Sadaike,

Graça BressanAuthors Info & Claims

WebMedia '17: Proceedings of the 23rd Brazillian Symposium on Multimedia and the Web

Pages 241 - 244

https://doi.org/10.1145/3126858.3131570

Published: 17 October 2017 Publication History

Abstract

Web images play an important role in delivering multimedia content on the Web. The text embedded in web images carry semantic information related to layout and content of the pages. Statistics show that there is a significant need to detect and recognize text from web images. This paper presents an architecture that efficiently integrates localization, extraction and recognition algorithms applied to text recognition in web images. In the recognition step is proposed a procedure based on super-resolution and an iterative method for improving the performance. The approach is implemented and evaluated using Matlab and cloud computing, making the system flexible, scalable and robust in detecting texts from complex web images with different orientations, dimensions and colors. Competitive results are presented, both in precision and recognition rate, when compared with other systems in the existing literature.

References

[1]

J. Sun, Z. Wang, H. Yu, F. Nishino, Y. Katsuyama, and S. Naoi, "Effective text extraction and recognition for WWW images," in Proceedings of the 2003 ACM symposium on Document engineering, Grenoble, France, 2003, pp. 115--117.

Digital Library

[2]

A. D. Costa, and M. P. d. Oliveira, Atribuição e exploração de semântica no processo de categorização de documentos, in Companion Proceedings of the XIV ACM Brazilian Symposium on Multimedia and the Web, Vila Velha, Brazil, 2008, pp. 193--196.

Digital Library

[3]

M. Ryan, and N. Hanafiah, "An Examination of Character Recognition on ID card using Template Matching Approach," International Conference on Computer Science and Computational Intelligence (Iccsci 2015), vol. 59, pp. 520--529, 2015.

[4]

R. Valiente, M. T. Sadaike, J. C. Gutiérrez, D. F. Soriano, G. Bressan, and W. V. Ruggiero, "A process for text recognition of generic identification documents over cloud computing." p. 142.

[5]

D. Karatzas, S. R. Mestre, J. Mas, F. Nourbakhsh, and P. P. Roy, "ICDAR 2011 Robust Reading Competition - Challenge 1: Reading Text in Born-Digital Images (Web and Email)." pp. 1485--1490.

[6]

A. Hooda, M. Kathuria, and V. Pankajakshan, "Application of Forgery Localization in Overlay Text Detection," in Proceedings of the 2014 ACM Indian Conference on Computer Vision Graphics and Image Processing, Bangalore, India, 2014, pp. 1--7.

Digital Library

[7]

D. Karatzas, F. Shafait, S. Uchida, M. Iwamura, L. G. i. Bigorda, S. R. Mestre, J. Mas, D. F. Mota, J. A. Almazàn, and L. P. d. l. Heras, "ICDAR 2013 Robust Reading Competition." pp. 1484--1493.

[8]

J. Zhou, and D. Lopresti, "Extracting text from WWW images." pp. 248--252.

[9]

S. J. Perantonis, B. Gatos, V. Maragos, V. Karkaletsis, and G. Petasis, "Text area identification in web images." pp. 82--92.

[10]

D. Lopresti, and J. Zhou, "Locating and recognizing text in WWW images," Information Retrieval, vol. 2, no. 2--3, pp. 177--206, 2000.

[11]

C. Liu, C. Yang, X. Ding, and J. Fan, "Text extraction from web images." pp. 78790P-78790P-14.

[12]

X. C. Yin, X. W. Yin, K. Z. Huang, and H. W. Hao, "Robust Text Detection in Natural Scene Images," Ieee Transactions on Pattern Analysis and Machine Intelligence, vol. 36, no. 5, pp. 970--983, May, 2014.

[13]

A. Gonzalez, L. M. Bergasa, J. J. Yebes, S. Bronte, and Ieee, "Text Location in Complex Images," 2012 21st International Conference on Pattern Recognition, International Conference on Pattern Recognition, pp. 617--620, New York: Ieee, 2012.

[14]

X. Peng, H. Cao, S. Setlur, V. Govindaraju, and P. Natarajan, "Multilingual OCR research and applications: an overview," in Proceedings of the 4th ACM International Workshop on Multilingual OCR, Washington, D.C., USA, 2013, pp. 1--8.

Digital Library

[15]

R. W. Soukoreff, and I. S. MacKenzie, "Measuring errors in text entry tasks: an application of the Levenshtein string distance statistic." pp. 319--320.

Cited By

R LS SS GV ST MK S(2024)Text to voice conversion of text embedded in images2024 IEEE International Conference for Women in Innovation, Technology & Entrepreneurship (ICWITE)10.1109/ICWITE59797.2024.10503275(148-154)Online publication date: 16-Feb-2024
https://doi.org/10.1109/ICWITE59797.2024.10503275
Valiente RChan DPerry ALampkins JStrelnikoff SXu JAshari A(2023)Robust Perception and Visual Understanding of Traffic Signs in the WildIEEE Open Journal of Intelligent Transportation Systems10.1109/OJITS.2023.32980314(611-625)Online publication date: 2023
https://doi.org/10.1109/OJITS.2023.3298031
Ahmed ZSingh H(2019)Text Extraction and Clustering for Multimedia: A review on Techniques and Challenges2019 International Conference on Digitization (ICD)10.1109/ICD47981.2019.9105905(38-43)Online publication date: Nov-2019
https://doi.org/10.1109/ICD47981.2019.9105905

Index Terms

Automatic Text Recognition in Web Images
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision problems
      2. Image and video acquisition
  2. Computer graphics
    1. Image manipulation
      1. Image processing
2. Information systems
  1. Information retrieval
    1. Specialized information retrieval
      1. Multimedia and multimodal retrieval
        Image search

Recommendations

An Abused Webpage Detection Method Based on Screenshots Text Recognition
ACM ICEA '21: Proceedings of the 2021 ACM International Conference on Intelligent Computing and its Emerging Applications

With the rapid development of the Internet, webpages containing abused information such as pornography and gambling have emerged in an endless stream. These webpages are using various methods to evade traditional detection methods and which seriously ...
Image classification for mobile web browsing
WWW '06: Proceedings of the 15th international conference on World Wide Web

It is difficult for users of mobile devices such as cellular phones equipped with a small screen and a poor input interface to browse Web pages designed for desktop PCs with large displays. Many studies and commercial products have tried to solve this ...
A blind deconvolution model for scene text detection and recognition in video

Text detection and recognition in poor quality video is a challenging problem due to unpredictable blur and distortion effects caused by camera and text movements. This affects the overall performance of the text detection and recognition methods. This ...

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences

WebMedia '17: Proceedings of the 23rd Brazillian Symposium on Multimedia and the Web

October 2017

522 pages

ISBN:9781450350969

DOI:10.1145/3126858

General Chairs:
Valter Roesler
UFRGS, Brazil
,
José Valdeni de Lima
UFRGS, Brazil
,
Program Chairs:
Celso Alberto Saibel Santos
UFES, Brazil
,
Roberto Willrich
UFSC, Brazil

Copyright © 2017 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SBC: Brazilian Computer Society
CNPq: Conselho Nacional de Desenvolvimento Cientifico e Tecn
CGIBR: Comite Gestor da Internet no Brazil
CAPES: Brazilian Higher Education Funding Council

In-Cooperation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 17 October 2017

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Short-paper

Funding Sources

Conselho Nacional de Desenvolvimento Científico e Tecnológico
Fundação de Apoio à Universidade de São Paulo

Conference

Webmedia '17

Sponsor:

SBC
CNPq
CGIBR
CAPES

Webmedia '17: Brazilian Symposium on Multimedia and the Web

October 17 - 20, 2017

RS, Gramado, Brazil

Acceptance Rates

WebMedia '17 Paper Acceptance Rate 38 of 138 submissions, 28%;

Overall Acceptance Rate 270 of 873 submissions, 31%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

3
Total Citations
View Citations
140
Total Downloads

Downloads (Last 12 months)2
Downloads (Last 6 weeks)0

Reflects downloads up to 17 Jan 2025

Other Metrics

View Author Metrics

Citations

Cited By

R LS SS GV ST MK S(2024)Text to voice conversion of text embedded in images2024 IEEE International Conference for Women in Innovation, Technology & Entrepreneurship (ICWITE)10.1109/ICWITE59797.2024.10503275(148-154)Online publication date: 16-Feb-2024
https://doi.org/10.1109/ICWITE59797.2024.10503275
Valiente RChan DPerry ALampkins JStrelnikoff SXu JAshari A(2023)Robust Perception and Visual Understanding of Traffic Signs in the WildIEEE Open Journal of Intelligent Transportation Systems10.1109/OJITS.2023.32980314(611-625)Online publication date: 2023
https://doi.org/10.1109/OJITS.2023.3298031
Ahmed ZSingh H(2019)Text Extraction and Clustering for Multimedia: A review on Techniques and Challenges2019 International Conference on Digitization (ICD)10.1109/ICD47981.2019.9105905(38-43)Online publication date: Nov-2019
https://doi.org/10.1109/ICD47981.2019.9105905

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents