short-paper

Mechanism for Structuring the Data from a Generic Identity Document Image using Semantic Analysis

Authors:

José C. Gutiérrez,

Rodolfo Valiente,

Marcelo T. Sadaike,

Daniel F. Soriano,

Graça Bressan,

Wilson V. RuggieroAuthors Info & Claims

WebMedia '17: Proceedings of the 23rd Brazillian Symposium on Multimedia and the Web

Pages 213 - 216

https://doi.org/10.1145/3126858.3131594

Published: 17 October 2017 Publication History

Abstract

Nowadays, the enormous variety of identity documents that exist makes it difficult to standardize a system capable of extracting all the information of interest presented by them. Therefore, systems that use templates to classify information based on their positions are limited by the number of templates they could recognize. Thus, in this paper, a novel mechanism intended to automatically classify the major information of interest exposed by generic identity documents is presented. The proposal is created to be easily adaptable to any system capable of detecting and extracting text information from an identity document image. To assign meaning to the text extracted from the identity document, the proposal is based on a novel mechanism to structuring the data using semantic analysis. The mechanism consists of two main steps, first, all the text data are classified as sentences or near sentences based on the Euclidean distance between words; second, the sentences are analyzed to find keywords that allow structuring the information based on its semantic to show it as abstractions. The proposal has been designed to be able to store the data as abstractions of its meaning. This allows improving the scalability of the system and a better use of this information by different services, by the end user or to be interpreted by an automated process of decision-making.

References

[1]

T. P. Kaur and N. Garg, "Optimized Gurmukhi Text Recognition from Signboard Images Captured by Mobile Camera Using Structural Features," in 2015 Fifth International Conference on Advances in Computing and Communications (ICACC), 2015, pp. 412--416.

[2]

F. Chabchoub, Y. Kessentini, S. Kanoun, V. Eglin, and F. Lebourgeois, "SmartATID: A Mobile Captured Arabic Text Images Dataset for Multi-purpose Recognition Tasks," in 2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR), 2016, pp. 120--125.

[3]

L.-P. de las Heras, O. R. Terrades, J. Llados, D. Fernandez-Mota, and C. Canero, "Use case visual Bag-of-Words techniques for camera based identity document classification," in 2015 13th International Conference on Document Analysis and Recognition (ICDAR), 2015, pp. 721--725.

Digital Library

[4]

N. L. Sonia Bhaskar Scott Green, "Implementing Optical Character Recognition on the Android Operating System for Business Cards," 2011.

[5]

M. Simon, E. Rodner, and J. Denzler, "Fine-grained classification of identity document types with only one example," in 2015 14th IAPR International Conference on Machine Vision Applications (MVA), 2015, pp. 126--129.

[6]

L. Gomez and D. Karatzas, "Multi-script Text Extraction from Natural Scenes," in 2013 12th International Conference on Document Analysis and Recognition, 2013, pp. 467--471.

Digital Library

[7]

L. Neumann and J. Matas, "Real-time scene text localization and recognition," in 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012, pp. 3538--3545.

[8]

M. Ryan and N. Hanafiah, "An Examination of Character Recognition on ID card using Template Matching Approach," Int. Conf. Comput. Sci. Comput. Intell. (Iccsci 2015), vol. 59, pp. 520--529, 2015.

[9]

R. Valiente, M. T. Sadaike, J. C. Gutiérrez, D. F. Soriano, and G. Bressan, "A process for text recognition of generic identification documents over cloud computing," IPCV'1 International Conf. Image Process. Comput. Vision, Pattern Recognit., no. April 2017, p. 4, 2016.

[10]

J. Lyons, Linguistic semantics?: an introduction. Cambridge University Press, 1995.

[11]

Y. Zhang and B. Liu, "Semantic text classification of disease reporting," in Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR '07, 2007, p. 747.

Digital Library

[12]

P. Kok Loo and C. Lim Tan, "Word and Sentence Extraction Using Irregular Pyramid," LNCS, vol. 2423, pp. 307--318, 2002.

[13]

C. L. A. Clarke and G. V. Cormack, "On the use of regular expressions for searching text," ACM Trans. Program. Lang. Syst., vol. 19, no. 3, pp. 413--426, May 1997.

Digital Library

[14]

C. Braccini, L. DeFloriani, and G. Vernazza, Image Analysis and Processing?: 8th International Conference, ICIAP'95 San Remo, Italy, September 13--15, 1995 Proceedings. Springer-Verlag, 1995.

[15]

D. Kerckhove and C. J. Lumsden, The Alphabet and the Brain?: the Lateralization of Writing. Springer Berlin Heidelberg, 1988.

Index Terms

Mechanism for Structuring the Data from a Generic Identity Document Image using Semantic Analysis
1. Applied computing
  1. Document management and text processing
    1. Document capture
      1. Document analysis
      2. Optical character recognition
2. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision problems
        Object identification
  2. Computer graphics
    1. Image manipulation
      1. Image processing

Recommendations

Identity Management

The Identity Solutions Symposium held in Jonesboro, Arkansas, 21 to 22 February, 2007 brought together academic, industry, and government experts working on radio frequency identification (RFID), biometrics, sensors, animal identification, identity ...
Generic text summarization using relevance measure and latent semantic analysis
SIGIR '01: Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval

In this paper, we propose two generic text summarization methods that create text summaries by ranking and extracting sentences from the original documents. The first method uses standard IR methods to rank sentence relevances, while the second method ...
Semantic analysis for focused multi-document summarization (fMDS) of text
SAC '15: Proceedings of the 30th Annual ACM Symposium on Applied Computing

Excess amounts of unstructured data are easily accessible in digital format quickly, yet there is no way for a human reader to easily 'ingest and digest' as quickly. This information overload places too heavy a burden on society for its analysis and ...

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences

WebMedia '17: Proceedings of the 23rd Brazillian Symposium on Multimedia and the Web

October 2017

522 pages

ISBN:9781450350969

DOI:10.1145/3126858

General Chairs:
Valter Roesler
UFRGS, Brazil
,
José Valdeni de Lima
UFRGS, Brazil
,
Program Chairs:
Celso Alberto Saibel Santos
UFES, Brazil
,
Roberto Willrich
UFSC, Brazil

Copyright © 2017 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SBC: Brazilian Computer Society
CNPq: Conselho Nacional de Desenvolvimento Cientifico e Tecn
CGIBR: Comite Gestor da Internet no Brazil
CAPES: Brazilian Higher Education Funding Council

In-Cooperation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 17 October 2017

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Short-paper

Funding Sources

Coordenação de Aperfeiçoamento de Pessoal de Nível Superior
Fundação de Apoio à Universidade de São Paulo

Conference

Webmedia '17

Sponsor:

SBC
CNPq
CGIBR
CAPES

Webmedia '17: Brazilian Symposium on Multimedia and the Web

October 17 - 20, 2017

RS, Gramado, Brazil

Acceptance Rates

WebMedia '17 Paper Acceptance Rate 38 of 138 submissions, 28%;

Overall Acceptance Rate 270 of 873 submissions, 31%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
77
Total Downloads

Downloads (Last 12 months)1
Downloads (Last 6 weeks)0

Reflects downloads up to 17 Jan 2025

Other Metrics

View Author Metrics

Citations

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents