research-article

MoDest: Multi-module Design Validation for Documents

Authors:
Bhanupriya Pegu

AI Garage, Mastercard, India

AI Garage, Mastercard, India
View Profile

,
Maneet Singh

AI Garage, Mastercard, India

AI Garage, Mastercard, India
View Profile

,
Kamal Kant

AI Garage, Mastercard, India

AI Garage, Mastercard, India
View Profile

,
Karamjit Singh

AI Garage, Mastercard, India

AI Garage, Mastercard, India
View Profile

,
Tanmoy Bhowmik

AI Garage, Mastercard, India

AI Garage, Mastercard, India
View Profile

CODS-COMAD '21: Proceedings of the 3rd ACM India Joint International Conference on Data Science & Management of Data (8th ACM IKDD CODS & 26th COMAD)January 2021Pages 332–340https://doi.org/10.1145/3430984.3431001

Published:02 January 2021Publication History

CODS-COMAD '21: Proceedings of the 3rd ACM India Joint International Conference on Data Science & Management of Data (8th ACM IKDD CODS & 26th COMAD)

Pages 332–340

ABSTRACT

Information extraction (IE) from Visually Rich Documents (VRDs) is a common need for businesses, where extracted information is used for various purposes such as verification, design validation, or compliance. Most of the research in IE from VRDs has focused on textual documents such as invoices and receipts, while extracting information from multi-modal VRDs remains a challenging task. This research presents a novel end-to-end design validation framework for multi-modal VRDs containing textual and visual components, for compliance against a pre-defined set of rules. The proposed Multi-mOdule DESign validaTion (referred to as MoDest) framework constitutes two steps: (i) information extraction using five modules for obtaining the textual and visual components, followed by (ii) validating the extracted components against a pre-defined set of design rules. Given an input multi-modal VRD image, the MoDest framework either accepts or rejects its design while providing an explanation for the decision. The proposed framework is tested for design validation for a particular type of VRDs: banking cards, under the real-world constraint of limited and highly imbalance training data with more than 99% of card designs belonging to one class (accepted). Experimental evaluation on real world images from our in-house dataset demonstrates the effectiveness of the proposed MoDest framework. Analysis drawn from the real-world deployment of the framework further strengthens its utility for design validation.

References

Mary Elaine Califf and Raymond J Mooney. 2003. Bottom-up relational learning of pattern matching rules for information extraction. Journal of Machine Learning Research 4 (2003), 177–210.Google ScholarDigital Library
John Canny. 1986. A computational approach to edge detection. IEEE Transactions on Pattern Analysis and Machine Intelligence6 (1986), 679–698.Google ScholarDigital Library
Adulwit Chinapas, Pattarawit Polpinit, Narong Intiruk, and K Saikaew. 2019. Personal Verification System Using ID Card and Face Photo. International Journal of Machine Learning and Computing 9 (2019), 407–412.Google ScholarCross Ref
Vincent Poulain d’Andecy, Emmanuel Hartmann, and Marçal Rusinol. 2018. Field extraction by hybrid incremental and a-priori structural templates. In IAPR International Workshop on Document Analysis Systems. 251–256.Google Scholar
Brian Davis, Bryan Morse, Scott Cohen, Brian Price, and Chris Tensmeyer. 2019. Deep visual template-free form parsing. In International Conference on Document Analysis and Recognition. 134–141.Google ScholarCross Ref
Christopher G Harris, Mike Stephens, 1988. A combined corner and edge detector.. In Alvey vision conference, Vol. 15. 10–5244.Google Scholar
Diederik P Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980(2014).Google Scholar
Xiaojing Liu, Feiyu Gao, Qiong Zhang, and Huasha Zhao. 2019. Graph Convolution for Multimodal Information Extraction from Visually Rich Documents. In Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Industry Papers). 32–39.Google ScholarCross Ref
Bodhisattwa Prasad Majumder, Navneet Potti, Sandeep Tata, James Bradley Wendt, Qi Zhao, and Marc Najork. 2020. Representation Learning for Information Extraction from Form-like Documents. In Annual Meeting of the Association for Computational Linguistics. 6495–6504.Google ScholarCross Ref
F Meyer. 1978. Contrast feature extraction. Quantitative Analysis of Micro-structures in Material Sciences, Biology and Medicine (1978).Google Scholar
Ann Nosseir and Omar Adel. 2018. Automatic Extraction of Arabic Number from Egyptian ID Cards. In International Conference on Software and Information Engineering. 56–61.Google ScholarDigital Library
Nobuyuki Otsu. 1979. A threshold selection method from gray-level histograms. IEEE Transactions on Systems, Man, and Cybernetics 9, 1(1979), 62–66.Google ScholarCross Ref
Ritesh Sarkhel and Arnab Nandi. 2019. Visual segmentation for information extraction from heterogeneous visually rich documents. In International Conference on Management of Data. 247–262.Google ScholarDigital Library
Ray Smith. 2007. An overview of the Tesseract OCR engine. In International Conference on Document Analysis and Recognition, Vol. 2. 629–633.Google ScholarCross Ref
Irwin Sobel. 2014. History and definition of the sobel operator. Retrieved from the World Wide Web 1505 (2014).Google Scholar
Niloofar Tavakolian, Azadeh Nazemi, and Donal Fitzpatrick. 2020. Real-time information retrieval from Identity cards. arXiv preprint arXiv:2003.12103(2020).Google Scholar
Xinyu Zhou, Cong Yao, He Wen, Yuzhi Wang, Shuchang Zhou, Weiran He, and Jiajun Liang. 2017. EAST: an efficient and accurate scene text detector. In IEEE Conference on Computer Vision and Pattern Recognition. 5551–5560.Google ScholarCross Ref

Recommendations

Measuring the effectiveness of various design validation approaches for PowerPCTM microprocessor arrays
DATE '98: Proceedings of the conference on Design, automation and test in Europe

Although several methods for array design validation have been proposed and had great success in the past, little evidence has been reported for the effectiveness of these methods with respect to the detection of design errors. In this paper, we propose ...
Read More
Software design validation tool
International Conference on Reliable Software

DECA is a computer program which is used in conjunction with a top-down dominated design methodology. The program organizes, validates, and produces a document depicting the design of a software system. The use of DECA significantly enhances the quality ...
Read More
Software design validation tool
Proceedings of the international conference on Reliable software

DECA is a computer program which is used in conjunction with a top-down dominated design methodology. The program organizes, validates, and produces a document depicting the design of a software system. The use of DECA significantly enhances the quality ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
CODS-COMAD '21: Proceedings of the 3rd ACM India Joint International Conference on Data Science & Management of Data (8th ACM IKDD CODS & 26th COMAD)
January 2021
453 pages
ISBN:9781450388177
DOI:10.1145/3430984
Editors:
Jayant Haritsa,
Shourya Roy,
Manish Gupta,
Sharad Mehrotra,
Balaji Vasan Srinivasan,
Yogesh Simmhan
Copyright © 2021 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 2 January 2021
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
Computer Vision
Design Validation
Multi-modal Document
Qualifiers
- research-article
- Research
- Refereed limited
Conference

Acceptance Rates
Overall Acceptance Rate197of680submissions,29%
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 0
  Total Citations
  View Citations
- 50
  Total Downloads
- Downloads (Last 12 months)4
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
This publication has not been cited yet

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format .

View HTML Format

MoDest: Multi-module Design Validation for Documents

CODS-COMAD '21: Proceedings of the 3rd ACM India Joint International Conference on Data Science & Management of Data (8th ACM IKDD CODS & 26th COMAD)

ABSTRACT

References

Cited By

Recommendations

Measuring the effectiveness of various design validation approaches for PowerPCTM microprocessor arrays

Software design validation tool

Software design validation tool