short-paper

High-Performance Preprocessing of Architectural Drawings for Legend Metadata Extraction via OCR

Authors:
Tamir Hassan

HP Labs, Vienna, Austria

HP Labs, Vienna, Austria
View Profile

,
Jaume Verges-Llahi

HP, Barcelona, Spain

HP, Barcelona, Spain
View Profile

,
Andres Gonzalez

HP, Barcelona, Spain

HP, Barcelona, Spain
View Profile

DocEng '17: Proceedings of the 2017 ACM Symposium on Document EngineeringAugust 2017Pages 197–200https://doi.org/10.1145/3103010.3121042

Published:31 August 2017Publication History

DocEng '17: Proceedings of the 2017 ACM Symposium on Document Engineering

Pages 197–200

ABSTRACT

This paper describes the results of an investigation into methods of preprocessing architectural plots to enable them to be processed very quickly via OCR, detecting the region containing the relevant metadata legend and obtaining it in machine-readable form for e.g. automated folding and filenaming applications. We show how a processing pipeline adapted to this type of content can vastly decrease processing time, maintaining acceptable accuracy. Initial results show a reduction in total processing time from 2--3 minutes to around 15 seconds for most documents encountered, with the folding orientation being correctly detected in 78% of cases and the legend region being completely detected in 60% of cases, high enough for the use-case at hand.

References

Christian Ah-Soon and Karl Tombre. 1997. Variations on the Analysis of Architectural Drawings ICDAR 1997: Proceedings of the Fourth International Conference on Document Analysis and Recognition.Google Scholar
S. Ahmed, M. Liwicki, M. Weber, and A. Dengel. 2011. Improved Automatic Analysis of Architectural Floor Plans ICDAR 2011: Proceedings of the 11th International Conference on Document Analysis and Recognition.Google Scholar
L. A. Fletcher and R. Kasturi. 1988. A Robust Algorithm for Text String Separation from Mixed Text/Graphics Images. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 10, 6 (1988).Google ScholarDigital Library
J. Gllavata, R. Ewerth, and B. Freisleben. 2004. Text Detection in Images Based on Unsupervised Classification of High-Frequency Wavelet Coefficients. In ICPR 2004: Proceedings of the 17th International Conference on Pattern Recognition.Google Scholar
M. Goebel, T. Hassan, E. Oro, and G. Orsi. 2013. ICDAR 2013 Table Competition. In ICDAR 2013: Proceedings of the 12th International Conference on Document Analysis and Recognition.Google Scholar
R. W. Lienhart and Frank Stuber. 1996. Automatic text recognition in digital videos. In Image and Video Processing IV: SPIE Proceedings 2666.Google ScholarCross Ref
G. Nagy, S. Seth, and M. Viswanathan. 1992. A prototype document image analysis system for technical journals. Computer, Vol. 25, 7 (1992).Google Scholar

Index Terms

High-Performance Preprocessing of Architectural Drawings for Legend Metadata Extraction via OCR
1. Applied computing
  1. Document management and text processing
    1. Document capture
      1. Document analysis
    2. Document management
      1. Document metadata

Recommendations

Table of Contents Recognition in OCR Documents using Image-based Machine Learning
ACM SE '19: Proceedings of the 2019 ACM Southeast Conference

The importance of automatic analysis of Optical Character Recognition (OCR) documents has been increasingly recognized to assist with efficient data managements and accessibility. However, most OCR documents are unstructured, making the analysis ...
Read More
Binarization, character extraction, and writer identification of historical Hebrew calligraphy documents

We present our work on the paleographic analysis and recognition system intended for processing of historical Hebrew calligraphy documents. The main goal is to analyze documents of different writing styles in order to identify the locations, dates, and ...
Read More
Automatic extraction of numerical sequences in handwritten incoming mail documents

In this paper, we propose a method for the automatic extraction of numerical fields in handwritten documents. The approach exploits the known syntactic structure of the numerical field to extract, combined with a set of contextual morphological features ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
DocEng '17: Proceedings of the 2017 ACM Symposium on Document Engineering
August 2017
242 pages
ISBN:9781450346894
DOI:10.1145/3103010
General Chair:
Kenneth Camilleri
University of Malta, Malta
,
Program Chair:
Alexandra Bonnici
University of Malta, Malta
Copyright © 2017 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 31 August 2017
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
document analysis
image processing
ocr preprocessing
Qualifiers
- short-paper
Conference

Acceptance Rates
DocEng '17 Paper Acceptance Rate13of71submissions,18%Overall Acceptance Rate178of537submissions,33%
More
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 0
  Total Citations
  View Citations
- 103
  Total Downloads
- Downloads (Last 12 months)8
- Downloads (Last 6 weeks)1
Other Metrics
View Author Metrics
Cited By
This publication has not been cited yet

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

High-Performance Preprocessing of Architectural Drawings for Legend Metadata Extraction via OCR

DocEng '17: Proceedings of the 2017 ACM Symposium on Document Engineering

ABSTRACT

References

Cited By

Index Terms

Recommendations

Table of Contents Recognition in OCR Documents using Image-based Machine Learning

Binarization, character extraction, and writer identification of historical Hebrew calligraphy documents

Automatic extraction of numerical sequences in handwritten incoming mail documents

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

High-Performance Preprocessing of Architectural Drawings for Legend Metadata Extraction via OCR

DocEng '17: Proceedings of the 2017 ACM Symposium on Document Engineering

ABSTRACT

References

Cited By

Index Terms

Recommendations

Table of Contents Recognition in OCR Documents using Image-based Machine Learning

Binarization, character extraction, and writer identification of historical Hebrew calligraphy documents

Automatic extraction of numerical sequences in handwritten incoming mail documents

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media