ABSTRACT
This paper describes the results of an investigation into methods of preprocessing architectural plots so that they can be processed very quickly via OCR: the region containing the relevant metadata legend is detected and extracted in machine-readable form, e.g. for automated folding and file-naming applications. We show how a processing pipeline adapted to this type of content can greatly decrease processing time while maintaining acceptable accuracy. Initial results show a reduction in total processing time from 2-3 minutes to around 15 seconds for most documents encountered, with the folding orientation correctly detected in 78% of cases and the legend region completely detected in 60% of cases, which is high enough for the use case at hand.
Index Terms
- High-Performance Preprocessing of Architectural Drawings for Legend Metadata Extraction via OCR