skip to main content
10.1145/1815330.1815354acmotherconferencesArticle/Chapter ViewAbstractPublication PagesdasConference Proceedingsconference-collections
research-article

Document image segmentation using discriminative learning over connected components

Published: 09 June 2010 Publication History

Abstract

Segmentation of a document image into text and non-text regions is an important preprocessing step for a variety of document image analysis tasks, like improving OCR, document compression etc. Most of the state-of-the-art document image segmentation approaches perform segmentation using pixel-based or zone(block)-based classification. Pixel-based classification approaches are time consuming, whereas block-based methods heavily depend on the accuracy of block segmentation step. In contrast to the state-of-the-art document image segmentation approaches, our segmentation approach introduces connected component based classification, thereby not requiring a block segmentation beforehand. Here we train a self-tunable multi-layer perceptron (MLP) classifier for distinguishing between text and non-text connected components using shape and context information as a feature vector. Experimental results prove the effectiveness of our proposed algorithm. We have evaluated our method on subset of UW-III, ICDAR 2009 page segmentation competition test images and circuit diagrams datasets and compared its results with the state-of-the-art leptonica's page segmentation algorithm.

References

[1]
A. Antonacopoulos, D. Bridson, C. Papadopoulos, and S. Pletschacher. ICDAR 2009 page segmentation competition. In Proc. Int. Conf. Documnet Analysis and Recognition (ICDAR2009), pages 1370--1374, Barcelona, Spain, 2009.
[2]
D. S. Bloomberg and F. R. Chen. Extraction of text-related features for condensing image documents. In SPIE Conf. 2660, Document Recgnition III, pages 72--88, San Jose, CA, 1996.
[3]
T. M. Breuel and F. Shafait. Automlp: Simple, effective, fully automated learning rate and size adjustment. In The Learning Workshop, Snowbird, Utah, 2010.
[4]
Z. Chi and K. W. Wong. A two-stage binarization approach for document images. In Proc. Int. Symp. Intelligent Multimedia, Video and Speech Processing (ISIMP'01), pages 275--278, 2001.
[5]
D. Keysers, F. Shafait, and T. M. Breuel. Document image zone classification- a simple high-performance approach. In Proc. 2nd Int. Conf. Computer Vision Theory and Applications, pages 44--51, Barcelona, Spain, Mar. 2007.
[6]
S. Marinai, M. Gori, and G. Soda. Artificial neural networks for document analysis and recognition. In IEEE Transactions on Pattern Analysis and Machine Intelligence, volume 27(1), Jan. 2005.
[7]
M. A. Moll and H. S. Baird. Segmentation-based retrieval of document images from diverse collections. In Document Recognition and Retrieval XV, Proc. of the SPIE, volume 6815, pages 68150L--68150L, 2008.
[8]
M. A. Moll, H. S. Baird, and C. An. Truthing for pixel-accurate segmentation. In Document Analysis Systems, the Eighth IAPR Int. Workshop, pages 379--385, Sep. 2008.
[9]
O. Okun, D. Doermann, and M. Pietikainen. Page segmentation and zone calssification: the state of art. In Technical Report LAM-TR-036, CAR-TR-927, CS-TR-4079, University of Maryland, College Park, Nov. 1999.
[10]
N. Rondel and G. Breuel. Coorperation of multilayer perceptrons for the estimation of skew angle in text document images. Proc. Int. Conf. Documnet Analysis and Recognition (ICDAR'95), pages 1141--1144, 1995.
[11]
F. Shafait, D. Keysers, and T. M. Breuel. Performance evaluation and benchmarking of six page segmentation algorithms. IEEE Transactions on Pattern Analysis and Machine Intelligence, 30(6):941--954, Jun 2008.
[12]
Y. Wang, I. Phillips, and R. Haralick. Document zone content classification and its performance evaluation. In Pattern Recognition, volume 39, pages 57--73, 2006.
[13]
C. S. Won. Image extraction in digital documents. In Journal of Electronic Imaging, volume 17, page 033016, 2008.

Cited By

View all
  • (2024)Digitizing History: Transitioning Historical Paper Documents to Digital Content for Information Retrieval and Mining—A Comprehensive SurveyIEEE Transactions on Computational Social Systems10.1109/TCSS.2024.337841911:5(6151-6180)Online publication date: Oct-2024
  • (2024)Multi-Level Graph Convolutional Network for Document Information Extraction2024 IEEE 36th International Conference on Tools with Artificial Intelligence (ICTAI)10.1109/ICTAI62512.2024.00044(247-255)Online publication date: 28-Oct-2024
  • (2024)DCT-CompSegNet: fast layout segmentation in DCT compressed JPEG document images using deep feature learningMultimedia Tools and Applications10.1007/s11042-024-18204-083:25(66201-66221)Online publication date: 22-Jan-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
DAS '10: Proceedings of the 9th IAPR International Workshop on Document Analysis Systems
June 2010
490 pages
ISBN:9781605587738
DOI:10.1145/1815330
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 09 June 2010

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Research-article

Conference

DAS '10

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)10
  • Downloads (Last 6 weeks)1
Reflects downloads up to 02 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Digitizing History: Transitioning Historical Paper Documents to Digital Content for Information Retrieval and Mining—A Comprehensive SurveyIEEE Transactions on Computational Social Systems10.1109/TCSS.2024.337841911:5(6151-6180)Online publication date: Oct-2024
  • (2024)Multi-Level Graph Convolutional Network for Document Information Extraction2024 IEEE 36th International Conference on Tools with Artificial Intelligence (ICTAI)10.1109/ICTAI62512.2024.00044(247-255)Online publication date: 28-Oct-2024
  • (2024)DCT-CompSegNet: fast layout segmentation in DCT compressed JPEG document images using deep feature learningMultimedia Tools and Applications10.1007/s11042-024-18204-083:25(66201-66221)Online publication date: 22-Jan-2024
  • (2024)The digitization of historical astrophysical literature with highly localized figures and figure captionsInternational Journal on Digital Libraries10.1007/s00799-023-00350-925:3(471-491)Online publication date: 1-Sep-2024
  • (2023)Segmentation-Less Extraction of Text and Non-Text Regions From JPEG 2000 Compressed Document Images Through Partial and Intelligent DecompressionIEEE Access10.1109/ACCESS.2023.324996111(20673-20687)Online publication date: 2023
  • (2023)Integrated document segmentation and region identification: textual, equation and graphicalMultimedia Systems10.1007/s00530-023-01171-129:6(3447-3466)Online publication date: 12-Sep-2023
  • (2023)Document Region ClassificationDocument Layout Analysis10.1007/978-981-99-4277-0_4(43-65)Online publication date: 1-Aug-2023
  • (2023)Document Region SegmentationDocument Layout Analysis10.1007/978-981-99-4277-0_3(31-42)Online publication date: 1-Aug-2023
  • (2022)Figure and Figure Caption Extraction for Mixed Raster and Vector PDFs: Digitization of Astronomical Literature with OCR FeaturesLinking Theory and Practice of Digital Libraries10.1007/978-3-031-16802-4_5(52-67)Online publication date: 20-Sep-2022
  • (2021)Vision-Based Layout Detection from Scientific Literature using Recurrent Convolutional Neural Networks2020 25th International Conference on Pattern Recognition (ICPR)10.1109/ICPR48806.2021.9412557(6455-6462)Online publication date: 10-Jan-2021
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media