skip to main content
10.1145/1815330.1815340acmotherconferencesArticle/Chapter ViewAbstractPublication PagesdasConference Proceedingsconference-collections
research-article

Context-aware and content-based dynamic Voronoi page segmentation

Published:09 June 2010Publication History

ABSTRACT

This paper presents a dynamic approach to document page segmentation based on inter-component relationships, local patterns and context features. State-of-the art page segmentation algorithms segment zones based on local properties of neighboring connected components such as distance and orientation, and do not typically consider additional properties other than size. Our proposed approach uses a contextually aware and dynamically adaptive page segmentation scheme. The page is first over-segmented using a dynamically adaptive scheme of separation features based on [2] and adapted from [13]. A decision to form zones is then based on the context built from these local separation features and high-level content features. Zone-based evaluation was performed on sets of printed and handwritten documents in English and Arabic scripts with multiple font types, sizes and we achieved an increase of 15% over the accuracy reported in [2].

References

  1. W. Abd Almageed, M. Agrawal, W. Seo, and D. Doermann. Document-zone classification using partial least squares and hybrid classifiers. Int'l Conf. on Patt. Reco., pages 1--4, 2008.Google ScholarGoogle Scholar
  2. M. Agrawal and D. Doermann. Voronoi++: A dynamic page segmentation approach based on voronoi and docstrum features. In Proc. 10th Int'l Conf. on Doc. Analysis and Reco., pages 1011--1015, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. A. Antonacopoulos and R. Ritchings. Flexible page segmentation using the background. In Proc. 12th Int'l Conf. on Patt. Reco., volume 2, pages 339--344, Oct 1994.Google ScholarGoogle ScholarCross RefCross Ref
  4. H. S. Baird. Background structure in document images. In Advances in Structural and Syntactic Pattern Recognition, pages 17--34. World Scientific, 1994.Google ScholarGoogle Scholar
  5. T. M. Breuel. Two geometric algorithms for layout analysis. In Workshop on Document Analysis Systems, pages 188--199. Springer-Verlag, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. V. Ferrari, L. Fevrier, F. Jurie, and C. Schmid. Groups of adjacent contour segments for object detection. IEEE Trans. Patt. Anal. Mach. Intell., 30(1):36--51, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. L. A. Fletcher and R. Kasturi. A robust algorithm for text string separation from mixed text/graphics images. IEEE Trans. Pattern Anal. Mach. Intell., 10(6):910--918, 1988. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. I. Guyon, R. M. Haralick, J. J. Hull, and I. T. Phillips. Data sets for ocr and document image understanding research. In Proc. of SPIE - Document Recognition IV, pages 779--799. World Scientific, 1997.Google ScholarGoogle ScholarCross RefCross Ref
  9. F. Hones and J. Lichter. Layout extraction of mixed-mode documents. Mach. Vision Appl., 7(4):237--246, 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. A. Jain and Y. Zhong. Page segmentation using texture analysis. Patt. Reco., 29(5):743--770, May 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. A. K. Jain and S. Bhattacharjee. Text segmentation using gabor filters for automatic document processing. Mach. Vision Appl., 5(3):169--184, 1992. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. N. Kato, M. Suzuki, S. Omachi, H. Aso, and Y. Nemoto. A handwritten character recognition system using directional element feature and asymmetric mahalanobis distance. IEEE Trans. Patt. Anal. Mach. Intell., 21(3):258--262, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. K. Kise, A. Sato, and M. Iwata. Segmentation of page images using the area voronoi diagram. Comput. Vis. Image Underst., 70(3):370--382, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. D. G. Lowe. Distinctive image features from scale-invariant keypoints. Int'l J. Comput. Vision, 60(2):91--110, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. S. J. M. Roth and D. Doermann. Gedi: Ground truth. editor and document interface. In Summit on Arabic and Chinese Handwriting Recognition, 2006.Google ScholarGoogle Scholar
  16. S. Mao and T. Kanungo. Automatic training of page segmentation algorithms: An optimization approach. In Proc. of Int'l Conf. on Patt. Reco., pages 531--534, 2000.Google ScholarGoogle Scholar
  17. G. Nagy, S. Seth, and M. Viswanathan. A prototype document image analysis system for technical journals. Computer, 25(7):10--22, 1992. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. N. Normand and C. Viard-Gaudin. A background based adaptive page segmentation algorithm. In Proc. 3rd Int'l Conf. on Doc. Analysis and Reco., page 138, Washington, DC, USA, 1995. IEEE Computer Society. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. L. O'Gorman. The document spectrum for page layout analysis. IEEE Trans. Patt. Anal. Mach. Intell., 15(11):1162--1173, 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. T. Ojala, M. Pietikäinen, and T. Mäenpää. Multiresolution gray-scale and rotation invariant texture classification with local binary patterns. IEEE Trans. Pattern Anal. Mach. Intell., 24(7):971--987, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. T. Pavlidis and J. Zhou. Page segmentation and classification. CVGIP: Graph. Models Image Process., 54(6):484--496, 1992. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. I. Sekita, R. Mori, K. Yamamoto, H. Yamada, and K. Toraichi. Feature extraction of handwritten japanese characters by spline functions for relaxation matching. Patt. Reco., 21(1):9--17, 1988. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. W. Seo, M. Agrawal, and D. Doermann. Performance evaluation tools for zone segmentation and classification (PETS). Int'l Conf. on Patt. Reco., 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. F. Shafait, D. Keysers, and T. M. Breuel. Performance comparison of six algorithms for page segmentation. In 7th IAPR Workshop on Document Analysis Systems, pages 368--379. Springer, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Y. Wang, I. T. Phillips, and R. M. Haralick. Document zone content classification and its performance evaluation. Patt. Reco., 39(1):57--73, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. K. Y. Wong, R. G. Casey, and F. M. Wahl. Document Analysis System. j-IBM-JRD, 26(6):647--656, Nov. 1982. Google ScholarGoogle ScholarDigital LibraryDigital Library

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image ACM Other conferences
    DAS '10: Proceedings of the 9th IAPR International Workshop on Document Analysis Systems
    June 2010
    490 pages
    ISBN:9781605587738
    DOI:10.1145/1815330

    Copyright © 2010 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 9 June 2010

    Permissions

    Request permissions about this article.

    Request Permissions

    Check for updates

    Qualifiers

    • research-article

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader