Paper
7 February 2011 Adaptive removal of background and white space from document images using seam categorization
Claude Fillion, Zhigang Fan, Vishal Monga
Author Affiliations +
Proceedings Volume 7879, Imaging and Printing in a Web 2.0 World II; 78790A (2011) https://doi.org/10.1117/12.877266
Event: IS&T/SPIE Electronic Imaging, 2011, San Francisco Airport, California, United States
Abstract
Document images are obtained regularly by rasterization of document content and as scans of printed documents. Resizing via background and white space removal is often desired for better consumption of these images, whether on displays or in print. While white space and background are easy to identify in images, existing methods such as naïve removal and content aware resizing (seam carving) each have limitations that can lead to undesirable artifacts, such as uneven spacing between lines of text or poor arrangement of content. An adaptive method based on image content is hence needed. In this paper we propose an adaptive method to intelligently remove white space and background content from document images. Document images are different from pictorial images in structure. They typically contain objects (text letters, pictures and graphics) separated by uniform background, which include both white paper space and other uniform color background. Pixels in uniform background regions are excellent candidates for deletion if resizing is required, as they introduce less change in document content and style, compared with deletion of object pixels. We propose a background deletion method that exploits both local and global context. The method aims to retain the document structural information and image quality.
© (2011) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Claude Fillion, Zhigang Fan, and Vishal Monga "Adaptive removal of background and white space from document images using seam categorization", Proc. SPIE 7879, Imaging and Printing in a Web 2.0 World II, 78790A (7 February 2011); https://doi.org/10.1117/12.877266
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Image quality

Detection and tracking algorithms

Visualization

Printing

Current controlled current source

Electrical engineering

Electronic imaging

RELATED CONTENT

Building a scalable storage for images in a social network
Proceedings of SPIE (February 21 2012)
Improve artwork designs through data ranking system
Proceedings of SPIE (February 16 2011)
Layout hierarchies for interactive design reuse
Proceedings of SPIE (February 21 2012)
A URL shortener for mobile web consumption
Proceedings of SPIE (February 21 2012)
Partitioning of the degradation space for OCR training
Proceedings of SPIE (January 16 2006)

Back to Top