Paper
4 February 2013 Local projection-based character segmentation method for historical Chinese documents
Linjie Yang, Liangrui Peng
Author Affiliations +
Proceedings Volume 8658, Document Recognition and Retrieval XX; 86580O (2013) https://doi.org/10.1117/12.2008338
Event: IS&T/SPIE Electronic Imaging, 2013, Burlingame, California, United States
Abstract
Digitization of historical Chinese documents includes two key technologies, character segmentation and character recognition. This paper focuses on developing character segmentation algorithm. As a preprocessing step, we combine several effective measures to remove noises in a historical Chinese document image. After binarization, a new character segmentation algorithm segment single characters based on projections of a cost image in local windows. The cost image is constructed by utilizing the information of stroke bounding boxes and a skeleton image extracted from the binarized image. We evaluate the proposed algorithm based on matching degrees of character bounding boxes between segmentation results and ground-truth data, and achieve a recall rate of 74.3% on a test set, which shows the effectiveness of the proposed algorithm.
© (2013) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Linjie Yang and Liangrui Peng "Local projection-based character segmentation method for historical Chinese documents", Proc. SPIE 8658, Document Recognition and Retrieval XX, 86580O (4 February 2013); https://doi.org/10.1117/12.2008338
Lens.org Logo
CITATIONS
Cited by 3 scholarly publications.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Image segmentation

Image processing algorithms and systems

Feature extraction

Detection and tracking algorithms

Algorithm development

Optical character recognition

Denoising

RELATED CONTENT


Back to Top