Abstract
In this paper, we intensively study the behavior of three part-based methods for handwritten digit recognition. The principle of the proposed methods is to represent a handwritten digit image as a set of parts and recognize the image by aggregating the recognition results of individual parts. Since part-based methods do not rely on the global structure of a character, they are expected to be more robust against various deformations which may damage the global structure. The proposed three methods are based on the same principle but different in their details, for example, the way of aggregating the individual results. Thus, those methods have different performances. Experimental results show that even the simplest part-based method can achieve recognition rate as high as 98.42% while the improved one achieved 99.15%, which is comparable or even higher than some state-of-the-art method. This result is important because it reveals that characters can be recognized without their global structure. The results also show that the part-based method has robustness against deformations which usually appear in handwriting.
Similar content being viewed by others
References
Bart E, Ullman S. Class-based matching of object parts. In: Proceedings of the 2004 Conference on Computer Vision and Pattern Recognition Workshop. 2004, 173
Zhang J, MarszaÅĆek M, Lazebnik S, Schmid C. Local features and kernels for classification of texture and object categories: a comprehensive study. International Journal of Computer Vision, 2007, 73(2): 213–238
Mikolajczyk K, Schmid C. A performance evaluation of local descriptors. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2005, 27(10): 1615–1630
Plamondon R, Srihari S N. Online and off-line handwriting recognition: a comprehensive survey. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2000, 22(1): 63–84
Carneiro G. The automatic design of feature spaces for local image descriptors using an ensemble of non-linear feature extractors. In: Proceedings of the 2010 IEEE Conference on Computer Vision and Pattern Recognition. 2010, 3509–3516
Lim K L, Galoogahi H K. Shape classification using local and global features. In: Proceedings of the 4th Pacific-Rim Symposium on Image and Video Technology. 2010, 115–120
Keren D. Painter identification using local features and naive bayes. In: Proceedings of the 2002 International Conference on Pattern Recognition. 2002, 474–477
Bart E, Byvatov E, Ullman S. View-invariant recognition using corresponding object fragments. Computer Vision, 2004, 152-165
Song C, Yang F, Li P. Rotation invariant texture measured by local binary pattern for remote sensing image classification. In: Proceedings of the 2010 International Workshop on Education Technology and Computer Science. 2010, 3–6
Liang P, Li S F, Qin J W. Multi-resolution local binary patterns for image classification. In: Proceedings of the 2010 International Conference onWavelet Analysis and Pattern Recognition. 2010, 164–169
Suruliandi A, Srinivasan E M, Ramar K. Image resolution dependency of local texture patterns in classification of color images. In: Proceedings of the 2010 IEEE Annual India Conference. 2010, 1–6
Lazebnik S, Schmid C, Ponce J. Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: Proceedings of the 2006 Computer Society Conference on Computer Vision and Pattern Recognition. 2006, 2169–2178
Ullman S, Epshtein B. Visual classification by a hierarchy of extended fragments. In: Proceedings of Toward Category-Level Object Recognition. 2006, 321–344
Ohm J R, Ma P. Feature-based cluster segmentation of image sequences. In: Proceedings of the 1997 International Conference on Image Processing. 1997, 178–181
Wakabayashi T, Tsuruoka S, Kimura F, Miyake Y. On the size and variable transformation of feature vector for handwritten character recognition. IEICE Transactions Japan, 1993, J76-D-II(12): 2495–2503
Srikantan G, Lam SW, Srihari S N. Gradient-based contour encoding for character recognition. Pattern Recognition, 1996, 29(7): 1147–1160
Belongie S, Malik J, Puzicha J. Shape matching and object recognition using shape contexts. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2002, 24(4): 509–522
Lecun Y, Bottou L, Bengio Y, Haffner P. Gradient-based learning applied to document recognition. Proceedings of the IEEE, 1998, 86(11): 2278–2324
Teow L N, Loe K F. Robust vision-based features and classification schemes for off-line handwritten digit recognition. Pattern Recognition, 2002, 35(11): 2355–2364
Mayraz G, Hinton G E. Recognizing handwritten digits using hierarchical products of experts. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2002, 24(2): 189–197
Liu C L, Nakashima K, Sako H, Fujisawa H. Handwritten digit recognition: benchmarking of state-of-the-art techniques. Pattern Recognition, 2003, 36(10): 2271–2285
Li Z C, Suen C Y. Crucial combinations of parts for handwritten alphanumeric characters. Mathematical and Computer Modelling, 2000, 31(8-9): 193–229
Li Z C, Li H J, Suen C Y, Wang H Q, Liao S Y. Recognition of handwritten characters by parts with multiple orientations. Mathematical and Computer Modelling, 2002, 35(3–4): 441–479
Suen C Y, Guo J, Li Z C. Analysis and recognition of alphanumeric handprints by parts. IEEE Transactions on Systems, Man and Cybernetics, 1994, 24(4): 614–631
Li Z C, Suen C Y. The partition-combination method for recognition of handwritten characters. Pattern Recognition Letters, 2000, 21(8): 701–720
Chellapilla K, Simard P. Using machine learning to break visual human interaction proofs (HIPs). Advances in Neural Information Processing Systems, 2004, 17: 265–272
Chellapilla K, Larson K, Simard P, Czerwinski M. Computers beat humans at single character recognition in reading based human interaction proofs. In: Proceedings of the 2005 Conference on Email and Anti-Spam. 2005
Mori G, Malik J. Recognizing objects in adversarial clutter: Breaking a visual CAPTCHA. In: Proceedings of the 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 2003, 134–141
Campos T E, Babu B R, Varma M. Character recognition in natural images. In: Proceedings of the 2009 International Conference on Computer Vision Theory and Applications. 2009
Coates A, Carpenter B, Case C, Satheesh S, Suresh B, Wang T, Wu D J, Ng A Y. Text detection and character recognition in scene images with unsupervised feature learning. In: Proceedings of the 2011 International Conference on Document Analysis and Recognition. 2011, 440–445
Diem M, Sablatnig R. Recognition of degraded handwritten characters using local features. In: Proceedings of the 2009 International Conference on Document Analysis and Recognition. 2009, 221–225
Diem M, Sablatnig R. Are characters objects? In: Proceedings of International Conference on Frontiers in Handwriting Recognition. 2010, 565–570
Garz A, Diem M, Sablatnig R. Detecting text areas and decorative elements in ancient manuscripts. In: Proceedings of the 2010 International Conference on Frontiers in Handwriting Recognition. 2010, 176–181
Sankar K P, Jawahar C V, Manmatha R. Nearest neighbor based collection ocr. In: Proceedings of the 2010 International Workshop on Document Analysis Systems. 2010, 207–214
Bay H, Tuytelaars T, Van Gool L. Surf: speeded up robust features. Computer Vision, 2006, 404–417
Lowe D G. Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision, 2004, 60(2): 91–110
Boiman O, Shechtman E, Irani M. In defense of nearest-neighbor based image classification. In: Proceedings of the 2008 IEEE Conference on Computer Vision and Pattern Recognition. 2008, 1–8
Author information
Authors and Affiliations
Corresponding author
Additional information
Song Wang received his BS in physics from Hebei University, and ME in computer science from Huazhong University of Science and Technology, China. Since 2010, he has been a PhD student in the Department of Intelligent Systems of the Graduate School of Information Science and Electrical Engineering, Kyushu University, Japan. His research interests include pattern recognition, off-line and on-line handwritten character recognition, image classification, scene character detection, and document analysis.
Seiichi Uchida received BE, ME, and PhD from Kyushu University, Japan, in 1990, 1992, and 1999, respectively. From 1992 to 1996, he joined SECOM Co., Ltd., Japan. Currently, he is a professor at Kyushu University. His research interests include pattern recognition and image processing. He received 2002 IEICE PRMU Research Encouraging Award, 2008 IEICE Best Paper Award, MIRU 2006 Nagao Award (best paper award), MIRU 2011 Excellent Paper Award, 2007 IAPR/ICDAR Best Paper Award, and 2010 ICFHR Best Paper Award. Dr. Uchida is a member of IEEE and IPSJ.
Marcus Liwicki received his MS in computer science from the Free University of Berlin, Germany, in 2004, and his PhD from the University of Bern, Switzerland, in 2007. Subsequently, he received the postdoctoral lecture qualification from the Technical University of Kaiserslautern, Germany, in 2011. Currently he is a senior researcher and private lecturer at the German Research Center for Artificial Intelligence (DFKI). His research interests include knowledge management, semantic desktop, electronic pen-input devices, on-line and off-line handwriting recognition, and document analysis. From October 2009 to March 2010 he visited Kyushu University (Fukuoka, Japan) as a research fellow, supported by the Japanese Society for the Promotion of Science.
Yaokai Feng received his BE andME in computer science from Tianjin University, China, in 1986 and 1992, respectively. He received his PhD in information science from Kyushu University, Japan, in 2004. Now, he is an assistant professor at Kyushu University, Japan. His current research interests include database, pattern recognition, information retrieval, and network security. In 2011, he received MIRU 2011 Excellent Paper Award. He is a member of IPSJ and IEEE.
Rights and permissions
About this article
Cite this article
Wang, S., Uchida, S., Liwicki, M. et al. Part-based methods for handwritten digit recognition. Front. Comput. Sci. 7, 514–525 (2013). https://doi.org/10.1007/s11704-013-2297-x
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11704-013-2297-x