Abstract
Chinese character recognition (CCR) is an important branch of pattern recognition. It has long been considered an extremely difficult problem because of the very large number of categories, the complicated structures of the characters, the similarity between characters, and the variability of fonts and writing styles. Because of its unique technical challenges and great social need, the field has seen intensive research over the last four decades and a rapid increase in successful applications. However, higher recognition performance is still needed to improve existing applications and to open up new ones. This paper first provides an overview of Chinese character recognition and the properties of Chinese characters. It then summarizes some important methods and successful results in the history of Chinese character recognition. Among classification methods, the paper pays special attention to the syntactic-semantic approach for online Chinese character recognition and to the metasynthesis approach for discipline crossing. Finally, the remaining problems and possible solutions are discussed.
Dai, R., Liu, C. & Xiao, B. Chinese character recognition: history, status and prospects. Front. Comput. Sc. China 1, 126–136 (2007). https://doi.org/10.1007/s11704-007-0012-5
DOI: https://doi.org/10.1007/s11704-007-0012-5