Skip to main content
Log in

Chinese character recognition: history, status and prospects

  • Review Article
  • Published:
Frontiers of Computer Science in China Aims and scope Submit manuscript

Abstract

Chinese character recognition (CCR) is an important branch of pattern recognition. It was considered as an extremely difficult problem due to the very large number of categories, complicated structures, similarity between characters, and the variability of fonts or writing styles. Because of its unique technical challenges and great social needs, the last four decades witnessed the intensive research in this field and a rapid increase of successful applications. However, higher recognition performance is continuously needed to improve the existing applications and to exploit new applications. This paper first provides an overview of Chinese character recognition and the properties of Chinese characters. Some important methods and successful results in the history of Chinese character recognition are then summarized. As for classification methods, this article pays special attention to the syntactic-semantic approach for online Chinese character recognition, as well as the metasynthesis approach for discipline crossing. Finally, the remaining problems and the possible solutions are discussed.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Similar content being viewed by others

References

  1. Stallings W. Approaches to Chinese character recognition. Pattern Recognition, 1976, 8(2): 87–98

    Article  MATH  Google Scholar 

  2. Mori S, Yamamoto K, Yasuda M. Research on machine recognition of handprinted characters. IEEE Trans. Pattern Analysis and Machine Intelligence, 1984, 6(4): 386–405

    Google Scholar 

  3. Umeda M. Advances in recognition methods for handwritten Kanji characters. IEICE Trans. Information and Systems, 1996, E29(5): 401–410

    Google Scholar 

  4. Hildebrandt T H, Liu W. Optical recognition of handwritten Chinese characters: advances since 1980. Pattern Recognition, 1993, 26(2): 205–225

    Article  Google Scholar 

  5. Liu C-L, Jaeger S, Nakagawa M. Online recognition of Chinese characters: the state-of-the-art. IEEE Trans. Pattern Analysis and Machine Intelligence, 2004, 26(2): 198–213

    Article  Google Scholar 

  6. Nagy G. Pattern Recognition 1966 IEEE Workshop, IEEE Spectrum, Feb. 1967, 92–94

  7. Iijima T, Genchi H, Mori K. A theory of character recognition by pattern matching method. ln: Proceedings of. 1st IJCPR, 1973, 50–56

  8. Yasuda M, Fujisawa H. An improved correlation method for character recognition. Systems, Computers, and Controls, 1979, 10(2): 29–38 (Translated from Trans. IEICE Japan, 1979, 62-D(3): 217–224)

    Google Scholar 

  9. Yamashita Y, Higuchi K, Yamada Y, et al. Classification of hand-printed Kanji characters by the structured segment matching method. Pattern Recognition Letters, 1983, 1: 475–479

    Article  Google Scholar 

  10. Casey R, Nagy G. Recognition of printed Chinese characters. IEEE Trans. Electronic Computers, 1966, EC-15(1): 91–101

    Article  Google Scholar 

  11. Yamamoto S, Nakajima A, Nakata K. Chinese character recognition by hierarchical pattern matching. In: Proceedings of 1st IJCPR, 1973, 183–194

  12. Xu L, Krzyzak A, Suen C Y. Methods of combining multiple classifiers and their applications to handwriting recognition. IEEE Trans. System, Man, and Cybernetics, 1992, 27(3): 418–435

    Article  Google Scholar 

  13. Kittler J, Hatef M, Duin R P W, et al. On combining classifiers. IEEE Trans. Pattern Analysis and Machine Intelligence, 1998, 20(3): 226–239

    Article  Google Scholar 

  14. Liu j. Real Time Chinese Handwriting Recognition. E.E. Thesis, MIT, Cambridge, 1966

    Google Scholar 

  15. Zobrak M. A method for rapid recognition hand drawn line patterns. M.S. Thesis, University of Pittsburgh, 1966

  16. Yamamoto k, Rosenfeld A. Recognition of handprinted Kanji characters by a relaxation method. In: Proceedings of 6th ICPR, Munich, 1982, 395–398

  17. Fu K S. Syntactic Methods in Pattern Recognition. Academic Press, 1974

  18. Fu K S. Syntactic Pattern Recognition and Applications. Prentice-Hall, 1982

  19. Tai J W. A syntactic-semantic approach for Chinese character recognition. In: Proceedings of 7th ICPR, Montreal, Canada, 1984, 374–376

  20. Kimura F, Takashina K, Tsuruoka S, et al. Modified quadratic discriminant functions and the application to Chinese character recognition. IEEE Trans. Pattern Analysis and Machine Intelligence, 1987, 9(1): 149–153

    Google Scholar 

  21. Kim I-J, Kim J H. Statistical character structure modeling and its application to handwritten Chinese character recognition. IEEE Trans. Pattern Analysis and Machine Intelligence, 2003, 25(11): 1422–1436

    Article  Google Scholar 

  22. Tsukumo J, Tanaka H. Classification of handprinted Chinese characters using non-linear normalization and correlation methods. In: Proceedings of 9th ICPR, Rome, 1988, 168–171

  23. Yamada H, Yamamoto K, Saito T. A nonlinear normalization method for hanprinted Kanji character recognition-line density equalization. Pattern Recognition, 1990, 23(9): 1023–1029

    Article  Google Scholar 

  24. Gu Y X, Wang Q R, Suen C Y. Application of a multilayer decision tree in computer recognition of Chinese characters. IEEE Trans. Pattern Analysis and Machine Intelligence, 1983, 5(1): 83–89

    Google Scholar 

  25. Wang Q R, Suen C Y. Analysis and design of a decision tree based on entropy reduction and its application to large character set recognition. IEEE Trans. Pattern Analysis and Machine Intelligence, 1984, 6(4): 406–417

    Article  MATH  Google Scholar 

  26. Tai J W, Liu Y J. Chinese character recognition. In: Bunke H, Sanfeliu A, eds. Syntactic and Structural Pattern Recognition-Theory and Application. World Scientific, 1989

  27. Hao h, Xiao X, Dai R. Handwritten Chinese character recognition by metasynthesis approach. Pattern Recognition, 1997, 30(8), 1321–1328

    Article  Google Scholar 

  28. Lin X, Ding X, Chen M, et al. Adaptive confidence transform based classifier combination for Chinese character recognition. Pattern Recognition Letters, 1998, 19(10): 975–988

    Article  Google Scholar 

  29. Yamamoto K, Yamada H, Saito T, et al. Recognition of handprinted Chinese characters and Japanese cursive syllabury. In: Proceedings of 7th ICPR, Montreal, 1984, 385–388

  30. Yamada H. Contour DP matching method and its application to handprinted Chinese character recognition. In: Proceedings of 7th ICPR, Montreal, 1984, 389–392

  31. Yamamoto K, Yamada H, Saito T, et al. Recognition of handprinted characters in the first level of JIS Chinese characters. In: Proceedings of 8th ICPR, Paris, 1986, 570–572

  32. Tsukumo J. Handprinted Kanji character recognition based on flexible template matching. In: Proceedings of 11th ICPR, The Hague, 1992, Vol.2, 483–486

    Google Scholar 

  33. Guo J, Sun N, Nemoto Y, et al. Recognition of handwritten characters using pattern transformation method with cosine function, Trans. IEICE Japan, 1993, J76-D-II(4): 835–842 (in Japanese)

    Google Scholar 

  34. Saruta K, Kato N, Abe M, et al. High accuracy recognition of ETL9B using exclusive learning neural network-II (ELNET-II). IEICE Trans. Information and Systems, 1996, 79-D(5): 516–521

    Google Scholar 

  35. Suzuki M, Omachi S, Kato N, et al. A discrimination method of similar characters using compound Mahalanobis function. Trans. IEICE Japan, 1997, J80-D-II(10): 2752–2760 (in Japanese)

    Google Scholar 

  36. Kimura F, Wakabayashi T, Tsuruoka S, et al. Improvement of handwritten Japanese character recognition using weighted direction code histogram. Pattern Recognition, 30(8): 1329–1337, 1997

    Article  Google Scholar 

  37. Kato N, Suzuki M, Omachi S, et al. A handwritten character recognition system using directional element feature and asymmetric Mahalanobis distance. IEEE Trans. Pattern Analysis and Machine Intelligence, 1999, 21(3): 258–262

    Article  Google Scholar 

  38. Sawa K, Wakabayashi T, Tsuruoka S, et al. Accuracy improvement by gradient feature and variable absorbing covariance matrix in handwritten Chinese character recognition. IEEE Trans. Pattern Analysis and Machine Intelligence, 2001, J84-D-II(11): 2379–2397 (in Japanese)

    Google Scholar 

  39. Dong J X, Krzyzak A, Suen C Y. High accuracy handwritten Chinese character recognition using support vector machine. In: Proceedings of Int. Workshop on Artificial Neural Networks for Pattern Recognition, Florence, Italy, 2003

  40. Liu H, Ding X. Handwritten character recognition using gradient feature and quadratic classifier with multiple discrimination schemes. In: Proceedings of 8th ICDAR, Seoul, Korea, 2005, 19–23

  41. Liu C-L. High accuracy handwritten Chinese character recognition using quadratic classifiers with discriminative feature extraction. In: Proceedings of 18th ICPR, Hong Kong, 2006, Vol.2, 942–945

    Google Scholar 

  42. Horiuchi T, Haruki R, Yamada H, et al. Two-dimensional extension of nonlinear normalization method using line density for character recognition. In: Proceedings of 4th ICDAR, Ulm, Germany, 1997, 511–514

  43. Liu C-L, Marukawa K. Pseudo Two-dimensional shape normalization methods for handwritten Chinese character recognition. Pattern Recognition, 2005, 38(12): 2242–2255

    Article  Google Scholar 

  44. Kawamura A, Yura K, Hayama T, et al. On-line recognition of freely handwritten Japanese characters using directional feature densities. In: Proceedings of 11th ICPR, The Hague, 1992, Vol.2, 183–186

    Google Scholar 

  45. Srikantan G, Lam S W, Srihari S N. Gradient-based contour encoder for character recognition. Pattern Recognition, 1996, 29(7): 1147–1160

    Article  Google Scholar 

  46. Liu C-L, Nakashima K, Sako H, et al. Handwritten digit recognition: benchmarking of state-of-the-art techniques. Pattern Recognition, 2003, 36(10): 2271–2285

    Article  MATH  Google Scholar 

  47. Fukunaga K. Introduction to Statistical Pattern Recognition. 2nd ed. Academic Press, 1990

  48. Loog M, Duin R P W. Linear dimensionality reduction via a heteroscedastic extension of LDA: the Chernoff criterion. IEEE Trans. Pattern Analysis and Machine Intelligence, 2004, 26(6): 732–739

    Article  Google Scholar 

  49. Liu C-L, Nakagawa M. Evaluation of prototype learning algorithms for nearest neighbor classifier in application to handwritten character recognition. Pattern Recognition, 2001, 34(3): 601–615

    Article  MATH  Google Scholar 

  50. Tang Y Y, et al. Offline recognition of Chinese handwriting by multi-feature and multilevel classification. IEEE Trans. Pattern Analysis and Machine Intelligence, 1998, 20(5): 556–561

    Article  Google Scholar 

  51. Dai R W, Hao H W, Xiao X H. Systems and Integration of Chinese Character Recognition. Zhejiang Science and Technology Press, 1998 (in Chinese)

  52. Dai R W, Wang L X. Pattern recognition systems integration by metasynthesis. Systems Science and Systems Engineering. Scientific and Technical Documents House, Beijing, 1997, 7–13

    Google Scholar 

  53. Xiao B H, Wang C H, Dai R W. Adaptive combination of classifiers and its application to handwritten Chinese character recognition. In: Proceedings of 15th ICPR, Barcelona, 2000, 327–330

  54. Wang C H, Xiao B H, Dai R W. Parallel compact integration in handwritten Chinese character recognition. Science in China Series F-Information Sciences, 2004, 47(1): 89–96

    Article  MathSciNet  Google Scholar 

  55. Xiao B H, Wang C H, Dai R W. Handwritten Chinese character recognition by metasynthetic approach. Int. J. Information Technology and Decision Making, World Scientific Press, 2003, 1(4): 621–634

    Article  Google Scholar 

  56. Fu K S. Sequential Methods in Pattern Recognition and Machine Learning. Academic Press, New York, 1968

    MATH  Google Scholar 

  57. Fu K S. Pattern Recognition and Machine Learning. Plenum Press, 1971

  58. Fu K S. Grammatical inference: introduction and survey, Part I and Part II. IEEE Pattern Analysis and Machine Intelligence, 1986, 8(3): 343–375

    MATH  Google Scholar 

  59. You K C, Fu K S. A syntactic approach to stage recognition using attributed grammars. IEEE Trans. System, Man, and Cybernetics, 1979, 9(6): 334–345

    Google Scholar 

  60. Tsai W H, Fu K S. Attributed grammar-a tool for combining syntactic and statistical approaches to pattern recognition. IEEE Trans. System, Man, and Cybernetics, 1980, 10(12): 873–885

    MATH  Google Scholar 

  61. Tsai W-H, Fu K S. A syntactic-statistical approach to recognition of industrial objects. In: Proceedings of 5th ICPR, Miami, 1980, 251–259

  62. Tsai W-H, Fu K S. Error-correcting isomorphism of attributed relational graphs for pattern analysis. IEEE Trans. System, Man, and Cybernetics, 1979, 9(12): 757–768

    MATH  Google Scholar 

  63. Tsai W-H, Fu K S. A pattern deformation model and Bayes error-correcting recognition system. IEEE Trans. System, Man, and Cybernetics, 1979, 9(12): 745–756

    Article  MATH  Google Scholar 

  64. Tai J W, Fu K S. Semantic syntax-directed translation for pictorial pattern recognition. Technical Report, School of EE, Purdue University, Oct 1981, TR-EE 81-83

  65. Knuth D E. Semantics of context-free language. Journal of Mathematical System Theory, 1968, 2(2): 127–145

    Article  MATH  MathSciNet  Google Scholar 

  66. Tai J W. A kind of relational attributed grammars. Acta Automatica Sinica, 1983, 9(2): 90–98 (in Chinese)

    Google Scholar 

  67. Qian X S, Yu J Y, Dai R W. A new discipline of science-the study of open complex giant system and its methodology. Chinese J. System Engineering and Electronics, 1993, 4(2): 2–12

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Xiao Baihua.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Dai, R., Liu, C. & Xiao, B. Chinese character recognition: history, status and prospects. Front. Comput. Sc. China 1, 126–136 (2007). https://doi.org/10.1007/s11704-007-0012-5

Download citation

  • Received:

  • Accepted:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11704-007-0012-5

Keywords

Navigation