Skip to main content
Log in

Analysis of calligraphy Chinese character recognition technology based on deep learning and computer-aided technology

  • Application of soft computing
  • Published:
Soft Computing Aims and scope Submit manuscript

Abstract

To preserve Chinese cultural heritage, the originality and complexity of calligraphy characters are proof of the country's unique literary heritage. However, it has long been challenging to comprehend and appropriately classify these complex characters. The absence of a quantitative standard for calligraphy Chinese character recognition has limited accurate assessments and recognition, allowing room for improvement. Therefore, this article seeks to improve the digital evolution of Chinese calligraphy and painting works by leveraging the quick development of computer-aided technology and deep learning algorithms. We collected Chinese calligraphy samples and refined them through digitization, preprocessing, noise reduction, and resizing. We used the HOG approach to identify the unique features of each character and the Euler distance to measure spatial relationships between target and background points, capturing their distinct strokes and patterns. Then, we employed the Google LeNet Inception-v3 model to take advantage of the Convolutional Neural Network’s (CNN) capability. Our system can reliably recognize and categorize different calligraphy styles thanks to our CNN-based methodology, going beyond the constraints of conventional recognition techniques. Finally, we carefully evaluated the precision, recall, and accuracy, recognition capacity of our proposed recognition system to assess its effectiveness in correctly identifying calligraphy Chinese characters. The outcomes of our thorough analysis show a recognition rate of 93.12%, illuminating the tremendous potential of our strategy. Our method regularly beats competing algorithms, even in the presence of Gaussian white noise, obtaining accuracy rates of 91.3%, 90.9%, and 89.4% for noise levels of 0.02, 0.04, and 0.06, respectively.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Algorithm 1
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13

Similar content being viewed by others

Explore related subjects

Discover the latest articles, news and stories from top researchers in related subjects.

Data availability

The materials and data used in this study can be accessed by contacting the corresponding author.

References

  • Al-Barhamtoshy HM, Jambi KM, Abdou SM, Rashwan MA (2021) Arabic documents information retrieval for printed, handwritten, and calligraphy image. IEEE Access 9:51242–51257

    Article  Google Scholar 

  • Ali M, Yin B, Kumar A, Sheikh AM et al (2020) Reduction of multiplications in convolutional neural networks. In: 2020 39th Chinese control conference (CCC). IEEE, pp 7406–7411. https://doi.org/10.23919/CCC50068.2020.9188843

  • Ali M, Yin B, Bilal H et al (2023) Advanced efficient strategy for detection of dark objects based on spiking network with multi-box detection. Multimed Tools Appl. https://doi.org/10.1007/s11042-023-16852-2

    Article  Google Scholar 

  • Aslam MS, Chen Z (2019) Observer-based dissipative output feedback control for network T-S fuzzy systems under time delays with mismatch premise. Nonlinear Dyn 95:2923–2941

    Article  Google Scholar 

  • Aslam MS, Dai X, Hou J, Li Q, Ullah R, Ni Z, Liu Y (2020a) Reliable control design for composite-driven scheme based on delay networked T-S fuzzy system. Int J Robust Nonlinear Control 30(4):1622–1642

    Article  MathSciNet  Google Scholar 

  • Aslam MS, Ullah R, Dai X, Sheng A (2020b) Event-triggered scheme for fault detection and isolation of non-linear system with time-varying delay. IET Control Theory Appl 14(16):2429–2438

    Article  MathSciNet  Google Scholar 

  • Aslam MS, Li Q, Hou J (2021) Fault detection for asynchronous T-S fuzzy networked Markov jump systems with new event-triggered scheme. IET Control Theory Appl 15(11):1461–1473

    Article  MathSciNet  Google Scholar 

  • Bai X, Wang X, Liu X, Liu Q, Song J, Sebe N, Kim B (2021) Explainable deep learning for efficient and robust pattern recognition: a survey of recent developments. Pattern Recogn 120:108102

    Article  Google Scholar 

  • Bi N, Suen CY, Nobile N, Tan J (2019) A multi-feature selection approach for gender identification of handwriting based on kernel mutual information. Pattern Recogn Lett 121:123–132

    Article  Google Scholar 

  • Boufenar C, Kerboua A, Batouche M (2018) Investigation on deep learning for off-line handwritten Arabic character recognition. Cogn Syst Res 50:180–195

    Article  Google Scholar 

  • Chen G, Chen P, Huang W, Zhai J (2022) Continuance intention mechanism of middle school student users on online learning platform based on qualitative comparative analysis method. Math Probl Eng 2022:1–12

    Google Scholar 

  • Dou H, Liu Y, Chen S et al (2023) A hybrid CEEMD-GMM scheme for enhancing the detection of traffic flow on highways. Soft Comput 27:16373–16388. https://doi.org/10.1007/s00500-023-09164-y

    Article  Google Scholar 

  • Fan J, Ma C, Zhong Y (2021) A selective overview of deep learning. Stat Sci Rev J Inst Math Stat 36(2):264

    MathSciNet  Google Scholar 

  • Huang J, Cheng G, Zhang J, Miao W (2023) Recognition method for stone carved calligraphy characters based on a convolutional neural network. Neural Comput Appl 35(12):8723–8732

    Google Scholar 

  • Kaoudja Z, Kherfi ML, Khaldi B (2021) A new computational method for Arabic calligraphy style representation and classification. Appl Sci 11:4852

    Article  Google Scholar 

  • Li X, Wang Y, Zhang B, Ma J (2020) PSDRNN: An efficient and effective HAR scheme based on feature extraction and deep learning. IEEE Trans Ind Inform 16(10):6703–6713

    Article  Google Scholar 

  • Liu C-L, Jaeger S, Nakagawa M (2004) Online recognition of Chinese characters: the state-of-the-art. IEEE Trans Pattern Anal Mach Intell 26(2):198–213

    Article  Google Scholar 

  • Liu Q, Sun Y, Wang C, Liu T, Tao D (2016) Elastic net hypergraph learning for image clustering and semi-supervised classification. IEEE Trans Image Process 26:452–463

    Article  MathSciNet  Google Scholar 

  • Liu A-A, Zhai Y, Xu N, Nie W, Li W, Zhang Y (2021a) Region-aware image captioning via interaction learning. IEEE Trans Circuits Syst Video Technol 32:3685–3696

    Article  Google Scholar 

  • Liu R, Wang X, Lu H, Wu Z, Fan Q, Li S, Jin X (2021b) SCCGAN: style and characters inpainting based on CGAN. Mob Netw Appl 26:3–12

    Article  Google Scholar 

  • Liu Y, Pu H, Sun D-W (2021c) Efficient extraction of deep image features using convolutional neural network (CNN) for applications in detecting and analyzing complex food matrices. Trends Food Sci Technol 113:193–204

    Article  Google Scholar 

  • Mai GT, Liang Y, Pan JH, Huang JL, Chen XL, She YC (2022) Calligraphic font recognition algorithm based on improved DenseNet network. Comput Syst Appl 31(2):253–259

    Google Scholar 

  • Mohammed MJ, Mohammed EA, Jarjees MS (2020) Recognition of multifont English electronic prescribing based on convolution neural network algorithm. Bio-Algorithms Med Syst 16(3):182–190

    Google Scholar 

  • Moodley T, van der Haar D, Noorbhai H (2022) Automated recognition of the cricket batting backlift technique in video footage using deep learning architectures. Sci Rep 12(1):1895

    Article  Google Scholar 

  • Muhammad IQ, Majid A, Shamrooz S (2023) Adaptive event-triggered robust H∞ control for Takagi-Sugeno fuzzy networked Markov jump systems with time-varying delay. Asian J Control 25(1):213–228

    Article  MathSciNet  Google Scholar 

  • Naiemi F, Ghods V, Khalesi H (2021) A novel pipeline framework for multi oriented scene text image detection and recognition. Expert Syst Appl 170:114549

    Article  Google Scholar 

  • Qi M, Cui S, Chang X, Xu Y, Meng H, Wang Y, Yin T (2022) Multi-region nonuniform brightness correction algorithm based on L-channel gamma transform. Secur Commun Netw 2022:1–9

    Google Scholar 

  • Sun M, Gong X, Nie H, Iqbal MM, Xie B (2022) Srafe: Siamese regression aesthetic fusion evaluation for Chinese calligraphic copy. CAAI Trans Intell Technol 8:1077–1086

    Article  Google Scholar 

  • Wang L, Zhai Q, Yin B et al (2019) Second-order convolutional network for crowd counting. In: Proceedings of SPIE 11198, fourth international workshop on pattern recognition, 111980T. https://doi.org/10.1117/12.2540362

  • Wu Q, Li X, Wang K et al (2023) Regional feature fusion for on-road detection of objects using camera and 3D-LiDAR in high-speed autonomous vehicles. Soft Comput 27:18195–18213. https://doi.org/10.1007/s00500-023-09278-3

    Article  Google Scholar 

  • Xiao Y, Lei W, Lu L, Chang X, Zheng X, Chen X (2021) CS-GAN: cross-structure generative adversarial networks for Chinese calligraphy translation. Knowl Based Syst 229:107334

    Article  Google Scholar 

  • Xiao Z, Shu J, Jiang H, Lui JC, Min G, Liu J, Dustdar S (2022) Multi-objective parallel task offloading and content caching in D2D-aided MEC networks. IEEE Trans Mob Comput 22:6599–6615

    Google Scholar 

  • Xie SX, Zeng L (2017) On the artistic expression of calligraphy in Chinese character logo design. Ind Sci Trib 16(14):200–201

    Google Scholar 

  • Xu JY, Lin CY, Chen ZT, Deng ZR, Pan JH, Liang H (2021) Handwritten calligraphy font recognition algorithm based on deep learning. Comput Syst Appl 30(2):213–218

    Google Scholar 

  • Xu H, Sun Z, Cao Y et al (2023) A data-driven approach for intrusion and anomaly detection using automated machine learning for the Internet of Things. Soft Comput. https://doi.org/10.1007/s00500-023-09037-4

    Article  Google Scholar 

  • Yang S, Li Q, Li W, Li X, Liu A-A (2022) Dual-level representation enhancement on characteristic and context for image-text retrieval. IEEE Trans Circuits Syst Video Technol 32:8037–8050

    Article  Google Scholar 

  • Yao W, Guo Y, Wu Y, Guo J (2017) Experimental validation of fuzzy PID control of flexible joint system in presence of uncertainties. In: 2017 36th Chinese control conference (CCC). IEEE, pp 4192–4197. https://doi.org/10.23919/ChiCC.2017.8028015.

  • Yin B, Khan J, Wang L, Zhang J, Kumar A (2019) Real-time lane detection and tracking for advanced driver assistance systems. In: 2019 Chinese control conference (CCC). IEEE, pp 6772–6777. https://doi.org/10.23919/ChiCC.2019.8866334

  • Zhang J, Guo M, Fan J (2020a) A novel generative adversarial net for calligraphic tablet images denoising. Multimed Tools Appl 79:119–140

    Article  Google Scholar 

  • Zhang Y-W, Wang J, Long W, Liu H, Zhang C, Chen Y (2020b) A fast solution for Chinese calligraphy relief modeling from 2D handwriting image. Vis Comput 36:2241–2250

    Article  Google Scholar 

  • Zheng W, Yin L (2022) Characterization inference based on joint-optimization of multi-layer semantics and deep fusion matching network. PeerJ Comput Sci 8:e908

    Article  Google Scholar 

  • Zheng W, Zhou Y, Liu S, Tian J, Yang B, Yin L (2022a) A deep fusion matching network semantic reasoning model. Appl Sci 12:3416

    Article  Google Scholar 

  • Zheng W, Tian X, Yang B, Liu S, Ding Y, Tian J, Yin L (2022b) A few shot classification methods based on multiscale relational networks. Appl Sci 12:4059

    Article  Google Scholar 

Download references

Funding

No funding was provided for the completion of this study.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Huihui Si.

Ethics declarations

Conflict of interest

The authors declare no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Si, H. Analysis of calligraphy Chinese character recognition technology based on deep learning and computer-aided technology. Soft Comput 28, 721–736 (2024). https://doi.org/10.1007/s00500-023-09423-y

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00500-023-09423-y

Keywords