skip to main content
10.1145/3638884.3638900acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiccipConference Proceedingsconference-collections
research-article

Towards Robust Handwritten Tibetan Numeral Recognition with Spiking Neural Networks

Published:23 April 2024Publication History

ABSTRACT

The Tibetan region contains abundant cultural resources. Handwritten Tibetan numeral recognition is an important basic task for Tibetan language digitalization process. As a low-resource language, Tibetan poses more stringent robustness requirements for deep learning models, requiring them to resist interference and learn core features. This paper proposes a Spiking Fully Convolutional Neural network model SFCN based on spiking neural networks (SNNs) to accomplish handwritten Tibetan numeral recognition on the TibetanMNIST dataset. The fully convolutional neural network structure based on CNNs can effectively extract image features. Combined with SNNs as SFCN, it further enhances network robustness. Experiments show that our SFCN model maintains 98.90% accuracy while exhibiting extraordinary robustness under various interferences, compared to an artificial neural network (ANN) of the same structure. On the basis, combining interferences with training samples further improves model robustness. SFCN trained with adversarial attack method projected gradient descent (PGD) achieves the optimal robustness. SFCN-PGD model’s accuracy on the test set improved by 80.39% at the highest compared to traditional ANNs1.

References

  1. Jan Benda and Joel Tabak. 2014. Spike-Frequency Adaptation.Encyclopedia of computational neuroscience 15, 11 (2014), 1–12.Google ScholarGoogle Scholar
  2. Jianhao Ding, Tong Bu, Zhaofei Yu, Tiejun Huang, and Jian Liu. 2022. Snn-rat: Robustness-enhanced spiking neural network through regularized adversarial training. Advances in Neural Information Processing Systems 35 (2022), 24780–24793.Google ScholarGoogle Scholar
  3. Wulfram Gerstner, Werner M Kistler, Richard Naud, and Liam Paninski. 2014. Neuronal dynamics: From single neurons to networks and models of cognition. Cambridge University Press.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Ian J Goodfellow, Jonathon Shlens, and Christian Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 (2014).Google ScholarGoogle Scholar
  5. Xuefeng Jiang, Yikun Wang, Wenbo Liu, Shuying Li, and Junrui Liu. 2019. Capsnet, cnn, fcn: Comparative performance evaluation for image classification. Int. J. Mach. Learn. Comput 9, 6 (2019), 840–848.Google ScholarGoogle ScholarCross RefCross Ref
  6. Tenzin Kaldan and Adiyillam Vijayalakshmi. 2021. TenzinNet for handwritten Tibetan numeral recognition. International Journal of Information Technology 13, 4 (2021), 1679–1682.Google ScholarGoogle ScholarCross RefCross Ref
  7. Mikhail Leontev, Dmitry Antonov, and Sergey Sukhov. 2021. Robustness of spiking neural networks against adversarial attacks. In 2021 International Conference on Information Technology and Nanotechnology (ITNT). IEEE, 1–6.Google ScholarGoogle ScholarCross RefCross Ref
  8. Jonathan Long, Evan Shelhamer, and Trevor Darrell. 2015. Fully convolutional networks for semantic segmentation. In Proceedings of the IEEE conference on computer vision and pattern recognition. 3431–3440.Google ScholarGoogle ScholarCross RefCross Ref
  9. Wolfgang Maass. 1997. Networks of spiking neurons: the third generation of neural network models. Neural networks 10, 9 (1997), 1659–1671.Google ScholarGoogle Scholar
  10. Aleksander Mądry, Aleksandar Makelov, Ludwig Schmidt, Dimitris Tsipras, and Adrian Vladu. 2017. Towards deep learning models resistant to adversarial attacks. stat 1050 (2017), 9.Google ScholarGoogle Scholar
  11. Nicolas Perez-Nieves, Vincent CH Leung, Pier Luigi Dragotti, and Dan FM Goodman. 2021. Neural heterogeneity promotes robust learning. Nature communications 12, 1 (2021), 5791.Google ScholarGoogle Scholar
  12. M Shahidur Rahman 2016. Towards optimal convolutional neural network parameters for bengali handwritten numerals recognition. In 2016 19th International Conference on Computer and Information Technology (ICCIT). IEEE, 431–436.Google ScholarGoogle Scholar
  13. Qianjun Shuai, Xingwen Wu, and Libiao Jin. 2019. Handwritten Tibetan Digital Words Recognition Based on improved Convolutional Neural Network. In 2019 IEEE International Conference on Signal, Information and Data Processing (ICSIDP). IEEE, 1–4.Google ScholarGoogle ScholarCross RefCross Ref
  14. Christian Szegedy, Wojciech Zaremba, Ilya Sutskever, Joan Bruna, Dumitru Erhan, Ian Goodfellow, and Rob Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 (2013).Google ScholarGoogle Scholar
  15. Luke Taylor, Andrew J King, and Nicol Spencer Harper. 2022. Robust and accelerated single-spike spiking neural network training with applicability to challenging temporal tasks. (2022).Google ScholarGoogle Scholar
  16. Bing Xu, Naiyan Wang, Tianqi Chen, and Mu Li. 2015. Empirical evaluation of rectified activations in convolutional network. arXiv preprint arXiv:1505.00853 (2015).Google ScholarGoogle Scholar
  17. Friedemann Zenke, Sander M Bohté, Claudia Clopath, Iulia M Comşa, Julian Göltz, Wolfgang Maass, Timothée Masquelier, Richard Naud, Emre O Neftci, Mihai A Petrovici, 2021. Visualizing a joint future of neuroscience and neuromorphic engineering. Neuron 109, 4 (2021), 571–575.Google ScholarGoogle ScholarCross RefCross Ref
  18. Shibo Zhou, Xiaohua Li, Ying Chen, Sanjeev T Chandrasekaran, and Arindam Sanyal. 2021. Temporal-coded deep spiking neural network with easy training and robust performance. In Proceedings of the AAAI conference on artificial intelligence, Vol. 35. 11143–11151.Google ScholarGoogle ScholarCross RefCross Ref
  19. Zhaokun Zhou, Yuesheng Zhu, Chao He, Yaowei Wang, Shuicheng Yan, Yonghong Tian, and Li Yuan. 2022. Spikformer: When spiking neural network meets transformer. arXiv preprint arXiv:2209.15425 (2022).Google ScholarGoogle Scholar

Index Terms

  1. Towards Robust Handwritten Tibetan Numeral Recognition with Spiking Neural Networks

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Other conferences
        ICCIP '23: Proceedings of the 2023 9th International Conference on Communication and Information Processing
        December 2023
        648 pages
        ISBN:9798400708909
        DOI:10.1145/3638884

        Copyright © 2023 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 23 April 2024

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article
        • Research
        • Refereed limited

        Acceptance Rates

        Overall Acceptance Rate61of301submissions,20%
      • Article Metrics

        • Downloads (Last 12 months)1
        • Downloads (Last 6 weeks)1

        Other Metrics

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      HTML Format

      View this article in HTML Format .

      View HTML Format