Skip to main content

A Shallow Graph Neural Network with Innovative Node Updating for Online Handwritten Stroke Classification

  • Conference paper
  • First Online:
Document Analysis and Recognition - ICDAR 2023 (ICDAR 2023)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 14190))

Included in the following conference series:

  • 590 Accesses

Abstract

Stroke classification is important to the layout analysis of online handwritten documents. Due to the diversity of writing styles and the complexity of layout structure, stroke classification is challenging. Graph neural networks (GNNs) is one of the most effective frameworks for stroke classification. However, GNNs has the problem of node over-compression caused by the deep structure of GNNs, which will lead to loss of node information and hence may deteriorate the performance of stroke classification. In this paper, we propose a shallow graph neural network model that is capable of retaining long-term receptive field by constructing a more reasonable graph through edge classification before the node classification step. Moreover, a novel node learning method is used to integrate edge features into nodes, where edge features not only participate in the calculation of node attention weight as in previous GNN based methods, but also participate in the final node integration. Experiments on the IAMonDo dataset show that our proposed method achieves an accuracy of 97.71% that is superior to existing state-of-the-art methods, demonstrating the effectiveness of the proposed method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 99.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 129.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Otte, S., Krechel, D., Liwicki, M., et al.: Local feature based online mode detection with recurrent neural networks. In: 2012 International Conference on Frontiers in Handwriting Recognition, pp. 533–537. IEEE (2012)

    Google Scholar 

  2. Indermühle, E., Frinken, V., Bunke, H.: Mode detection in online handwritten documents using BLSTM neural networks. In: 2012 International Conference on Frontiers in Handwriting Recognition, pp. 302–307. IEEE (2012). Author, F., Author, S., Author, T.: Book title. 2nd edn. Publisher, Location (1999)

    Google Scholar 

  3. Khomenko, V., Volkoviy, A., Degtyarenko, I., et al.: Handwriting text/non-text classification on mobile device. In: The Fourth International Conference on Artificial Intelligence and Pattern Recognition (AIPR), pp. 42–49 (2017)

    Google Scholar 

  4. Polotskyi, S., Deriuga, I., Ignatova, T., Melnyk, V., Azarov, H.: Improving online handwriting text/non-text classification accuracy under condition of stroke context absence. In: Rojas, I., Joya, G., Catala, A. (eds.) IWANN 2019. LNCS, vol. 11506, pp. 210–221. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-20521-8_18

    Chapter  Google Scholar 

  5. Van Phan, T., Nakagawa, M.: Text/non-text classification in online handwritten documents with recurrent neural networks. In: 2014 14th International Conference on Frontiers in Handwriting Recognition, pp. 23–28. IEEE (2014)

    Google Scholar 

  6. Bishop, C.M., Svensen, M., Hinton, G.E.: Distinguishing text from graphics in on-line handwritten ink. In: Ninth International Workshop on Frontiers in Handwriting Recognition, pp. 142–147. IEEE (2004)

    Google Scholar 

  7. Zhou, X.D., Liu, C.L.: Text/non-text ink stroke classification in Japanese handwriting based on markov random fields. In: Ninth International Conference on Document Analysis and Recognition (ICDAR 2007), vol. 1, pp. 377–381. IEEE (2007)

    Google Scholar 

  8. Delaye, A., Liu, C.-L.: Text/non-text classification in online handwritten documents with conditional random fields. In: Liu, C.-L., Zhang, C., Wang, L. (eds.) Pattern Recognition, pp. 514–521. Springer Berlin Heidelberg, Berlin, Heidelberg (2012). https://doi.org/10.1007/978-3-642-33506-8_63

    Chapter  Google Scholar 

  9. Ye, J.Y., Zhang, Y.M., Liu, C.L.: Joint training of conditional random fields and neural networks for stroke classification in online handwritten documents. In: 2016 23rd International Conference on Pattern Recognition (ICPR), pp. 3264–3269. IEEE (2016)

    Google Scholar 

  10. Yun, X.-L., Zhang, Y.-M., Ye, J.-Y., Liu, C.-L.: Online handwritten diagram recognition with graph attention networks. In: Zhao, Y., Barnes, N., Chen, B., Westermann, R., Kong, X., Lin, C. (eds.) Image and Graphics: 10th International Conference, ICIG 2019, Beijing, China, August 23–25, 2019, Proceedings, Part I, pp. 232–244. Springer International Publishing, Cham (2019). https://doi.org/10.1007/978-3-030-34120-6_19

    Chapter  Google Scholar 

  11. Ye, J.Y., Zhang, Y.M., Yang, Q., et al.: Contextual stroke classification in online handwritten documents with graph attention networks. In: 2019 International Conference on Document Analysis and Recognition (ICDAR), 993–998. IEEE (2019)

    Google Scholar 

  12. Yun, X.L., Zhang, Y.M., Yin, F., et al.: Instance GNN: a learning framework for joint symbol segmentation and recognition in online handwritten diagrams. IEEE Trans. Multimedia 24, 2580–2594 (2021)

    Article  Google Scholar 

  13. Indermühle, E., Liwicki, M., Bunke, H.: IAMonDo-database: an online handwritten document database with non-uniform contents. In: Proceedings of the 9th IAPR International Workshop on Document Analysis Systems, pp. 97–104 (2010)

    Google Scholar 

  14. Degtyarenko, I., Deriuga, I., Grygoriev, A., et al.: Hierarchical recurrent neural network for handwritten strokes classification. In: ICASSP 2021–2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 2865–2869. IEEE (2021)

    Google Scholar 

  15. Grygoriev, A., Degtyarenko, I., Deriuga, I., Polotskyi, S., Melnyk, V., Zakharchuk, D., Radyvonenko, O.: HCRNN: a novel architecture for fast online handwritten stroke classification. In: Lladós, J., Lopresti, D., Uchida, S. (eds.) Document Analysis and Recognition – ICDAR 2021: 16th International Conference, Lausanne, Switzerland, September 5–10, 2021, Proceedings, Part II, pp. 193–208. Springer International Publishing, Cham (2021). https://doi.org/10.1007/978-3-030-86331-9_13

    Chapter  Google Scholar 

  16. Defferrard, M., Bresson, X., Vandergheynst, P.: Convolutional neural networks on graphs with fast localized spectral filtering. In: Advances in Neural Information Processing Systems, pp. 3844–3852 (2016)

    Google Scholar 

  17. Hamilton, W., Ying, Z., Leskovec, J.: Inductive representation learning on large graphs. In: Advances in Neural Information Processing Systems, vol. 30 (2017)

    Google Scholar 

  18. Veličković, P., Cucurull, G., Casanova, A., et al.: Graph attention networks. arXiv preprint arXiv:1710.10903 (2017)

  19. Gong, L., Cheng, Q.: Exploiting edge features for graph neural networks. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 9211–9219 (2019)

    Google Scholar 

  20. Polotskyi, S., Radyvonenko, O., Degtyarenko, I., et al.: Spatio-temporal clustering for grouping in online handwriting document layout analysis with GRU-RNN. In: 2020 17th International Conference on Frontiers in Handwriting Recognition (ICFHR), pp. 276–281. IEEE (2020)

    Google Scholar 

  21. Van Phan, T., Nakagawa, M.: Combination of global and local contexts for text/non-text classification in heterogeneous online handwritten documents. Pattern Recogn. 51, 112–124 (2016)

    Article  Google Scholar 

  22. Li, X.H., Yin, F., Liu, C.L.: Printed/handwritten texts and graphics separation in complex documents using conditional random fields. In: 2018 13th IAPR International Workshop on Document Analysis Systems (DAS), pp. 145–150. IEEE (2018)

    Google Scholar 

  23. Wang, C., Mouchere, H., Viard-Gaudin, C., et al.: Combined segmentation and recognition of online handwritten diagrams with high order markov random field. In: 2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR), pp. 252–257. IEEE (2016)

    Google Scholar 

  24. Delaye, A., Liu, C.L.: Multi-class segmentation of free-form online documents with tree conditional random fields. Int. J. Doc. Anal. Recogn. (IJDAR) 17(4), 313–329 (2014)

    Article  Google Scholar 

  25. Inatani, S., Van Phan, T., Nakagawa, M.: Comparison of MRF and CRF for Text/Non-text classification in Japanese Ink Documents. In: 2014 14th International Conference on Frontiers in Handwriting Recognition, pp, 684–689. IEEE (2014)

    Google Scholar 

  26. Delaye, A., Liu, C.L.: Graphics extraction from heterogeneous online documents with hierarchical random fields. In: 2013 12th International Conference on Document Analysis and Recognition, pp, 1007–1011. IEEE (2013)

    Google Scholar 

  27. Delaye, A., Liu, C.L.: Contextual text/non-text stroke classification in online handwritten notes with conditional random fields. Pattern Recogn. 47(3), 959–968 (2014)

    Article  Google Scholar 

  28. Delaye, A., Liu, C.L.: Context modeling for text/non-text separation in free-form online handwritten documents. Doc. Recogn. Retrieval XX SPIE 8658, 98–109 (2013)

    Google Scholar 

  29. Ye, J.Y., Zhang, Y.M., Yang, Q., et al.: Joint stroke classification and text line grouping in online handwritten documents with edge pooling attention networks. Pattern Recogn. 114, 107859 (2021)

    Article  Google Scholar 

  30. Wang, Z., Chen, J., Chen, H.: EGAT: edge-featured graph attention network. In: Farkaš, I., Masulli, P., Otte, S., Wermter, S. (eds.) Artificial Neural Networks and Machine Learning – ICANN 2021: 30th International Conference on Artificial Neural Networks, Bratislava, Slovakia, September 14–17, 2021, Proceedings, Part I, pp. 253–264. Springer International Publishing, Cham (2021). https://doi.org/10.1007/978-3-030-86362-3_21

    Chapter  Google Scholar 

  31. Paszke, A., Gross, S., Massa, F., et al.: Pytorch: an imperative style, high-performance deep learning library. In: Advances in Neural Information Processing Systems, vol. 32 (2019)

    Google Scholar 

  32. Indermühle, E.: Analysis of digital link in electronic documents. Verlag nicht ermittelbar (2012)

    Google Scholar 

  33. Alon, U., Yahav, E.: On the bottleneck of graph neural networksand its practical implications. arXiv preprint arXiv:2006.05205 (2020)

Download references

Acknowledgement

This work is supported by National Natural Science Foundation of China (No. 61773325, 62276258), Industry University Cooperation Project of Fujian Science and Technology Department (No. 2021H6035), and the Science and Technology Planning Project of Fujian Province (No. 2020Y9064), and Fu-Xia-Quan National Independent Innovation Demonstration Project (No. 2022FX4).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Da-Han Wang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Wang, YR., Wang, DH., Yun, XL., Zhang, YM., Yin, F., Zhu, S. (2023). A Shallow Graph Neural Network with Innovative Node Updating for Online Handwritten Stroke Classification. In: Fink, G.A., Jain, R., Kise, K., Zanibbi, R. (eds) Document Analysis and Recognition - ICDAR 2023. ICDAR 2023. Lecture Notes in Computer Science, vol 14190. Springer, Cham. https://doi.org/10.1007/978-3-031-41685-9_1

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-41685-9_1

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-41684-2

  • Online ISBN: 978-3-031-41685-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics