Abstract
This paper presents a practical solution to the problem of real-time text/shape differentiation for online handwriting input. The proposed classification system comprises a stroke grouping block and a stroke classification block. A new set of features with low computational complexity is derived. The method achieves 98.5 % text/shape classification accuracy on a benchmark dataset. The proposed machine-learning approach to stroke grouping makes classification more robust to different input styles. Compared with threshold-based techniques, this adaptive grouping improves the overall discrimination accuracy of the text/shape recognition system by 11.3 %. The solution improves the system's response on a touch-screen device.
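The two-block structure described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the grouping step shown here is a simple threshold-based baseline of the kind the abstract compares against, and the names `Stroke`, `group_strokes`, `group_features`, `classify_groups`, the thresholds `max_gap` and `max_pause`, and the features themselves are all hypothetical choices made for illustration. The classification block is left as a caller-supplied function, since the abstract does not specify the classifier.

```python
from dataclasses import dataclass
from math import hypot
from typing import Callable, Dict, List, Tuple

Point = Tuple[float, float]


@dataclass
class Stroke:
    """One pen-down to pen-up trajectory (hypothetical container)."""
    points: List[Point]
    t_start: float  # seconds
    t_end: float    # seconds


def bbox(points: List[Point]) -> Tuple[float, float, float, float]:
    """Return the bounding box as (x_min, y_min, x_max, y_max)."""
    xs = [p[0] for p in points]
    ys = [p[1] for p in points]
    return min(xs), min(ys), max(xs), max(ys)


def group_strokes(strokes: List[Stroke],
                  max_gap: float = 40.0,
                  max_pause: float = 0.8) -> List[List[Stroke]]:
    """Threshold-based baseline grouping (assumed thresholds).

    A stroke joins the current group when both its horizontal gap to the
    previous stroke and the pause since that stroke are small enough;
    otherwise it starts a new group.
    """
    groups: List[List[Stroke]] = []
    for stroke in strokes:
        if groups:
            prev = groups[-1][-1]
            gap = bbox(stroke.points)[0] - bbox(prev.points)[2]
            pause = stroke.t_start - prev.t_end
            if gap <= max_gap and pause <= max_pause:
                groups[-1].append(stroke)
                continue
        groups.append([stroke])
    return groups


def group_features(group: List[Stroke]) -> Dict[str, float]:
    """Cheap per-group features (illustrative, not the paper's feature set)."""
    pts = [p for s in group for p in s.points]
    x0, y0, x1, y1 = bbox(pts)
    length = sum(hypot(b[0] - a[0], b[1] - a[1])
                 for s in group
                 for a, b in zip(s.points, s.points[1:]))
    diag = hypot(x1 - x0, y1 - y0) or 1.0  # avoid division by zero for dots
    return {
        "n_strokes": float(len(group)),
        "length_over_diag": length / diag,
        "width": x1 - x0,
        "height": y1 - y0,
    }


def classify_groups(groups: List[List[Stroke]],
                    classifier: Callable[[Dict[str, float]], str]) -> List[str]:
    """Apply a caller-supplied classifier to each group's feature vector."""
    return [classifier(group_features(g)) for g in groups]
```

In this sketch, replacing `group_strokes` with a learned grouping model would leave the feature and classification steps unchanged, which is the separation of blocks the abstract describes.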
Cite this article
Degtyarenko, I., Radyvonenko, O., Bokhan, K. et al. Text/shape classifier for mobile applications with handwriting input. IJDAR 19, 369–379 (2016). https://doi.org/10.1007/s10032-016-0276-0
DOI: https://doi.org/10.1007/s10032-016-0276-0