Abstract
Writing and drawing are basic forms of human communication. Handwritten and hand-drawn documents are often used at initial stages of a project. For storage and later usage, handwritten documents are often converted into a digital format with a graphics program. Drawing with a computer in many cases requires skill and more time than less formal handwritten drawings. Even when people have experience in computer drawing and are familiar with the application, it takes time. Automatic conversion of images of hand-drawn diagrams into a digital graphic format file could save time in the design process. One of early critical tasks in hand-drawn diagram interpretation is segmentation of the diagram into text and non-text components. In this paper, we compare two approaches for offline text and non-text segmentation of contours in an image. We describe the feature extraction and classification processes. Our methods obtain 82–86 % accuracy. Future work will explore the application of these techniques in a complete diagram interpretation system.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Burges, C.J.C.: A tutorial on support vector machines for pattern recognition. Data Min. Knowl. Disc. 2, 121–167 (1998)
Lemaitre, A., Carton, C., Couasnon, B.: Fusion of statistical and structural information for flowchart recognition. In: 2013 12th International Conference on Document Analysis and Recognition, pp. 1210–1214. IEEE (2013)
Costagliola, G., Deufemia, V., Risi, M.: A multi-layer parsing strategy for on-line recognition of hand-drawn diagrams. In: IEEE Symposium on Visual Languages and Human-Centric Computing, VL/HCC 2006, pp. 103–110. IEEE Computer Society, September 2006
Freitas, C.O.A., Oliveira, L.S., Bortolozzi, F., Aires, S.B.K.: Handwritten character recognition using nonsymmetrical perceptual zoning. Int. J. Pattern Recogn. Artif. Intell. 21(01), 135–155 (2007)
Hammond, T., Davis, R.: Tahuti: a geometrical sketch recognition system for UML class diagrams. In: ACM SIGGRAPH 2006 Courses, SIGGRAPH 2006. ACM (2006)
Hammond, T., Davis, R.: Ladder, a sketching language for user interface developers. In: ACM SIGGRAPH 2007 Courses, SIGGRAPH 2007. ACM (2007)
Kara, L.B., Stahovich, T.F.: Hierarchical parsing and recognition of hand-sketched diagrams. In: ACM SIGGRAPH (2007)
Lauer, F., Suen, C.Y., Bloch, G.: A trainable feature extractor for handwritten digit recognition. Pattern Recogn. 40(6), 1816–1824 (2007)
Bresler, M., Prua, D., Hlavác, V.: Modeling flowchart structure recognition as a max-sum problem. In: 2013 12th International Conference on Document Analysis and Recognition, pp. 1215–1219. IEEE (2013)
Niu, X.-X., Suen, C.Y.: A novel hybrid CNN-SVM classifier for recognizing handwritten digits. Pattern Recogn. 45(4), 1318–1325 (2012)
OpenCV Dev Team. Finding contours in your image (2016). http://docs.opencv.org/2.4/doc/tutorials/imgproc/shapedescriptors/find_contours/find_contours.html
Otsu, N.: A threshold selection method from gray-level histograms. IEEE Trans. Syst. Man Cybern. 9(1), 62–66 (1979)
Stahovich, T.F.: Segmentation of pen strokes using pen speed. In: Proceedings of the AAAI Fall Symposium on Making Pen-Based Interaction Intelligent and Natural, pp. 21–24 (2004)
Suzuki, S., Abe, K.: Topological structural analysis of digitized binary images by border following. Comput. Vis. Graph. Image Process. 30(1), 32–46 (1985)
Waranusast, R., Haddawy, P., Dailey, M.: Segmentation of text and non-text in on-line handwritten patient record based on spatio-temporal analysis. In: Combi, C., Shahar, Y., Abu-Hanna, A. (eds.) AIME 2009. LNCS, vol. 5651, pp. 345–354. Springer, Heidelberg (2009)
Wu, J., Wang, C., Zhang, L., Rui, Y.: Offline sketch parsing via shapeness estimation. In: Proceedings of the 24th International Conference on Artificial Intelligence, IJCAI 2015, pp. 1200–1206. AAAI Press (2015)
Yang, X., Qiaozhen, Y., He, L., Guo, T.: The one-against-all partition based binary tree support vector machine algorithms for multi-class classification. Neurocomputing 113, 1–7 (2013)
Zhong, C., Ding, Y., Fu, J.: Handwritten character recognition based on 13-point feature of skeleton and self-organizing competition network. In: Proceedings of the 2010 International Conference on Intelligent Computation Technology and Automation, ICICTA 2010, vol. 02, pp. 414–417. IEEE Computer Society, Washington, D.C. (2010)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Pravalpruk, B., Dailey, M.M. (2016). Offline Text and Non-text Segmentation for Hand-Drawn Diagrams. In: Booth, R., Zhang, ML. (eds) PRICAI 2016: Trends in Artificial Intelligence. PRICAI 2016. Lecture Notes in Computer Science(), vol 9810. Springer, Cham. https://doi.org/10.1007/978-3-319-42911-3_32
Download citation
DOI: https://doi.org/10.1007/978-3-319-42911-3_32
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-42910-6
Online ISBN: 978-3-319-42911-3
eBook Packages: Computer ScienceComputer Science (R0)