Analysis and Recognition of Hand-Drawn Images with Effective Data Handling

Gupta, Mohit; Mehndiratta, Pulkit

doi:10.1007/978-3-030-37188-3_22

Mohit Gupta¹² &
Pulkit Mehndiratta¹²

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 11932))

Included in the following conference series:

International Conference on Big Data Analytics

1085 Accesses
1 Citations

Abstract

Everyday data is being produced at a mind-boggling rate and at current pace, we are generating more than 2.5 quintillion bytes of data each day. With more data, there comes the chance of generating and understanding more useful information out of it. With this view in mind, we are presenting our work on Quick Draw dataset. It is a repository of approximately more than 50 million hand-drawn drawings of about 345 different objects, contributed by over 15 million users. The work we have presented is a useful approach to perform a country-wise analysis of the styles that people use while making strokes to draw an object or image. Since we are dealing with a huge dataset, we are also presenting a powerful technique to effectively reduce the data and its dimension which results in much more simplified and concise data. Based on the simplified data, we are enhancing the performance of our models to recognize these hand-drawn objects based on the strokes that a user has used for drawing them. For the recognition of objects, we have worked on traditional machine learning models - Nearest Neighbor (K-NN), Random Forest Classifier (RFC), Support Vector Classifier (SVC) and Multi-Layer Perceptron model (MLP) by selecting the best hyper-parameters. To make the recognition results better, we have also presented our work on deep learning models - CNN, LSTM, and WaveNet networks.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

A novel framework for generating handwritten datasets

Article 13 November 2020

Deep learning approaches to pattern extraction and recognition in paintings and drawings: an overview

Article Open access 02 April 2021

Deep transfer learning algorithms applied to synthetic drawing images as a tool for supporting Alzheimer’s disease prediction

Article 20 April 2022

References

Stiny, G., Gips, J.: Shape grammars and the generative specification of painting and sculpture. In: IFIP Congress (2), vol. 2, no. 3, August 1971
Google Scholar
Futrelle, R.P., Nikolakis, N.: Efficient analysis of complex diagrams using constraint-based parsing. In: Proceedings of 3rd International Conference on Document Analysis and Recognition, vol. 2, pp. 782–790. IEEE, August 1995
Google Scholar
Bimber, O., Encarnacao, L.M., Stork, A.: A multi-layered architecture for sketch-based interaction within virtual environments. Comput. Graph. 24(6), 851–867 (2000)
Article Google Scholar
Mahoney, J.V., Fromherz, M.P.: Three main concerns in sketch recognition and an approach to addressing them. In: AAAI Spring Symposium on Sketch Understanding, pp. 105–112, March 2002
Google Scholar
Long, A.C., Landay, J.A., Rowe, L.A.: Quill: a gesture design tool for pen-based user interfaces. University of California, Berkeley (2001)
Google Scholar
Gross, M.D., Do, E.Y.L.: Demonstrating the electronic cocktail napkin: a paper-like interface for early design. In: Conference on Human Factors in Computing Systems: Conference Companion on Human Factors in Computing Systems: Common Ground, vol. 13, no. 18, pp. 5–6, April 1996
Google Scholar
Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. In Advances in Neural Information Processing Systems, pp. 1097–1105 (2012)
Google Scholar
Girshick, R., Donahue, J., Darrell, T., Malik, J.: Rich feature hierarchies for accurate object detection and semantic segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 580–587 (2014)
Google Scholar
Donahue, J., et al.: Decaf: a deep convolutional activation feature for generic visual recognition. In International Conference on Machine Learning, pp. 647–655, January 2014
Google Scholar
Davis, R.: Position statement and overview: sketch recognition at MIT. In: AAAI Sketch Understanding Symposium (2002)
Google Scholar
Suykens, J.A., Vandewalle, J.: Least squares support vector machine classifiers. Neural Process. Lett. 9(3), 293–300 (1999)
Article Google Scholar
Zhang, M., Zhang, D.X.: Trained SVMs based rules extraction method for text classification. In: 2008 IEEE International Symposium on IT in Medicine and Education, pp. 16–19. IEEE, December 2008
Google Scholar
Lau, K.W., Wu, Q.H.: Online training of support vector classifier. Pattern Recogn. 36(8), 1913–1920 (2003)
Article Google Scholar
Breiman, L.: Random forests. Mach. Learn. 45(1), 5–32 (2001)
Article Google Scholar
Keller, J.M., Gray, M.R., Givens, J.A.: A fuzzy k-nearest neighbor algorithm. IEEE Trans. Syst. Man Cybern. 4, 580–585 (1985)
Article Google Scholar
Beyer, K., Goldstein, J., Ramakrishnan, R., Shaft, U.: When is “nearest neighbor” meaningful? In: Beeri, C., Buneman, P. (eds.) ICDT 1999. LNCS, vol. 1540, pp. 217–235. Springer, Heidelberg (1999). https://doi.org/10.1007/3-540-49257-7_15
Chapter Google Scholar
Dudani, S.A.: The distance-weighted k-nearest-neighbor rule. IEEE Trans. Syst. Man Cybern. 4, 325–327 (1976)
Article Google Scholar
Jain, A.K., Mao, J., Mohiuddin, K.M.: Artificial neural networks: a tutorial. Computer 29(3), 31–44 (1996)
Article Google Scholar
Kim, Y.: Convolutional neural networks for sentence classification, August 2014. https://arxiv.org/abs/1408.5882
Zhang, X., Zhao, J., LeCun, Y.: Character-level convolutional networks for text classification. In: Proceedings of Advances in Neural Information Processing Systems, pp. 649–657 (2015)
Google Scholar
Xiao, Y., Cho, K.: Efficient character-level document classification by combining convolution and recurrent layers, February 2016. https://arxiv.org/abs/1602.00367
Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)
Article Google Scholar
Du, Y., Wang, W., Wang, L.: Hierarchical recurrent neural network for skeleton based action recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1110–1118 (2015)
Google Scholar
Paine, T.L., et al.: Fast wavenet generation algorithm (2016). arXiv preprint arXiv:1611.09482
Oord, A.V.D., et al.: WaveNet: a generative model for raw audio (2016). arXiv preprint arXiv:1609.03499
Cao, Y., Wang, H., Wang, C., Li, Z., Zhang, L., Zhang, L.: MindFinder: interactive sketch-based image search on millions of images. In: Proceedings of the 18th ACM International Conference on Multimedia, pp. 1605–1608. ACM, October 2010
Google Scholar
Paulson, B., Eoff, B., Wolin, A., Johnston, J., Hammond, T.: Sketch-based educational games: drawing kids away from traditional interfaces. In: Proceedings of the 7th International Conference on Interaction Design and Children, pp. 133–136. ACM, June 2008
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science and Engineering, Jaypee Institute of Information Technology, Noida, India
Mohit Gupta & Pulkit Mehndiratta

Authors

Mohit Gupta
View author publications
You can also search for this author in PubMed Google Scholar
Pulkit Mehndiratta
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Pulkit Mehndiratta .

Editor information

Editors and Affiliations

Missouri University of Science and Technology, Rolla, MO, USA
Sanjay Madria
Harbin Institute of Technology, Shenzhen, China
Philippe Fournier-Viger
Ahmedabad University, Ahmedabad, India
Sanjay Chaudhary
International Institute of Information Technology, Hyderabad, India
P. Krishna Reddy

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Gupta, M., Mehndiratta, P. (2019). Analysis and Recognition of Hand-Drawn Images with Effective Data Handling. In: Madria, S., Fournier-Viger, P., Chaudhary, S., Reddy, P. (eds) Big Data Analytics. BDA 2019. Lecture Notes in Computer Science(), vol 11932. Springer, Cham. https://doi.org/10.1007/978-3-030-37188-3_22

Download citation

DOI: https://doi.org/10.1007/978-3-030-37188-3_22
Published: 12 December 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-37187-6
Online ISBN: 978-3-030-37188-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Analysis and Recognition of Hand-Drawn Images with Effective Data Handling

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

A novel framework for generating handwritten datasets

Deep learning approaches to pattern extraction and recognition in paintings and drawings: an overview

Deep transfer learning algorithms applied to synthetic drawing images as a tool for supporting Alzheimer’s disease prediction

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Analysis and Recognition of Hand-Drawn Images with Effective Data Handling

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

A novel framework for generating handwritten datasets

Deep learning approaches to pattern extraction and recognition in paintings and drawings: an overview

Deep transfer learning algorithms applied to synthetic drawing images as a tool for supporting Alzheimer’s disease prediction

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation