Skip to main content

Analysis and Recognition of Hand-Drawn Images with Effective Data Handling

  • Conference paper
  • First Online:
Big Data Analytics (BDA 2019)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 11932))

Included in the following conference series:

Abstract

Everyday data is being produced at a mind-boggling rate and at current pace, we are generating more than 2.5 quintillion bytes of data each day. With more data, there comes the chance of generating and understanding more useful information out of it. With this view in mind, we are presenting our work on Quick Draw dataset. It is a repository of approximately more than 50 million hand-drawn drawings of about 345 different objects, contributed by over 15 million users. The work we have presented is a useful approach to perform a country-wise analysis of the styles that people use while making strokes to draw an object or image. Since we are dealing with a huge dataset, we are also presenting a powerful technique to effectively reduce the data and its dimension which results in much more simplified and concise data. Based on the simplified data, we are enhancing the performance of our models to recognize these hand-drawn objects based on the strokes that a user has used for drawing them. For the recognition of objects, we have worked on traditional machine learning models - Nearest Neighbor (K-NN), Random Forest Classifier (RFC), Support Vector Classifier (SVC) and Multi-Layer Perceptron model (MLP) by selecting the best hyper-parameters. To make the recognition results better, we have also presented our work on deep learning models - CNN, LSTM, and WaveNet networks.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Stiny, G., Gips, J.: Shape grammars and the generative specification of painting and sculpture. In: IFIP Congress (2), vol. 2, no. 3, August 1971

    Google Scholar 

  2. Futrelle, R.P., Nikolakis, N.: Efficient analysis of complex diagrams using constraint-based parsing. In: Proceedings of 3rd International Conference on Document Analysis and Recognition, vol. 2, pp. 782–790. IEEE, August 1995

    Google Scholar 

  3. Bimber, O., Encarnacao, L.M., Stork, A.: A multi-layered architecture for sketch-based interaction within virtual environments. Comput. Graph. 24(6), 851–867 (2000)

    Article  Google Scholar 

  4. Mahoney, J.V., Fromherz, M.P.: Three main concerns in sketch recognition and an approach to addressing them. In: AAAI Spring Symposium on Sketch Understanding, pp. 105–112, March 2002

    Google Scholar 

  5. Long, A.C., Landay, J.A., Rowe, L.A.: Quill: a gesture design tool for pen-based user interfaces. University of California, Berkeley (2001)

    Google Scholar 

  6. Gross, M.D., Do, E.Y.L.: Demonstrating the electronic cocktail napkin: a paper-like interface for early design. In: Conference on Human Factors in Computing Systems: Conference Companion on Human Factors in Computing Systems: Common Ground, vol. 13, no. 18, pp. 5–6, April 1996

    Google Scholar 

  7. Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. In Advances in Neural Information Processing Systems, pp. 1097–1105 (2012)

    Google Scholar 

  8. Girshick, R., Donahue, J., Darrell, T., Malik, J.: Rich feature hierarchies for accurate object detection and semantic segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 580–587 (2014)

    Google Scholar 

  9. Donahue, J., et al.: Decaf: a deep convolutional activation feature for generic visual recognition. In International Conference on Machine Learning, pp. 647–655, January 2014

    Google Scholar 

  10. Davis, R.: Position statement and overview: sketch recognition at MIT. In: AAAI Sketch Understanding Symposium (2002)

    Google Scholar 

  11. Suykens, J.A., Vandewalle, J.: Least squares support vector machine classifiers. Neural Process. Lett. 9(3), 293–300 (1999)

    Article  Google Scholar 

  12. Zhang, M., Zhang, D.X.: Trained SVMs based rules extraction method for text classification. In: 2008 IEEE International Symposium on IT in Medicine and Education, pp. 16–19. IEEE, December 2008

    Google Scholar 

  13. Lau, K.W., Wu, Q.H.: Online training of support vector classifier. Pattern Recogn. 36(8), 1913–1920 (2003)

    Article  Google Scholar 

  14. Breiman, L.: Random forests. Mach. Learn. 45(1), 5–32 (2001)

    Article  Google Scholar 

  15. Keller, J.M., Gray, M.R., Givens, J.A.: A fuzzy k-nearest neighbor algorithm. IEEE Trans. Syst. Man Cybern. 4, 580–585 (1985)

    Article  Google Scholar 

  16. Beyer, K., Goldstein, J., Ramakrishnan, R., Shaft, U.: When is “nearest neighbor” meaningful? In: Beeri, C., Buneman, P. (eds.) ICDT 1999. LNCS, vol. 1540, pp. 217–235. Springer, Heidelberg (1999). https://doi.org/10.1007/3-540-49257-7_15

    Chapter  Google Scholar 

  17. Dudani, S.A.: The distance-weighted k-nearest-neighbor rule. IEEE Trans. Syst. Man Cybern. 4, 325–327 (1976)

    Article  Google Scholar 

  18. Jain, A.K., Mao, J., Mohiuddin, K.M.: Artificial neural networks: a tutorial. Computer 29(3), 31–44 (1996)

    Article  Google Scholar 

  19. Kim, Y.: Convolutional neural networks for sentence classification, August 2014. https://arxiv.org/abs/1408.5882

  20. Zhang, X., Zhao, J., LeCun, Y.: Character-level convolutional networks for text classification. In: Proceedings of Advances in Neural Information Processing Systems, pp. 649–657 (2015)

    Google Scholar 

  21. Xiao, Y., Cho, K.: Efficient character-level document classification by combining convolution and recurrent layers, February 2016. https://arxiv.org/abs/1602.00367

  22. Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)

    Article  Google Scholar 

  23. Du, Y., Wang, W., Wang, L.: Hierarchical recurrent neural network for skeleton based action recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1110–1118 (2015)

    Google Scholar 

  24. Paine, T.L., et al.: Fast wavenet generation algorithm (2016). arXiv preprint arXiv:1611.09482

  25. Oord, A.V.D., et al.: WaveNet: a generative model for raw audio (2016). arXiv preprint arXiv:1609.03499

  26. Cao, Y., Wang, H., Wang, C., Li, Z., Zhang, L., Zhang, L.: MindFinder: interactive sketch-based image search on millions of images. In: Proceedings of the 18th ACM International Conference on Multimedia, pp. 1605–1608. ACM, October 2010

    Google Scholar 

  27. Paulson, B., Eoff, B., Wolin, A., Johnston, J., Hammond, T.: Sketch-based educational games: drawing kids away from traditional interfaces. In: Proceedings of the 7th International Conference on Interaction Design and Children, pp. 133–136. ACM, June 2008

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Pulkit Mehndiratta .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Gupta, M., Mehndiratta, P. (2019). Analysis and Recognition of Hand-Drawn Images with Effective Data Handling. In: Madria, S., Fournier-Viger, P., Chaudhary, S., Reddy, P. (eds) Big Data Analytics. BDA 2019. Lecture Notes in Computer Science(), vol 11932. Springer, Cham. https://doi.org/10.1007/978-3-030-37188-3_22

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-37188-3_22

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-37187-6

  • Online ISBN: 978-3-030-37188-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics