Skip to main content
Log in

Knowing Your Dog Breed: Identifying a Dog Breed with Deep Learning

  • Research Article
  • Published:
International Journal of Automation and Computing Aims and scope Submit manuscript

Abstract

Dog breed identification is essential for many reasons, particularly for understanding individual breeds’ conditions, health concerns, interaction behavior, and natural instinct. This paper presents a solution for identifying dog breeds using their images of their faces. The proposed method applies a deep learning based approach in order to recognize their breeds. The method begins with a transfer learning by retraining existing pre-trained convolutional neural networks (CNNs) on the public dog breed dataset. Then, the image augmentation with various settings is also applied on the training dataset, in order to improve the classification performance. The proposed method is evaluated using three different CNNs with various augmentation settings and comprehensive experimental comparisons. The proposed model achieves a promising accuracy of 89.92% on the published dataset with 133 dog breeds.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

Explore related subjects

Discover the latest articles, news and stories from top researchers in related subjects.

References

  1. S. G. Tong, Y. Y. Huang, Z. M. Tong. A robust face recognition method combining LBP with multi-mirror symmetry for images with various face interferences. International Journal of Automation and Computing, vol. 16, no. 5, pp. 671–682, 2019. DOI: https://doi.org/10.1007/s11633-018-1153-8.

    Article  Google Scholar 

  2. F. K. Zaman, A. A. Shafie, Y. M. Mustafah. Robust face recognition against expressions and partial occlusions. International Journal of Automation and Computing, vol. 13, no. 4, pp. 319–337, 2016. DOI: https://doi.org/10.1007/s11633-016-0974-6.

    Article  Google Scholar 

  3. J. R. Xue, J. W. Fang, P. Zhang. A survey of scene understanding by event reasoning in autonomous driving. International Journal of Automation and Computing, vol. 15, no. 3, pp. 249–266, 2018. DOI: https://doi.org/10.1007/s11633-018-1126-y.

    Article  Google Scholar 

  4. M. Chanvichitkul, P. Kumhom, K. Chamnongthai. Face recognition based dog breed classification using coarse-to-fine concept and PCA. In Proceedings of Asia-Pacific Conference on Communications, IEEE, Bangkok, Thailand, pp. 25–29, 2007. DOI: https://doi.org/10.1109/APCC.2007.4433495.

    Google Scholar 

  5. P. Prasong, K. Chamnongthai. Face-recognition-based dog-breed classification using size and position of each local part, and PCA. In Proceedings of the 9th International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology, IEEE, Phetchaburi, Thailand, 2012. DOI: https://doi.org/10.1109/ECTICon.2012.6254212.

    Book  Google Scholar 

  6. N. Dalal, B. Triggs. Histograms of oriented gradients for human detection. In Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, IEEE, San Diego, USA, pp. 886–893, 2005. DOI: https://doi.org/10.1109/CVPR.2005.177.

    Google Scholar 

  7. D. G. Lowe. Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision, vol. 60, no. 2, pp. 91–110, 2004. DOI: https://doi.org/10.1023/B:VISI.0000029664.99615.94.

    Article  Google Scholar 

  8. O. M. Parkhi, A. Vedaldi, A. Zisserman, C. V. Jawahar. Cats and dogs. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, IEEE, Providence, USA, pp. 3498–3505, 0012. DOI: https://doi.org/10.1009/CVPR.2012.6248092.

    Google Scholar 

  9. J. X. Liu, A. Kanazawa, D. Jacobs, P. Belhumeur. Dog breed classification using part localization. In Proceedings of the 12th European Conference on Computer Vision, Springer, Florence, Italy, pp. 172–185, 2002. DOI: https://doi.org/10.1007/978-3-642-33718-5_13.

    Google Scholar 

  10. K. Lai, X. Y. Tu, S. Yanushkevich. Dog identification using soft biometrics and neural networks. In Proceedings of International Joint Conference on Neural Networks, IEEE, Budapest, Hungyry, pp. 1–8, 2019. DOI: 10.1109/IJCNN.2019.8851971.

    Google Scholar 

  11. X. Y. Tu, K. Lai, S. Yanushkevich. Transfer learning on convolutional neural networks for dog identification. In Proceedings of the 9th IEEE International Conference on Software Engineering and Service Science, IEEE, Beijing, China, pp. 357–360, 2018. DOI: https://doi.org/10.1109/ICSESS.2018.8663718.

    Google Scholar 

  12. B. Zhao, J. S. Feng, X. Wu, S. C Yan. A survey on deep learning-based fine-grained object classification and semantic segmentation. International Journal of Automation and Computing, vol. 14, no. 2, pp. 119–135, 2017. DOI: https://doi.org/10.1007/s11633-017-1053-3.

    Article  Google Scholar 

  13. O. Russakovsky, J. Deng, H. Su, J. Krause, S. Satheesh, S. A. Ma, Z. H. Huang, A. Karpathy, A. Khosla, M. Bernstein, A. C. Berg, F. F. Li. ImageNet large scale visual recognition challenge. International Journal of Computer Vision, vol. 115, no. 3, pp. 211–252, 2085. DOL https://doi.org/10.1007/s11263-015-0816-y.

    Article  MathSciNet  Google Scholar 

  14. C. Szegedy, V. Vanhoucke, S Ioffe, J. Shlens, Z. Wojna. Rethinking the inception architecture for computer vision. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, IEEE, aas Vegas, USA, pp. 2818–2826, 2016. DOI: https://doi.org/10.1109/CVPR.2016.308.

    Google Scholar 

  15. C. Szegedy, W. Liu, Y. Q. Jia, P. Sermanet, S. Reed, D. Anguelov, D. Erhan, V. Vanhoucke, A. Rabinovich. Going deeper with convolutions. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, IEEE, Boston, MA, USA, pp. 1–9, 2015. DOI: https://doi.org/10.1109/CVPR.2015.7298594.

    Google Scholar 

  16. M. Sandler, A. Howard, M. L. Zhu, A. Zhmoginov, L. C Chen. MobHenetv2: inverted residuals and linear bottle-necks. In Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognition, IEEE, Salt Lake City, USA, pp. 4500–4520, 2018. DOI: https://doi.org/10.1109/CVPR.2018.00474.

    Google Scholar 

  17. B. Zoph, V. Vasudevan, J. SMens, Q. V. Le. Learnmg transferable architectures for scalable image recognition. In Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognition, IEEE, Satt Lake City, USA, pp.697–87100, 2018. DOI: https://doi.org/10.1109/CVPR.2018.00907.

    Google Scholar 

  18. J. Yosinski, J. Clune, Y. Bengio, H. Lipson. How transferable are features in deep neural networks? In Proceedings of the 27th International Conference on Neural Information Processing Systems, MIT Press, Montreal, Canada, pp. 3320–3328, 2014.

    Google Scholar 

  19. L. Shao, F. Zhu, X. L. Li. Transfer learning for visual categorization: a survey. IEEE Transactions on Neural Networks and Learning Systems, vol. 26, no. 5, pp. 1019–1034, 2015. DOI: https://doi.org/10.1109/TNNLS.2014.2330900.

    Article  MathSciNet  Google Scholar 

  20. J. Deng, W. Dong, R. Socher, L. J. Li, K. Li, F. F. Li. ImageNet: a large-scale hierarchical image database. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, IEEE, Miami, USA, pp. 248–255, 2009. DOI: https://doi.org/10.1109/CVPR.2009.5206848.

    Google Scholar 

  21. T. Y. Lin, M. Maire, S. Belongie, J. Hays, P. Perona, D. Ramanan, P. Dollar, C. L. Zitnick. Microsoft coco: common objects in context. In Proceedings of the 13th European Conference on Computer Vision, Springer, Zurich, Switzerland, pp. 740–755, 2014. DOI: https://doi.org/10.1007/9783-319-10602-1_48.

    Google Scholar 

  22. N. Srivastava, G. Hinton, A. Krizhevsky, L Sutskever, R. Salakhutdinov. Dropout: a simple way to prevent neural networks from oveffittmg. Journal of Machine Learning Research, vol. 15, pp. 1929–1958, 2014.

    MATH  Google Scholar 

  23. K. Weiss, T. M. Khoshgoftaar, D. D. Wang. A survey of transfer learning. Journal of Big Data, vol. 3, no. 1, Article number 9, 2016. DOI: https://doi.org/10.1186/s40537-016-0043-6.

    Google Scholar 

  24. A. R. Zamir, A. Sae, W. Shen, L. Guibas, J. Malik, S. Savarese. Taskonomy: disentangling task transfer learning. In Proceedings of IEEE/ CVF Conference on Computer Vision and Pattern Recognition, IEEE, Salt Lake City, USA, pp. 3712–3722, 2018. DOI: https://doi.org/10.1109/CVPR.2018.00391.

    Google Scholar 

  25. S. Iffee, C. Szegedy. Batch normalization: accelerating deep network training by reducing internal covariate shift. In Proceedings of the 32nd International Conference on Machine Learning, Lille, France, pp. 448–456, 2015.

  26. C. Shorten, T. M. Khoshgoftaar. A survey on image data augmentation for deep learning. Journal of Big Data, vol. 6, no. 1, Article number 60, 2019.

    Google Scholar 

  27. L. Perez, J. Wang. The effectiveness of data augmentation in image classification using deep learning. [online], Available: https://arxiv.orglabs/1712.04621, 2017.

  28. I. J. Goodfellow, J. Pouget-Abadie, Ml. Mirza, B. Xu, D. Warde-Farley, S. Ozair, A. Courville, Y. Bengio. Generative adversarial nets. In Proceedings of the 27th International Conference on Neural Information Processing Systems, Montreal, Canada, pp. 2672–2680, 2014.

  29. A. Khosla, N. Jayadevaprakash, B. P. Yao, F. F. Li. Novel dataset for fine-grained image categorization: Stafford dogs. In Proceedings of the 1st Workshop on Fine-Grained Visual Categorization, IEEE Conference on Computer Vision and Pattern Recognition, IEEE, Colorado Springs, USA, 2011.

    Google Scholar 

Download references

Acknowledgments

This research was supported by the Royal Golden Jubilee (RGJ) Ph.D. Programme under the Thailand Research Fund (No. PHD/0053/2 561)

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Worapan Kusakunniran.

Additional information

Punyanuch Borwarnginn received the B. Sc. degree in information and communication technology from Mahidol University, Thailand in 2009, and the M. Sc. degree in informatics from the University of Edinburgh, UK in 2011. She is currently a Ph. D. degree candidate in computer science from Faculty of Information and Communication Technology, Mahidol University, Thailand.

Her research interests include image processing, biometrics, computer vision, pattern recognition and machine learning.

Worapan Kusakunniran received tee B.Eng. degree in computer engineering from the University of New South Wales (UNSW), Australia in 2008, and the Ph. D. degree in computer science and engineering from UNSW, in cooperation with the Neville Roach Laboratory, National ICT Australia, Autraaha in 2013. He is currently a lecturer with Faculty of Information and Communication Technology, Mahidol University, Thailand. He is the author of several papers in top international conferences and journals. Dr. Kusakunniran served as a program committee member for many international conferences and workshops. Also, he has served as a reviewer for several international coherences and journals, such as International Conference on Pattern Recognition, IEEE International Conference on Image Processing, IEEE International Conference on Advanced Video and Signal Based Surveillance, Pattern Recognition, IEEE Transactions on Image Processing, IEEE Transactions on Information Forensics and Security, and IEEE Signal Processing Letters.

His research interests include biometrics, pattern recognition, medical image processing, computer vision, multimedia, and machine learning.

Sarattha Karnjanapreechakorn received the B. Sc. degree in electrical-mechanical manufacturing engineering from Kasertsart University, Thailand in 2015, and the M. Sc. degree in game technology and gamification from Mahidol University, Thailand in 2017. He is currently a Ph. D. degree candidate in computer science of Faculty of Information and Communication Technology, Mahidol University, Thailand.

His research interests include image processing, biometrics, computer vision, pattern recognition and machine learning.

Kittikhun Thongkanchorn received the B. Sc. degree in information and communication technology from University of Mahidol, Thailand in 2007, and the M. Sc degree in computer science from Faculty of ICT, Mahidol University, Thailand in 2012. He is currently a computer scientist, senior professional level with Faculty of ICT, Mahidol University, Thailand. His research interests include computer system and network, elastic computing and distributed system, computer security and policy, image processing and machine learning.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Borwarnginn, P., Kusakunniran, W., Karnjanapreechakorn, S. et al. Knowing Your Dog Breed: Identifying a Dog Breed with Deep Learning. Int. J. Autom. Comput. 18, 45–54 (2021). https://doi.org/10.1007/s11633-020-1261-0

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11633-020-1261-0

Keywords