Abstract
Dog breed identification is essential for many reasons, particularly for understanding individual breeds’ conditions, health concerns, interaction behavior, and natural instinct. This paper presents a solution for identifying dog breeds using their images of their faces. The proposed method applies a deep learning based approach in order to recognize their breeds. The method begins with a transfer learning by retraining existing pre-trained convolutional neural networks (CNNs) on the public dog breed dataset. Then, the image augmentation with various settings is also applied on the training dataset, in order to improve the classification performance. The proposed method is evaluated using three different CNNs with various augmentation settings and comprehensive experimental comparisons. The proposed model achieves a promising accuracy of 89.92% on the published dataset with 133 dog breeds.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
S. G. Tong, Y. Y. Huang, Z. M. Tong. A robust face recognition method combining LBP with multi-mirror symmetry for images with various face interferences. International Journal of Automation and Computing, vol. 16, no. 5, pp. 671–682, 2019. DOI: https://doi.org/10.1007/s11633-018-1153-8.
F. K. Zaman, A. A. Shafie, Y. M. Mustafah. Robust face recognition against expressions and partial occlusions. International Journal of Automation and Computing, vol. 13, no. 4, pp. 319–337, 2016. DOI: https://doi.org/10.1007/s11633-016-0974-6.
J. R. Xue, J. W. Fang, P. Zhang. A survey of scene understanding by event reasoning in autonomous driving. International Journal of Automation and Computing, vol. 15, no. 3, pp. 249–266, 2018. DOI: https://doi.org/10.1007/s11633-018-1126-y.
M. Chanvichitkul, P. Kumhom, K. Chamnongthai. Face recognition based dog breed classification using coarse-to-fine concept and PCA. In Proceedings of Asia-Pacific Conference on Communications, IEEE, Bangkok, Thailand, pp. 25–29, 2007. DOI: https://doi.org/10.1109/APCC.2007.4433495.
P. Prasong, K. Chamnongthai. Face-recognition-based dog-breed classification using size and position of each local part, and PCA. In Proceedings of the 9th International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology, IEEE, Phetchaburi, Thailand, 2012. DOI: https://doi.org/10.1109/ECTICon.2012.6254212.
N. Dalal, B. Triggs. Histograms of oriented gradients for human detection. In Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, IEEE, San Diego, USA, pp. 886–893, 2005. DOI: https://doi.org/10.1109/CVPR.2005.177.
D. G. Lowe. Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision, vol. 60, no. 2, pp. 91–110, 2004. DOI: https://doi.org/10.1023/B:VISI.0000029664.99615.94.
O. M. Parkhi, A. Vedaldi, A. Zisserman, C. V. Jawahar. Cats and dogs. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, IEEE, Providence, USA, pp. 3498–3505, 0012. DOI: https://doi.org/10.1009/CVPR.2012.6248092.
J. X. Liu, A. Kanazawa, D. Jacobs, P. Belhumeur. Dog breed classification using part localization. In Proceedings of the 12th European Conference on Computer Vision, Springer, Florence, Italy, pp. 172–185, 2002. DOI: https://doi.org/10.1007/978-3-642-33718-5_13.
K. Lai, X. Y. Tu, S. Yanushkevich. Dog identification using soft biometrics and neural networks. In Proceedings of International Joint Conference on Neural Networks, IEEE, Budapest, Hungyry, pp. 1–8, 2019. DOI: 10.1109/IJCNN.2019.8851971.
X. Y. Tu, K. Lai, S. Yanushkevich. Transfer learning on convolutional neural networks for dog identification. In Proceedings of the 9th IEEE International Conference on Software Engineering and Service Science, IEEE, Beijing, China, pp. 357–360, 2018. DOI: https://doi.org/10.1109/ICSESS.2018.8663718.
B. Zhao, J. S. Feng, X. Wu, S. C Yan. A survey on deep learning-based fine-grained object classification and semantic segmentation. International Journal of Automation and Computing, vol. 14, no. 2, pp. 119–135, 2017. DOI: https://doi.org/10.1007/s11633-017-1053-3.
O. Russakovsky, J. Deng, H. Su, J. Krause, S. Satheesh, S. A. Ma, Z. H. Huang, A. Karpathy, A. Khosla, M. Bernstein, A. C. Berg, F. F. Li. ImageNet large scale visual recognition challenge. International Journal of Computer Vision, vol. 115, no. 3, pp. 211–252, 2085. DOL https://doi.org/10.1007/s11263-015-0816-y.
C. Szegedy, V. Vanhoucke, S Ioffe, J. Shlens, Z. Wojna. Rethinking the inception architecture for computer vision. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, IEEE, aas Vegas, USA, pp. 2818–2826, 2016. DOI: https://doi.org/10.1109/CVPR.2016.308.
C. Szegedy, W. Liu, Y. Q. Jia, P. Sermanet, S. Reed, D. Anguelov, D. Erhan, V. Vanhoucke, A. Rabinovich. Going deeper with convolutions. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, IEEE, Boston, MA, USA, pp. 1–9, 2015. DOI: https://doi.org/10.1109/CVPR.2015.7298594.
M. Sandler, A. Howard, M. L. Zhu, A. Zhmoginov, L. C Chen. MobHenetv2: inverted residuals and linear bottle-necks. In Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognition, IEEE, Salt Lake City, USA, pp. 4500–4520, 2018. DOI: https://doi.org/10.1109/CVPR.2018.00474.
B. Zoph, V. Vasudevan, J. SMens, Q. V. Le. Learnmg transferable architectures for scalable image recognition. In Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognition, IEEE, Satt Lake City, USA, pp.697–87100, 2018. DOI: https://doi.org/10.1109/CVPR.2018.00907.
J. Yosinski, J. Clune, Y. Bengio, H. Lipson. How transferable are features in deep neural networks? In Proceedings of the 27th International Conference on Neural Information Processing Systems, MIT Press, Montreal, Canada, pp. 3320–3328, 2014.
L. Shao, F. Zhu, X. L. Li. Transfer learning for visual categorization: a survey. IEEE Transactions on Neural Networks and Learning Systems, vol. 26, no. 5, pp. 1019–1034, 2015. DOI: https://doi.org/10.1109/TNNLS.2014.2330900.
J. Deng, W. Dong, R. Socher, L. J. Li, K. Li, F. F. Li. ImageNet: a large-scale hierarchical image database. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, IEEE, Miami, USA, pp. 248–255, 2009. DOI: https://doi.org/10.1109/CVPR.2009.5206848.
T. Y. Lin, M. Maire, S. Belongie, J. Hays, P. Perona, D. Ramanan, P. Dollar, C. L. Zitnick. Microsoft coco: common objects in context. In Proceedings of the 13th European Conference on Computer Vision, Springer, Zurich, Switzerland, pp. 740–755, 2014. DOI: https://doi.org/10.1007/9783-319-10602-1_48.
N. Srivastava, G. Hinton, A. Krizhevsky, L Sutskever, R. Salakhutdinov. Dropout: a simple way to prevent neural networks from oveffittmg. Journal of Machine Learning Research, vol. 15, pp. 1929–1958, 2014.
K. Weiss, T. M. Khoshgoftaar, D. D. Wang. A survey of transfer learning. Journal of Big Data, vol. 3, no. 1, Article number 9, 2016. DOI: https://doi.org/10.1186/s40537-016-0043-6.
A. R. Zamir, A. Sae, W. Shen, L. Guibas, J. Malik, S. Savarese. Taskonomy: disentangling task transfer learning. In Proceedings of IEEE/ CVF Conference on Computer Vision and Pattern Recognition, IEEE, Salt Lake City, USA, pp. 3712–3722, 2018. DOI: https://doi.org/10.1109/CVPR.2018.00391.
S. Iffee, C. Szegedy. Batch normalization: accelerating deep network training by reducing internal covariate shift. In Proceedings of the 32nd International Conference on Machine Learning, Lille, France, pp. 448–456, 2015.
C. Shorten, T. M. Khoshgoftaar. A survey on image data augmentation for deep learning. Journal of Big Data, vol. 6, no. 1, Article number 60, 2019.
L. Perez, J. Wang. The effectiveness of data augmentation in image classification using deep learning. [online], Available: https://arxiv.orglabs/1712.04621, 2017.
I. J. Goodfellow, J. Pouget-Abadie, Ml. Mirza, B. Xu, D. Warde-Farley, S. Ozair, A. Courville, Y. Bengio. Generative adversarial nets. In Proceedings of the 27th International Conference on Neural Information Processing Systems, Montreal, Canada, pp. 2672–2680, 2014.
A. Khosla, N. Jayadevaprakash, B. P. Yao, F. F. Li. Novel dataset for fine-grained image categorization: Stafford dogs. In Proceedings of the 1st Workshop on Fine-Grained Visual Categorization, IEEE Conference on Computer Vision and Pattern Recognition, IEEE, Colorado Springs, USA, 2011.
Acknowledgments
This research was supported by the Royal Golden Jubilee (RGJ) Ph.D. Programme under the Thailand Research Fund (No. PHD/0053/2 561)
Author information
Authors and Affiliations
Corresponding author
Additional information
Punyanuch Borwarnginn received the B. Sc. degree in information and communication technology from Mahidol University, Thailand in 2009, and the M. Sc. degree in informatics from the University of Edinburgh, UK in 2011. She is currently a Ph. D. degree candidate in computer science from Faculty of Information and Communication Technology, Mahidol University, Thailand.
Her research interests include image processing, biometrics, computer vision, pattern recognition and machine learning.
Worapan Kusakunniran received tee B.Eng. degree in computer engineering from the University of New South Wales (UNSW), Australia in 2008, and the Ph. D. degree in computer science and engineering from UNSW, in cooperation with the Neville Roach Laboratory, National ICT Australia, Autraaha in 2013. He is currently a lecturer with Faculty of Information and Communication Technology, Mahidol University, Thailand. He is the author of several papers in top international conferences and journals. Dr. Kusakunniran served as a program committee member for many international conferences and workshops. Also, he has served as a reviewer for several international coherences and journals, such as International Conference on Pattern Recognition, IEEE International Conference on Image Processing, IEEE International Conference on Advanced Video and Signal Based Surveillance, Pattern Recognition, IEEE Transactions on Image Processing, IEEE Transactions on Information Forensics and Security, and IEEE Signal Processing Letters.
His research interests include biometrics, pattern recognition, medical image processing, computer vision, multimedia, and machine learning.
Sarattha Karnjanapreechakorn received the B. Sc. degree in electrical-mechanical manufacturing engineering from Kasertsart University, Thailand in 2015, and the M. Sc. degree in game technology and gamification from Mahidol University, Thailand in 2017. He is currently a Ph. D. degree candidate in computer science of Faculty of Information and Communication Technology, Mahidol University, Thailand.
His research interests include image processing, biometrics, computer vision, pattern recognition and machine learning.
Kittikhun Thongkanchorn received the B. Sc. degree in information and communication technology from University of Mahidol, Thailand in 2007, and the M. Sc degree in computer science from Faculty of ICT, Mahidol University, Thailand in 2012. He is currently a computer scientist, senior professional level with Faculty of ICT, Mahidol University, Thailand. His research interests include computer system and network, elastic computing and distributed system, computer security and policy, image processing and machine learning.
Rights and permissions
About this article
Cite this article
Borwarnginn, P., Kusakunniran, W., Karnjanapreechakorn, S. et al. Knowing Your Dog Breed: Identifying a Dog Breed with Deep Learning. Int. J. Autom. Comput. 18, 45–54 (2021). https://doi.org/10.1007/s11633-020-1261-0
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11633-020-1261-0