Abstract
This paper focuses on the face detection problem of three popular animal categories that need control such as horses, cats and dogs. Existing detectors are generally based on Convolutional Neural Networks (CNNs) as backbones. CNNs are strong and fascinating classification tools but present some weak points such as the big number of layers and parameters, require a huge dataset and ignore the relationship between image parts. To be precise, to deal with these problems, this paper contributes to present a new Convolutional Neural Network for Animal Face Detection (CNNAFD), a new backbone CNNAFD-MobileNetV2 for animal face detection and a new Tunisian Horse Detection Database (THDD). CNNAFD used a processed filters based on gradient features and applied with a new way. A new sparse convolutional layer ANOFS-Conv is proposed through a sparse feature selection method known as Automated Negotiation-based Online Feature Selection (ANOFS). The ANOFS method is used as a training optimizer for the new ANOFS-Conv layer. CNNAFD ends by stacked fully connected layers which represent a strong classifier. The fusion of CNNAFD and MobileNetV2 constructs the new network CNNAFD-MobileNetV2 which improves the classification results and gives better detection decisions. The proposed detector with the new CNNAFD-MobileNetV2 network provides effective results and proves to be competitive with the detectors of the related works with an Average Precision equal to 98.28%, 99.78%, 99.00% and 92.86% on the THDD, Cat Database, Stanford Dogs Dataset and Oxford-IIIT Pet Dataset respectively.
Similar content being viewed by others
References
BenSaid F, Alimi AM (2016) A new online feature selection method for decision making problems with ultra-high dimension and massive training data. Journal of Information Assurance and Security, pp 293–301
BenSaid F, Alimi AsM (2016) Ofsf-bc: Online feature selection framework for binary classification. In: International conference of computer systems and applications, pp 1–8
BenSaid F, Alimi AM (2015) Anofs: Automated negotiation based online feature selection method. In: International conference on intelligent systems design and applications, pp 225–230
BenSaid F, Alimi AM (2016) Multi agent-learner based online feature selection system. In: IEEE international conference on systems, man, and cybernetics (SMC), pp 003652–003657
BenSaid F, Alimi AM (2021) Online feature selection system for big data classification based on multi-objective automated negotiation. Pattern Recogn 110:107–629
Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: IEEE computer society conference on computer vision and pattern recognition (CVPR), pp 886–893
Duan J, Liao S, Zhou S, ZLi S (2016) Face classification: A specialized benchmark study. In: Chinese conference on biometric recognition, pp 22–29
Everingham M, Gool LV, Williams CKI, Winn J, Zisserman A (2010) The pascal visual object classes (voc) challenge. Int J Comput Vis (IJCV) 88:303–338
Howse J, Alekhin A (2020) Haarcascade frontal cat face. https://github.com/opencv/opencv/blob/master/data/haarcascades/haarcascade_frontalcatface.xml. Accessed 13 April 2020
Howse J, Alekhin A (2020) Haarcascade frontal cat face extended. OpenCV, https://github.com/opencv/opencv/blob/master/data/haarcascades/haarcascade_frontalcatface_extended.xml. Accessed 13 April 2020
Huang G, Liu Z, Maaten L, Weinberger KQ (2017) Densely connected convolutional networks. In: IEEE conference on computer vision and pattern recognition (CVPR), pp 2261–2269
Jarraya I, Ouarda W, Alimi AM (2015) A preliminary investigation on horses recognition using facial texture features. In: IEEE International Conference on Systems, Man, and Cybernetics, pp 2803–2808
Jarraya I, Ouarda W, Alimi AM (2017) Deep neural network features for horses identity recognition using multiview horses? face pattern. In: International conference on machine vision (ICMV), vol 10341, pp 52–56
Jocher G, Stoken A, Borovec J et al (2020) ultralytics/yolov5: v3.0. https://zenodo.org/record/3983579#.YKUgT6HjLIU. Accessed 13 August 2020
Joshi S, Verma DK, Saxena G, Paraye A (2019) Issues in training a convolutional neural network model for image classification. In: International conference on advances in computing and data sciences, pp 282–293
Khosla A, Jayadevaprakash N, Yao B, Li F-F (2011) Novel dataset for fine-grained image categorization : Stanford dogs. In: First workshop on fine-grained visual categorization, CVPR
Kishore T, Jha A, Kumar S, Bhattacharya S, Sultana M (2021) Deep cnn based automatic detection and identification of bengal tigers. In: Computational intelligence in communications and business analytics, pp 503–507
Korte T (2014) 10,000 cat pictures (for science). Center For Data Innovation. https://datainnovation.org/2014/08/10000-cat-pictures-for-science/. Accessed 21 August 2014
Liu W, Anguelov D, Erhan D, Szegedy C, Reed S, Fu C-Y, Berg AC (2016) Ssd: Single shot multibox detector. European Conference on Computer Vision 9905:21–37
Liu W, Hasan I, Liao S (2020) Center and scale prediction: A box-free approach for pedestrian and face detection. In: arXiv:1904.02948v3
Manoj S, Rakshith S, Kanchana V (2021) Identification of cattle breed using the convolutional neural network. In: International conference on signal processing and communication. https://doi.org/10.1109/ICSPC51351.2021.9451706, pp 503–507
Mukai N, Zhang Y, Chang Y (2018) Pet face detection. In: Nicograph International (NicoInt). https://doi.org/10.1109/NICOINT.2018.00018, pp 52–57
Nelson J (2020) Training a tensorflow mobilenet object detection model with a custom dataset. Roboflow. https://blog.roboflow.com/training-a-tensorflow-object-detection-model-with-a-custom-dataset/. Accessed 09 February 2020
Ouarda W, Trichili H, Alimi AM, Solaiman B (2013) Combined local features selection for face recognition based on na?ve bayesian classification. In: International conference on hybrid intelligent systems, pp 240–245
Ouarda W, Trichili H, Alimi AM, Solaiman B (2014) Face recognition based on geometric features using support vector machines. In: International conference of soft computing and pattern recognition, pp 89–95
Ouarda W, Trichili H, Alimi AM, Solaiman B (2014) Mlp neural network for face recognition based on gabor features and dimensionality reduction techniques. In: International conference on multimedia computing and systems. https://doi.org/10.1109/ICMCS.2014.6911265, pp 127–134
Ouarda W, Trichili H, Alimi AM, Solaiman B (2015) Bag of face recognition systems based on holistic approaches. In: International conference on intelligent systems design and applications (ISDA), pp 201–206
Ouarda W, Trichili H, Alimi A M, Solaiman B (2016) Towards a novel biometric system for smart riding club. Journal of information assurance and security (JIAS) 11:201–213
Parkhi OM, Vedaldi A, Zisserman A, Jawahar CV (2012) Cats and dogs. In: IEEE conference on computer vision and pattern recognition
Paul V, Jones MJ (2004) Robust real-time face detection. Int J Comput Vis 57:137?154. https://doi.org/10.1023/B:VISI.0000013087.49260.fb
Rathor S, Kumari S, Singh R, Gupta P (2021) Two layers machine learning architecture for animal classification using hog and lbp. In: Proceedings of international conference on communication and artificial intelligence, pp 445–453
Redmon J, Farhadi A (2016) Yolo9000: Better, faster, stronger. arXiv:1612.08242
Redmon J, Farhadi A (2018) Yolov3: An incremental improvement. arXiv:1804.02767v1
Ren S, He K, Girshick R, Sun J (2015) Faster r-cnn: Towards real-time object detection with region proposal networks. In: Advances in neural information processing systems, vol 28, pp 91–99
Sabour S, Frosst N, Hinton G (2017) Dynamic routing between capsules. In: Proceedings of the 31st international conference on neural information processing systems, p 3859?3869
Sandler M, Howard A, Zhu M, Zhmoginov A, Chen L (2018) Mobilenetv2: Inverted residuals and linear bottlenecks. In: IEEE conference on computer vision and pattern recognition, pp 4510–4520
Simonyan K, Zisserman A (2015) Very deep convolutional networks for large-scale image recognition. In: International conference on learning representations
Szegedy C, Vanhoucke V, Ioffe S, Shlens J, Wojna Z (2016) Rethinking the inception architecture for computer vision. In: IEEE conference on computer vision and pattern recognition (CVPR), pp 2818–2826
Szegedy C, Vanhoucke V, Ioffe S, Shlens J, Wojna Z (2016) Rethinking the inception architecture for computer vision. In: IEEE conference on computer vision and pattern recognition (CVPR), pp 2818–2826
Taunk P, Jayasri G, Priya J P, Kumar NS (2020) Face detection using viola jones with haar cascade. Test Engineering and Management, 83
Tureckova A, Holik T, Oplatkova ZK (2020) Dog face detection using yolo network. MENDEL Soft Computing Journal 26:2571–3701
Vlachynska A, Oplatkova ZK, Turecek T (2019) Dog face detection and localization of dogface’s landmarks. Advances in Intelligent Systems and Computing 764:465–476
Wang C-Y, Liao H-YM, Yeh I-H, Wu Y-H, Chen P-Y, Hsieh J-W (2019) Cspnet: A new backbone that can enhance learning capability of cnn. In: ArXiv: Comput Vis Pattern Recognit
Wu Y, Kirillov A, Massa F, Lo W-Y, Girshick R (2019) Detectron2. https://github.com/facebookresearch/detectron2
Xie S, Girshick R, Dollár P, Tu Z, He K (2016) Aggregated residual transformations for deep neural networks. arXiv:1611.05431
Yamada A, Kojima K, Kiyama J, Okamoto M, Murata H (2011) Directional edge-based dog and cat face detection method for digital camera. In: IEEE international conference on consumer electronics (ICCE), pp 87–88
Zagoruyko S, Komodakis N (2017) Wide residual networks. arXiv:1605.07146v4
Zhang B, Li J, Wang Y, Tai Y, Wang C, Li J, Huang F, Xia Y, Pei W, Ji R (2020) Asfd: Automatic and scalable face detector. arXiv:2003.11228
Zhang C, Zhang Z (2010) A survey of recent advances in face detection. Microsoft, https://www.microsoft.com/en-us/research/publication/a-survey-of-recent-advances-in-face-detection/. Accessed June 2010
Zhang W, Sun J, Tang X (2011) From tiger to panda: Animal head detection. IEEE Trans Image Process 20:1696–1708
Acknowledgment
The authors would like to acknowledge that the THDD was created with the help of four Riding Clubs in Sfax, Tunisia: Equestrian Clubs of road Mahdia, road Tunis in Sakiet Ezzit, road Ain Km 17, and road Saltnia Km 17.
Funding
The research leading to these results has received funding from the Tunisian Ministry of Higher Education and Scientific Research under the grant agreement number LR11ES48.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of Interests
The authors declare that they have no conflict of interest.
Additional information
Financial interests
The authors declare they have no financial interests.
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Jarraya, I., BenSaid, F., Ouarda, W. et al. A new convolutional neural network based on a sparse convolutional layer for animal face detection. Multimed Tools Appl 82, 91–124 (2023). https://doi.org/10.1007/s11042-022-12610-y
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-022-12610-y