DOI: 10.1145/3672919.3672992
research-article

Lightweight multi-attribute target detection for dogs and cats based on improved YOLOv7

Published: 24 July 2024

Abstract

Multi-attribute target detection can recover both the location and the attributes of cats and dogs, providing a more effective means of pet management. However, it is difficult to locate dogs and cats quickly in complex environments while also identifying their fine-grained attributes accurately. To address these issues, this research introduces a streamlined and efficient two-stage method for multi-attribute target detection. First, a lightweight YOLOv7 network (YOLOv7-P) quickly determines the location of cats and dogs, which excludes the influence of background and other redundant information on classification. Second, a bilinear convolutional neural network (B-CNN) identifies the fine-grained attributes of the detected animals, improving attribute classification accuracy. YOLOv7-P simplifies the network by replacing the traditional 3×3 convolution block in the YOLOv7 ELAN structure with the PConv (partial convolution) module. It also incorporates a hierarchical adaptive channel pruning technique that identifies and removes unimportant filters within the most redundant layers, reducing redundant computation and memory access and thereby increasing detection speed. Compared with the original YOLOv7 model, YOLOv7-P has 17.02% fewer parameters and 26.44% fewer GFLOPs, and achieves a 9.50% higher FPS. The target regions localized by YOLOv7-P were fed into the B-CNN for training and classification, yielding classification accuracies of 98.92% for species and 92.08% for breed on the Oxford-IIIT Pet Dataset.
In summary, the proposed multi-attribute target detection method detects cats and dogs and identifies their attributes quickly and accurately in complex environments. The approach has considerable practical value and broad application prospects.
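To illustrate why swapping a dense 3×3 convolution for PConv reduces computation, the following is a minimal sketch of the per-layer multiply-accumulate count. It assumes a stride-1 convolution with equal input and output channels and a partial ratio of 1/4 (the fraction of channels that PConv actually convolves); the function names, the feature-map size, and the ratio are illustrative assumptions, not values taken from the YOLOv7-P configuration. The per-layer saving shown here is also not the network-level figure reported in the abstract, since only some layers are replaced.

```python
def conv_macs(h, w, k, c_in, c_out):
    """Multiply-accumulates of a dense k x k convolution on an h x w map
    (stride 1, 'same' padding, bias ignored)."""
    return h * w * k * k * c_in * c_out


def pconv_macs(h, w, k, channels, partial_ratio=0.25):
    """Multiply-accumulates of a partial convolution.

    Only the first `partial_ratio` share of the channels is convolved;
    the remaining channels are passed through untouched and cost nothing.
    The default ratio of 1/4 is an assumption made for this sketch.
    """
    c_p = int(channels * partial_ratio)
    return conv_macs(h, w, k, c_p, c_p)


if __name__ == "__main__":
    # Hypothetical layer: 40 x 40 feature map with 256 channels.
    h = w = 40
    c = 256
    dense = conv_macs(h, w, 3, c, c)
    partial = pconv_macs(h, w, 3, c)
    print(f"dense 3x3 conv : {dense:,} MACs")
    print(f"PConv (r = 1/4): {partial:,} MACs")
    print(f"reduction      : {dense / partial:.0f}x for this layer")
```

Because the cost scales with the square of the number of convolved channels, halving the ratio again would cut the per-layer cost by a further factor of four, at the price of mixing fewer channels spatially.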
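The fine-grained classification stage relies on a bilinear CNN, whose core operation is bilinear pooling: the outer product of two feature vectors is taken at every spatial location, summed over locations, and then normalized. The sketch below shows that operation in dependency-free Python on plain lists. It is an illustration of the general technique under stated assumptions, not the authors' implementation; the function name `bilinear_pool` and the toy inputs are hypothetical, and a real model would compute this on batched tensors from two convolutional feature extractors.

```python
import math


def bilinear_pool(feats_a, feats_b):
    """Bilinear pooling of two feature maps given as lists of per-location vectors.

    feats_a[l] has length M and feats_b[l] has length N for every location l.
    Returns a flattened, normalized descriptor of length M * N.
    """
    if not feats_a or len(feats_a) != len(feats_b):
        raise ValueError("both feature maps must cover the same non-empty set of locations")
    m, n = len(feats_a[0]), len(feats_b[0])

    # Sum of outer products over all spatial locations.
    pooled = [0.0] * (m * n)
    for fa, fb in zip(feats_a, feats_b):
        for i, a in enumerate(fa):
            for j, b in enumerate(fb):
                pooled[i * n + j] += a * b

    # Signed square root followed by L2 normalization.
    pooled = [math.copysign(math.sqrt(abs(v)), v) for v in pooled]
    norm = math.sqrt(sum(v * v for v in pooled))
    return [v / norm for v in pooled] if norm > 0 else pooled


if __name__ == "__main__":
    # Toy example: two locations, 2-D features from stream A, 1-D from stream B.
    descriptor = bilinear_pool([[1.0, 0.0], [0.0, 2.0]], [[3.0], [4.0]])
    print(descriptor)
```

In a pipeline like the one described, the crop produced by the detector would be passed through the feature extractors, and the resulting descriptor would feed a linear classifier for each attribute.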

Published In

CSAIDE '24: Proceedings of the 2024 3rd International Conference on Cyber Security, Artificial Intelligence and Digital Economy
March 2024
676 pages
ISBN: 9798400718212
DOI: 10.1145/3672919

Publisher

Association for Computing Machinery, New York, NY, United States
