MDCYOLO: Improved YOLOv5 Algorithm with Modified Deformable Convolution

Authors:

Kun Chen,

Hongqiang Wang

ICMLCA '23: Proceedings of the 2023 4th International Conference on Machine Learning and Computer Application

Pages 866 - 872

https://doi.org/10.1145/3650215.3650367

Published: 16 April 2024

Abstract

Convolution is the fundamental operation for feature extraction in deep neural networks. However, a traditional convolution uses a fixed-shape kernel, so every point on the output feature map has a fixed receptive field, which limits adaptability to irregularly shaped objects. Because mainstream deep learning models rely on convolution for feature extraction, they tend to perform poorly on data sets whose objects vary widely in shape and size. The Deformable Convolutional Networks (DCN) line of work computes an offset for each feature-map point that participates in the convolution, thereby changing the shape of the receptive field. However, DCN does not consider adjusting the feature map weights. We therefore propose a modified deformable convolution (MDC) that builds on Deformable ConvNets v2 (DCNv2) and adds a mask to adjust the feature-map weights, so that both the shape and the weight of the features participating in the convolution are adjusted simultaneously. We then apply MDC to YOLOv5 and name the improved model MDCYOLO. Experimental results show that MDC achieves significantly higher detection accuracy than DCNv2, with final improvements of 1.9% on the Pascal VOC data set and 3.1% on the COCO data set.
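
To make the mechanism in the abstract concrete, the following is a minimal sketch of how a single output value of a 3x3 deformable convolution with a modulation mask can be computed: each kernel tap is shifted by a learned offset, sampled bilinearly, and scaled by a mask value. It follows the general DCNv2-style formulation only; the abstract does not give the details of MDC, so this is not the authors' implementation. The function names (`bilinear`, `modulated_deform_conv_at`) and the toy inputs are assumptions made for illustration, and in a real network the offsets and mask would be predicted by an extra convolution rather than passed in by hand.

```python
import math


def bilinear(feat, y, x):
    """Sample a 2-D feature map (list of rows) at a fractional (y, x).

    Locations outside the map contribute zero.
    """
    h, w = len(feat), len(feat[0])
    y0, x0 = math.floor(y), math.floor(x)
    value = 0.0
    for yy in (y0, y0 + 1):
        for xx in (x0, x0 + 1):
            if 0 <= yy < h and 0 <= xx < w:
                weight = (1 - abs(y - yy)) * (1 - abs(x - xx))
                value += weight * feat[yy][xx]
    return value


def modulated_deform_conv_at(feat, kernel, offsets, mask, cy, cx):
    """Compute one output value of a 3x3 modulated deformable convolution.

    offsets: nine (dy, dx) pairs, one per kernel tap (changes the shape).
    mask:    nine scalars, one per kernel tap (changes the weight).
    """
    out = 0.0
    k = 0
    for dy in (-1, 0, 1):
        for dx in (-1, 0, 1):
            off_y, off_x = offsets[k]
            sample = bilinear(feat, cy + dy + off_y, cx + dx + off_x)
            out += kernel[dy + 1][dx + 1] * sample * mask[k]
            k += 1
    return out


# Toy 5x5 feature map with feat[y][x] = 5*y + x, and an all-ones kernel.
feat = [[5 * y + x for x in range(5)] for y in range(5)]
kernel = [[1.0] * 3 for _ in range(3)]
zero_offsets = [(0.0, 0.0)] * 9

# Zero offsets and an all-ones mask reduce to an ordinary convolution.
print(modulated_deform_conv_at(feat, kernel, zero_offsets, [1.0] * 9, 2, 2))  # 108.0
# Halving every mask value halves the response without moving the samples.
print(modulated_deform_conv_at(feat, kernel, zero_offsets, [0.5] * 9, 2, 2))  # 54.0
```

With zero offsets and a mask of ones the operation reduces to a standard convolution, which is why the offsets and the mask can be read as two independent adjustments: one to where the kernel samples, the other to how much each sample counts.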

Index Terms

1. Computing methodologies
  1. Machine learning

Published In

ICMLCA '23: Proceedings of the 2023 4th International Conference on Machine Learning and Computer Application

October 2023

1065 pages

ISBN:9798400709449

DOI:10.1145/3650215

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

Research-article
Research
Refereed limited

Funding Sources

National Natural Science Foundation of China

Conference

ICMLCA 2023

ICMLCA 2023: 2023 4th International Conference on Machine Learning and Computer Application

October 27 - 29, 2023

Hangzhou, China
