skip to main content
10.1145/3581807.3581821acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiccprConference Proceedingsconference-collections
research-article

Object Detection Algorithm Based on Coordinate Attention and Context Feature Enhancement

Published: 22 May 2023 Publication History

Abstract

In recent years, object detection has been widely used in various fields such as face detection, remote sensing image detection and pedestrian detection. Due to the complex environment in the actual scene, we need to fully obtain the feature information in the image to improve the accuracy of object detection. This paper proposes an object detection algorithm based on coordinate attention and contextual feature enhancement. We design a multi-scale attention feature pyramid network, which first uses multi-branch atrous convolution to capture multi-scale context information, and then fuses the coordinate attention mechanism to embed location information into channel attention, and finally uses a bidirectional feature pyramid structure to effectively fuse high-level features and low-level features. We also adopt the GIoU loss function to further improve the accuracy of object detection. The experimental results show that the proposed method has certain advantages compared with other detection algorithms in the PASCAL VOC datasets.

References

[1]
LeCun, Y., Bengio, Y., & Hinton, G. 2015. Deep learning. nature, 521(7553), 436-444.
[2]
Zou, Z., Shi, Z., Guo, Y., and Ye, J. 2019. Object detection in 20 years: A survey. arXiv preprint arXiv:1905.05055.
[3]
Agarwal, S., Terrail, J. O. D., and Jurie, F. 2018. Recent advances in object detection in the age of deep convolutional neural networks. arXiv preprint arXiv:1809.03193.
[4]
Girshick, R., Donahue, J., Darrell, T., and Malik, J. 2014. Rich feature hierarchies for accurate object detection and semantic segmentation. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 580-587.
[5]
He, K., Zhang, X., Ren, S., and Sun, J. 2015. Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE transactions on pattern analysis and machine intelligence, 37(9), 1904-1916.
[6]
Girshick, R. 2015. Fast r-cnn. In Proceedings of the IEEE international conference on computer vision, pp. 1440-1448.
[7]
Ren, S., He, K., Girshick, R., and Sun, J. 2015. Faster r-cnn: Towards real-time object detection with region proposal networks. Advances in neural information processing systems, 28.
[8]
Lin, T. Y., Dollár, P., Girshick, R., He, K., Hariharan, B., and Belongie, S. 2017. Feature pyramid networks for object detection. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 2117-2125.
[9]
He, K., Gkioxari, G., Dollár, P., and Girshick, R. 2017. Mask r-cnn. In Proceedings of the IEEE international conference on computer vision, pp. 2961-2969.
[10]
Cai, Z., and Vasconcelos, N. 2018. Cascade r-cnn: Delving into high quality object detection. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 6154-6162.
[11]
Pang, J., Chen, K., Shi, J., Feng, H., Ouyang, W., and Lin, D. 2019. Libra r-cnn: Towards balanced learning for object detection. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 821-830.
[12]
Redmon, J., Divvala, S., Girshick, R., and Farhadi, A. 2016. You only look once: Unified, real-time object detection. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 779-788.
[13]
Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C. Y., and Berg, A. C. 2016, October. Ssd: Single shot multibox detector. In European conference on computer vision, pp. 21-37. Springer, Cham.
[14]
Redmon, J., and Farhadi, A. 2017. YOLO9000: better, faster, stronger. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 7263-7271.
[15]
Redmon, J., and Farhadi, A. 2018. Yolov3: An incremental improvement. arXiv preprint arXiv:1804.02767.
[16]
Lin, T. Y., Goyal, P., Girshick, R., He, K., and Dollár, P. 2017. Focal loss for dense object detection. In Proceedings of the IEEE international conference on computer vision, pp. 2980-2988.
[17]
Liu, S., and Huang, D. 2018. Receptive field block net for accurate and fast object detection. In Proceedings of the European conference on computer vision (ECCV), pp. 385-400.
[18]
Zhao, Q., Sheng, T., Wang, Y., Tang, Z., Chen, Y., Cai, L., and Ling, H. 2019, July. M2det: A single-shot object detector based on multi-level feature pyramid network. In Proceedings of the AAAI conference on artificial intelligence, Vol. 33, No. 01, pp. 9259-9266.
[19]
Tian, Z., Shen, C., Chen, H., and He, T. 2019. Fcos: Fully convolutional one-stage object detection. In Proceedings of the IEEE/CVF international conference on computer vision, pp. 9627-9636.
[20]
Tan, M., Pang, R., and Le, Q. V. 2020. Efficientdet: Scalable and efficient object detection. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 10781-10790.
[21]
Chen, L. C., Papandreou, G., Schroff, F., and Adam, H. 2017. Rethinking atrous convolution for semantic image segmentation. arXiv preprint arXiv:1706.05587.
[22]
Hou, Q., Zhou, D., and Feng, J. 2021. Coordinate attention for efficient mobile network design. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 13713-13722.
[23]
Liu, S., Qi, L., Qin, H., Shi, J., and Jia, J. 2018. Path aggregation network for instance segmentation. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 8759-8768.
[24]
Rezatofighi, H., Tsoi, N., Gwak, J., Sadeghian, A., Reid, I., and Savarese, S. 2019. Generalized intersection over union: A metric and a loss for bounding box regression. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 658-666.

Cited By

View all
  • (2023)ECGYOLO: Mask Detection AlgorithmApplied Sciences10.3390/app1313750113:13(7501)Online publication date: 25-Jun-2023
  • (2023)Small Object Detection via a Dense Connection and Feature Enhancement Network2023 9th International Conference on Computer and Communications (ICCC)10.1109/ICCC59590.2023.10507587(1760-1765)Online publication date: 8-Dec-2023

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
ICCPR '22: Proceedings of the 2022 11th International Conference on Computing and Pattern Recognition
November 2022
683 pages
ISBN:9781450397056
DOI:10.1145/3581807
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 22 May 2023

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Atrous convolution
  2. Coordinate attention
  3. Feature pyramid network
  4. Object detection

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Funding Sources

  • National Natural Science Foundation of China
  • Natural Science Foundation of Guangxi
  • Development Foundation of the 54th Research Institute of China Electronics Technology Group Corporation
  • Guilin Science and Technology Development Program

Conference

ICCPR 2022

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)12
  • Downloads (Last 6 weeks)2
Reflects downloads up to 20 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2023)ECGYOLO: Mask Detection AlgorithmApplied Sciences10.3390/app1313750113:13(7501)Online publication date: 25-Jun-2023
  • (2023)Small Object Detection via a Dense Connection and Feature Enhancement Network2023 9th International Conference on Computer and Communications (ICCC)10.1109/ICCC59590.2023.10507587(1760-1765)Online publication date: 8-Dec-2023

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media