Joint deep separable convolution network and border regression reinforcement for object detection

Quan, Yu; Li, Zhixin; Chen, Shengjia; Zhang, Canlong; Ma, Huifang

doi:10.1007/s00521-020-05255-1

Joint deep separable convolution network and border regression reinforcement for object detection

Original Article
Published: 05 August 2020

Volume 33, pages 4299–4314, (2021)
Cite this article

Neural Computing and Applications Aims and scope Submit manuscript

Yu Quan¹,
Zhixin Li ORCID: orcid.org/0000-0002-5313-6134¹,
Shengjia Chen¹,
Canlong Zhang¹ &
…
Huifang Ma²

431 Accesses
9 Citations
Explore all metrics

Abstract

The improvement of object detection performance mainly depends on the extraction of local information near the target area of interest, which is also the main reason for the lack of feature semantic information. Considering the importance of scene and semantic information for visual recognition, in this paper, the improvement of the object detection algorithm is realized from three parts. Firstly, the basic residual convolution module is fused with the separable convolution module to construct a depth-wise separable convolution network (D_SCNet-127 R-CNN). Then, the feature map is sent to the scene-level region proposal self-attention network to re-identify the candidate area. This part is composed of three parallel branches: semantic segmentation module, region proposal network, and region proposal self-attention module. Finally, this paper uses deep reinforcement learning combined with a border regression network to achieve precise location of the object, and improve the calculation speed of the entire model through a light-weight head network. This model can effectively solve the limitation of feature extraction in traditional object detection and obtain more comprehensive detailed features. The experimental on MSCOCO17, Pascal VOC07, and Cityscapes datasets shows that the proposed method has good validity and scalability.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

LKC-Net: large kernel convolution object detection network

Article Open access 12 June 2023

Rich Features and Precise Localization with Region Proposal Network for Object Detection

Depthwise grouped convolution for object detection

Article 13 September 2021

References

Caicedo JC, Lazebnik S (2015) Active object localization with deep reinforcement learning. In: Proceedings of the IEEE international conference on computer vision, pp 2488–2496
Dai J, Li Y, He K, Sun J (2016) R-fcn: object detection via region-based fully convolutional networks. In: Advances in neural information processing systems, pp 379–387
Deng Z, Li K, Zhao Q, Zhang Y, Chen H (2017) Effective face landmark localization via single deep network. arXiv:1702.02719
Fan H, Ling H (2019) Siamese cascaded region proposal networks for real-time visual tracking. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7952–7961
Fu J, Liu J, Tian H, Li Y, Bao Y, Fang Z, Lu H (2019) Dual attention network for scene segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 3146–3154
Ghiasi G, Lin T-Y, Le QV (2019) Nas-fpn: learning scalable feature pyramid architecture for object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7036–7045
Girshick R (2015) Fast r-cnn. In: Proceedings of the IEEE international conference on computer vision, pp 1440–1448
Girshick R, Donahue J, Darrell T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 580–587
Guo P, Xie G, Li R (2019) Object detection using multiview cca-based graph spectral learning. J Circuits Syst Comput (4) 29:2050022
Article Google Scholar
He K, Gkioxari G, Dollár P, Girshick R (2017) Mask r-cnn. In: Proceedings of the IEEE international conference on computer vision, pp 2961–2969
Janai J, Güney F, Behl A, Geiger A (2017) Computer vision for autonomous vehicles: problems, datasets and state-of-the-art. arXiv:1704.05519
Jiang H, Cheng MM, Li SJ, Borji A, Wang J (2019) Joint salient object detection and existence prediction. Front Comput Sci 13(4):778–788
Article Google Scholar
Kirillov A, Girshick R, He K, Dollár P (2019) Panoptic feature pyramid networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 6399–6408
Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. In: Advances in neural information processing systems, pp 1097–1105
Lin K, Yang H-F, Hsiao J-H, Chen C-S (2015) Deep learning of binary hash codes for fast image retrieval. In: Proceedings of the IEEE conference on computer vision and pattern recognition workshops, pp 27–35
Lin T-Y, Dollár P, Girshick R, He K, Hariharan B, Belongie S (2017) Feature pyramid networks for object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2117–2125
Liu W, Anguelov D, Erhan D, Szegedy C, Reed S, Fu C-Y, Berg AC (2016) Ssd: single shot multibox detector. In: European conference on computer vision, pp 21–37. Springer
Liu Y, Wang R, Shan S, Chen X (2018) Structure inference net: object detection using scene-level context and instance-level relationships. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 6985–6994
Long J, Shelhamer E, Darrell T (2015) Fully convolutional networks for semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 3431–3440
Mathe S, Pirinen A, Sminchisescu C (2016) Reinforcement learning for visual object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2894–2902
Purkait P, Zhao C, Zach C (2017) Spp-net: deep absolute pose regression with synthetic views. arXiv:1712.03452
Quan Y, Li Z, Zhang F, Zhang C (2019) D\_dnet-65 r-cnn: object detection model fusing deep dilated convolutions and light-weight networks. In: Pacific rim international conference on artificial intelligence. Springer, pp 16–28
Redmon J, Divvala S, Girshick R, Farhadi A (2016) You only look once: unified, real-time object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 779–788
Ren S, He K, Girshick R, Sun J (2015) Faster r-cnn: towards real-time object detection with region proposal networks. In: Advances in neural information processing systems, pp 91–99
Shi J, Malik J (2000) Normalized cuts and image segmentation. IEEE Trans Pattern Anal Mach Intell 22(8):888–905
Article Google Scholar
Steder B, Rusu RB, Konolige K, Burgard W (2011) Point feature extraction on 3d range scans taking into account object boundaries. In: 2011 IEEE international conference on robotics and automation. IEEE, pp 2601–2608
Wang J, Chen K, Yang S, Loy CC, Lin D (2019) Region proposal by guided anchoring. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2965–2974
Wang X, Han TX, Yan S (2009) An hog-lbp human detector with partial occlusion handling. In: 2009 IEEE 12th international conference on computer vision. IEEE, pp 32–39
Ye Y, Zhang C, Hao X (2019) Arpnet: attention region proposal network for 3d object detection. Sci China Inf Sci 62(12):220104
Article Google Scholar
Zhang H, Li D, Ji Y, Zhou H, Wu W, Liu K (2019) Towards new retail: a benchmark dataset for smart unmanned vending machines. IEEE Trans Ind Inform 15:1–10
Article Google Scholar

Download references

Acknowledgements

This work is supported by the National Natural Science Foundation of China (Nos. 61966004, 61663004, 61866004, 61762078), the Guangxi Natural Science Foundation (Nos. 2019GXNSFDA245018, 2018GXNSFDA281009, 2017GXNSFAA198365), the Guangxi “Bagui Scholar” Teams for Innovation and Research Project, the Guangxi Talent Highland Project of Big Data Intelligence and Application, Guangxi Collaborative Innovation Center of Multi-source Information Integration and Intelligent Processing.

Author information

Authors and Affiliations

Guangxi Key Lab of Multi-Source Information Mining and Security, Guangxi Normal University, Guilin, 541004, China
Yu Quan, Zhixin Li, Shengjia Chen & Canlong Zhang
College of Computer Science and Engineering, Northwest Normal University, Lanzhou, 730070, China
Huifang Ma

Authors

Yu Quan
View author publications
You can also search for this author in PubMed Google Scholar
Zhixin Li
View author publications
You can also search for this author in PubMed Google Scholar
Shengjia Chen
View author publications
You can also search for this author in PubMed Google Scholar
Canlong Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Huifang Ma
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Zhixin Li.

Ethics declarations

Conflict of interest

We declare that we have no financial and personal relationships with other people or organizations that can inappropriately influence our work. There is no professional or other personal interest of any nature or kind in any product, service and/or company that could be construed as influencing the position presented in, or the review of, the manuscript entitled.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Quan, Y., Li, Z., Chen, S. et al. Joint deep separable convolution network and border regression reinforcement for object detection. Neural Comput & Applic 33, 4299–4314 (2021). https://doi.org/10.1007/s00521-020-05255-1

Download citation

Received: 13 January 2020
Accepted: 27 July 2020
Published: 05 August 2020
Issue Date: May 2021
DOI: https://doi.org/10.1007/s00521-020-05255-1

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Joint deep separable convolution network and border regression reinforcement for object detection

Abstract

Access this article

Similar content being viewed by others

LKC-Net: large kernel convolution object detection network

Rich Features and Precise Localization with Region Proposal Network for Object Detection

Depthwise grouped convolution for object detection

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Joint deep separable convolution network and border regression reinforcement for object detection

Abstract

Access this article

Similar content being viewed by others

LKC-Net: large kernel convolution object detection network

Rich Features and Precise Localization with Region Proposal Network for Object Detection

Depthwise grouped convolution for object detection

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation