Abstract
Non-maximum suppression (NMS) is an integral final step of object detection. The traditional NMS algorithm sorts the detection boxes by their class scores: the box with the maximum score is always selected, while every other box that sufficiently overlaps a preserved box is discarded. This strategy is simple and effective. However, the process still leaves room for improvement, because the algorithm makes a ‘hard’ decision (accept or reject) for each box. In this paper, we formulate non-maximum suppression as a rescoring process and construct a network called NmsNet, which uses graph convolution and a self-attention mechanism to predict whether each box is an object or a redundant detection. We evaluate our method on the VOC2007 dataset. The experimental results show that our method achieves a higher mAP than both traditional greedy NMS and Soft-NMS.
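For concreteness, the ‘hard’ accept-or-reject behaviour of greedy NMS, and the soft rescoring it is contrasted with, can be sketched as follows. This is an illustrative NumPy sketch of the two baseline algorithms the abstract compares against, not the paper's implementation; the function names and default thresholds (`iou_thresh`, `sigma`, `score_thresh`) are assumptions for the example.

```python
import numpy as np

def iou(box, boxes):
    """IoU between one box and an array of boxes, all as [x1, y1, x2, y2]."""
    x1 = np.maximum(box[0], boxes[:, 0])
    y1 = np.maximum(box[1], boxes[:, 1])
    x2 = np.minimum(box[2], boxes[:, 2])
    y2 = np.minimum(box[3], boxes[:, 3])
    inter = np.clip(x2 - x1, 0, None) * np.clip(y2 - y1, 0, None)
    area = (box[2] - box[0]) * (box[3] - box[1])
    areas = (boxes[:, 2] - boxes[:, 0]) * (boxes[:, 3] - boxes[:, 1])
    return inter / (area + areas - inter)

def greedy_nms(boxes, scores, iou_thresh=0.5):
    """Classic 'hard' NMS: keep the top-scoring box, discard all overlaps."""
    order = np.argsort(scores)[::-1]          # indices, highest score first
    keep = []
    while order.size > 0:
        i = order[0]
        keep.append(i)
        if order.size == 1:
            break
        rest = order[1:]
        overlaps = iou(boxes[i], boxes[rest])
        order = rest[overlaps <= iou_thresh]  # hard accept/reject decision
    return keep

def soft_nms(boxes, scores, sigma=0.5, score_thresh=0.001):
    """Soft-NMS (Gaussian variant): decay overlapping scores instead of
    discarding the boxes outright."""
    scores = scores.copy()
    keep = []
    idx = np.arange(len(scores))
    while idx.size > 0:
        i = idx[np.argmax(scores[idx])]       # current best remaining box
        keep.append(i)
        idx = idx[idx != i]
        if idx.size == 0:
            break
        overlaps = iou(boxes[i], boxes[idx])
        scores[idx] *= np.exp(-(overlaps ** 2) / sigma)  # soft rescoring
        idx = idx[scores[idx] > score_thresh]
    return keep
```

Greedy NMS removes every sufficiently overlapping box in one irreversible step, whereas Soft-NMS only decays their scores; NmsNet pushes the rescoring view further by learning the rescoring function itself from the boxes' relations.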
Acknowledgement
This work was supported in part by the National Natural Science Foundation of China under Grants 61771145 and 61371148.