NAS-WFPN: Neural Architecture Search Weighted Feature Pyramid Networks for Object Detection

Li, Xiaohan; Xie, Ziyan; Lai, Taotao; Zhao, Fusheng; Xu, Haiyin; Chen, Riqing

doi:10.1007/978-3-030-68884-4_32

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 12383)

Included in the following conference series:

International Conference on Security, Privacy and Anonymity in Computation, Communication and Storage

Abstract

As is well known, most convolutional neural architectures are designed manually. However, such manual designs do not yield optimal structures. To address this problem, we build on Weighted Feature Pyramid Networks (WFPN) and use a Gaussian kernel to calculate the weights, which gives a novel method called Neural Architecture Search Weighted Feature Pyramid Networks (NAS-WFPN). NAS-WFPN mainly consists of three parts (a top-down pathway, a bottom-up pathway, and lateral connections) that fuse features across different scales. Experimental results show that NAS-WFPN achieves higher accuracy than existing object detection methods. Specifically, NAS-WFPN improves accuracy by 2.3 AP over SSDLite with a MobileNetV2 model and reaches 49.1 AP, which exceeds NAS-FPN and Mask R-CNN.
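The abstract does not say how the Gaussian kernel enters the weight computation, so the following is only an illustrative sketch of one plausible reading: features from several pyramid levels are fused by a weighted sum whose weights come from a Gaussian kernel over the distance between levels. The function names `gaussian_weights` and `weighted_fuse`, the `sigma` parameter, and the choice of level distance as the kernel input are all assumptions made for illustration and are not taken from the paper.

```python
import math


def gaussian_weights(levels, target_level, sigma=1.0):
    """Return normalized Gaussian-kernel weights, one per pyramid level.

    Assumption: the kernel input is the distance between each level index
    and the target level. The paper's actual kernel input is not stated
    in the abstract.
    """
    raw = [
        math.exp(-((lvl - target_level) ** 2) / (2.0 * sigma ** 2))
        for lvl in levels
    ]
    total = sum(raw)
    return [w / total for w in raw]


def weighted_fuse(features, weights):
    """Element-wise weighted sum of equally sized feature vectors."""
    length = len(features[0])
    if any(len(f) != length for f in features):
        raise ValueError("all features must share one shape before fusion")
    return [
        sum(w * f[i] for w, f in zip(weights, features))
        for i in range(length)
    ]


# Toy features for pyramid levels 3, 4 and 5, assumed to be already
# resized to a common shape (flattened here to short vectors).
levels = [3, 4, 5]
features = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]

weights = gaussian_weights(levels, target_level=4, sigma=1.0)
fused = weighted_fuse(features, weights)
```

In this sketch the level closest to the target receives the largest weight, and the weights sum to one, so the fused feature stays on the same scale as its inputs. A smaller `sigma` concentrates the weight on the target level, while a larger one approaches plain averaging.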

References

Li, Z., Peng, C., Yu, G., et al.: Detnet: a backbone network for object detection. arXiv preprint arXiv:1804.06215 (2018)
Lin, T.Y., Dollár, P., Girshick, R., et al.: Feature pyramid networks for object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2017), pp. 2117–2125 (2017)
He, K., Zhang, X., Ren, S., et al.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2016), pp. 770–778 (2016)
Huang, G., Liu, Z., Van Der Maaten, L., et al.: Densely connected convolutional networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2017), pp. 4700–4708 (2017)
He, K., Gkioxari, G., Dollár, P., et al.: Mask R-CNN. In: Proceedings of the IEEE International Conference on Computer Vision (ICCV 2017), pp. 2961–2969 (2017)
Lin, T.Y., Goyal, P., Girshick, R., et al.: Focal loss for dense object detection. In: Proceedings of the IEEE International Conference on Computer Vision (ICCV 2017), pp. 2980–2988 (2017)
Liu, W., et al.: SSD: single shot MultiBox detector. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9905, pp. 21–37. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46448-0_2
Ghiasi, G., Lin, T.Y., Le, Q.V.: NAS-FPN: learning scalable feature pyramid architecture for object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2019), pp. 7036–7045 (2019)
Fu, C.Y., Liu, W., Ranga, A., et al.: DSSD: deconvolutional single shot detector. arXiv preprint arXiv:1701.06659 (2017)
Kong, T., Sun, F., Huang, W., Liu, H.: Deep feature pyramid reconfiguration for object detection. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11209, pp. 172–188. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01228-1_11
Kong, T., Sun, F., Yao, A., et al.: Ron: reverse connection with objectness prior networks for object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2017), pp. 5936–5944 (2017)
Kim, S.-W., Kook, H.-K., Sun, J.-Y., Kang, M.-C., Ko, S.-J.: Parallel feature pyramid network for object detection. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11209, pp. 239–256. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01228-1_15
Woo, S., Hwang, S., Kweon, I.S.: Stairnet: Top-down semantic aggregation for accurate one shot detection. In: IEEE Winter Conference on Applications of Computer Vision (WACV 2018), pp. 1093–1102 (2018)
Kim, Y., Kang, B.-N., Kim, D.: SAN: learning relationship between convolutional features for multi-scale object detection. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11209, pp. 328–343. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01228-1_20
Yu, F., Wang, D., Shelhamer, E., et al.: Deep layer aggregation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2018), pp. 2403–2412 (2018)
Zhang, S., Wen, L., Bian, X., et al.: Single-shot refinement neural network for object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2018), pp. 4203–4212 (2018)
Zhou, P., Ni, B., Geng, C., et al.: Scale-transferrable object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2018), pp. 528–537 (2018)
Zoph, B., Le, Q.V.: Neural architecture search with reinforcement learning. arXiv preprint arXiv:1611.01578 (2016)
Sandler, M., Howard, A., Zhu, M., et al.: Mobilenetv2: inverted residuals and linear bottlenecks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2018), pp. 4510–4520 (2018)
Real, E., Aggarwal, A., Huang, Y., et al.: Regularized evolution for image classifier architecture search. In: Proceedings of the AAAI Conference on Artificial Intelligence (AAAI 2019), pp. 4780–4789 (2019)
Girshick, R.: Fast R-CNN. In: Proceedings of the IEEE International Conference on Computer Vision (ICCV 2015), pp. 1440–1448 (2015)
Ren, S., He, K., Girshick, R., et al.: Faster R-CNN: towards real-time object detection with region proposal networks. In: Advances in Neural Information Processing Systems, pp. 91–99 (2015)

Acknowledgements

This work was supported in part by the National Natural Science Foundation of China under Grants 61972093 and 61702101, and in part by the Young and Middle-aged Teachers Education and Research Project in Fujian Province under Grant JAT170477.

Author information

Authors and Affiliations

School of Computer Science, Fujian Agriculture and Forestry University, Fuzhou, 350002, China
Xiaohan Li, Ziyan Xie & Riqing Chen
School of Computer and Control Engineering, Minjiang University, Fuzhou, 350108, China
Taotao Lai
School of Mathematics and Computer Science, Quanzhou Normal University, Quanzhou, 362000, China
Fusheng Zhao
Department of Information Engineering, Hebei Vocational and Technical College of Building Materials, Qinhuangdao, 066000, China
Haiyin Xu

Corresponding author

Correspondence to Riqing Chen.

Editor information

Editors and Affiliations

Guangzhou University, Guangzhou, China
Guojun Wang
Nanjing University of Aeronautics and Astronautics, Nanjing, China
Bing Chen
Computer Science, Georgia State University, Atlanta, GA, USA
Wei Li
College of Science and Engineering, Qatar Foundation Education City, Doha, Qatar
Roberto Di Pietro
Nanjing University of Aeronautics and Astronautics, Nanjing, China
Xuefeng Yan
Nanjing University of Aeronautics and Astronautics, Nanjing, China
Hao Han

About this paper

Cite this paper

Li, X., Xie, Z., Lai, T., Zhao, F., Xu, H., Chen, R. (2021). NAS-WFPN: Neural Architecture Search Weighted Feature Pyramid Networks for Object Detection. In: Wang, G., Chen, B., Li, W., Di Pietro, R., Yan, X., Han, H. (eds) Security, Privacy, and Anonymity in Computation, Communication, and Storage. SpaCCS 2020. Lecture Notes in Computer Science, vol 12383. Springer, Cham. https://doi.org/10.1007/978-3-030-68884-4_32

DOI: https://doi.org/10.1007/978-3-030-68884-4_32
Published: 07 February 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-68883-7
Online ISBN: 978-3-030-68884-4
eBook Packages: Computer Science, Computer Science (R0)