Abstract
The research field of object detection has been a hotspot in computer vision. However, most of the one-stage lightweight object detection models based on the deep convolutional neural network have the problems of many parameters. To address this problem, this paper proposes a new model named Fusion Shuffle Light Detector (FSLDet). First, based on the FSSD mode, we apply the improved lightweight Shufflenet V2 network to the FSSD model for feature extraction, where the improvement about ShuffleNet v2 is an adjustment for the network structure. Meanwhile, we adopt the bidirectional feature pyramid model to improve the feature fusion operation, which makes the fused features have more semantic information. Experiments were carried out on PASCAL VOC 2007 + 2012 dataset and helmet detection dataset. The experiment shows that the FSLDet model is superior to the state-of-the-art model in multiple evaluation criteria.
Similar content being viewed by others
References
Chen P, Ho S (2016) Is overfeat useful for image-based surface defect classification tasks?[C]//Proceedings of the 2016 IEEE Inter-national Conference on Image Processing (ICIP), Phoenix, AZ, USA : 749-753
Felzenszwalb P, McAllester D, Ramanan D (2008) A discriminatively trained, multiscale, deformable part Model[c]//proceedings of the 2008 IEEE Conference on Computer Vision and Pattern Recognition, Anchorage, AK, USA: pp 1–8
Fu C, Zhu S, Chen H et al (2019) Simbnn: A similarity-aware binarized neural network acceleration framework[C]//Proceedings of the 2019 IEEE 27th Annual International Symposium on Field-Programmable Custom Com-puting Machines (FCCM), San Diego, CA, USA. 319-319
Girshick R, Donahue J, Darrell T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation[C]//Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA: 580–587
He K, Gkioxari G, Dollar P et al (2020) Mask r-cnn[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence 42(2):386–397
He Y, Zhang X, Sun J (2017) Channel pruning for accelerating very deep neural networks[C]//Proceedings of the 2017 IEEE In-ternational Conference on Computer Vision (ICCV), Venice Italy, 1398–1406
Krizhevsky A, Sutskever I, Hinton G (2012) Imagenet classification with deep convolutional neural networks[C]//Proceedings of the 25th International Conference on Neural Information Processing Systems, Red Hook, NY, USA: 1097..C1105
Law H, Deng J (2018) Cornernet: Detecting objects as paired keypoint[C]//European Conference on Computer Vision; Springer: New York, NY, USA. 765-781
Liu W, Anguelov D, Erhan D et al (2016) Ssd: Single shot multibox detector European[C]//European Conference on Computer Vision. 21-37
Liu S, Huang D, Wang Y (2018) Receptive field block net for accurate and fast object detection[C]//European Conference on Computer Vision; Springer: New York, NY, USA. 404-419
Liu S, Qi L, Qin H et al (2018) Path aggregation network for instance segmentation[C]//Proceedings of 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA. 8759-8768
Lin T, Dollar P, Girshick R, et al. (2017) Feature pyramid networks for object detection[C]//Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA. 936–944
Ma N, Zhang X, Zheng H et al (2018) Shufflenet v2: Practical guidelines for efficient cnn architecture design[C]//European Conference on Computer Vision; Springer: New York, NY, USA. 122-138
Pang J, Chen K, Shi J et al (2019) Libra r-cnn: Towards balanced learning for object detection[C]//Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA. 821-830
Redmon J, Farhadi A (2017) Yolo9000: Better, faster, stronger. Inproceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA. 6517–6525
Redmon J, Divvala S, Girshick R, Farhadi A (2016) You only look once: Unified, real-time object detection[C]//Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA: 779-788
Ren S, He K, Girshick R, Sun J (2015) Faster r-cnn: Towards real-time object detection with region proposal networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence 39(6):1137–1149
Shen Z, Liu Z, Li J et al (2017) Dsod: Learning deeply supervised object detectors from scratch[C]//Proceedings of the 2017 IEEE International Conference on Computer Vision (ICCV), Venice, Italy. 1937-1945
Sinha D, El-Sharkawy M (2019) Thin mobilenet: An enhanced mobilenet architecture[C]//Proceedings of 2019 IEEE 10th Annual Ubiquitous Computing, Electronics & Mobile Communication Conference (UEMCON), New York City, NY, USA. 0280-0285
Tan M, Pang R, Le QV (2020) Efficientdet: Scalable and efficient object detection[C]//Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA. 10778-10787
Tian Z, Shen C, Chen H et al (2019) Fcos: Fully convolutional one-stage object detection[C]//Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision (ICCV), Seoul, Korea (South). 9626-9635
Viola P, AJones M (2001) Rapid object detection using a boosted cascade of simple features[C]//Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Kauai, HI, USA, Dec 8-14: I-I
Wang S, Wu L, Wu W et al (2019) Optical fiber defect detection method based on dssd network[C]//Proceedings of 2019 IEEE International Conference on Smart Internet of Things (SmartIoT), Tianjin, China. 422-426
Won J, Lee D, Lee K, Lin C (2019) An improved yolov3-based neural network for de-identification technology[C]//Proceedings of 2019 34th International Technical Conference on Circuits/Systems, Computers and Communications (ITC-CSCC), JeJu, Korea (South). 1-2
Yan L, Sheng M, Wang C et al (2021) Hybrid neural networks based facial expression recognition for smart city[J]. Multimedia Tools and Applications, 1 - 24
Yan L, Fu J, Wang C et al (2021) Enhanced network optimized generative adversarial network for image enhancement[J]. Multimedia Tools and Applications 80(9):14363–14381
Zhai M, Liu J, Zhang W et al (2019) Multi-scale feature fusion single shot object detector based on densenet[C]//International Conference on Intelligent Robotics and Applications; Springer: New York, NY, USA. 450-460
Zhang X, Zhou X, Lin M, Sun J (2018) Shufflenet: An extremely efficient convolutional neural network for mobile devices[C]// Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA. 6848-6856
Zhang S, Chi C, Yao Y et al (2020) Bridging the gap between anchor-based and anchor-free detection via adaptive training sample selection[C]//Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA. 9756-9765
Acknowledgements
Acknowledgements This work is funded by the National Natural Science Foundation of China under Grant No.61772180, the Key R&D plan of Hubei Province (2020BHB004,2020BAB012) and Natural Science Foundation of Hubei Province No.2020CFB798.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Wang, C., Wang, Z., Li, K. et al. Lightweight object detection model fused with feature pyramid. Multimed Tools Appl 82, 601–618 (2023). https://doi.org/10.1007/s11042-022-12127-4
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-022-12127-4