Skip to main content
Log in

High-density pedestrian detection algorithm based on deep information fusion

  • Published:
Applied Intelligence Aims and scope Submit manuscript

Abstract

In order to improve the accuracy of high-density population detection, a high density pedestrian detection algorithm (YOLOv4-HDPD) is proposed based on deep information fusion. By increasing the connection points of cross-layer fusion, high-level semantic information is further integrated with feature information. The improved Iterative Self-Organizing Data Analysis algorithm (ISODATA) makes the anchor value more suitable for the network model without increasing the number of parameters. Moreover, the network anti-interference ability is increased by replacing the CIOU algorithm target detection object. Compared with the original network, the YOLOv4-HDPD network has improved in mAP and avgIOU. Under the premise that the detection speed of the network is basically not affected, mAP is increased by 5.28% and avgIOU is increased by 5.73%. In terms of the current results, the network algorithm has been improved the detection effect of high-density pedestrians. At the same time, the network provides a new idea for solving the clustering and detection of dense targets in real scenes.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

References

  1. Yu H, Li G, Zhang W, Huang Q, Du D, Tian Q, Sebe N (2020) The unmanned aerial vehicle benchmark: Object detection, tracking and baseline. Int J Comput Vis 128(5):1141–1159

    Article  Google Scholar 

  2. Liang H, Fu Y, Gao J (2021) Bio-inspired self-organized cooperative control consensus for crowded uuv swarm based on adaptive dynamic interaction topology. Appl Intell, pp 1–18

  3. Yan J, Pu W, Zhou S, Liu H, Bao Z (2020) Collaborative detection and power allocation framework for target tracking in multiple radar system. Information Fusion 55:173–183

    Article  Google Scholar 

  4. Hu Z, Wei Z, Ma X, Sun H, Yang J (2020) Multi-parameter deep-perception and many-objective autonomous-control of rolling schedule on high speed cold tandem mill. ISA transactions 102:193–207

    Article  Google Scholar 

  5. Hu Z, Wei Z, Sun H, Yang J, Wei L (2021) Optimization of metal rolling control using soft computing approaches: a review. Archives of Computational Methods in Engineering 28(2):405–421

    Article  Google Scholar 

  6. Li B, Xu W, Xu Z, Li J, Peng P (2021) A two-domain coordinated sentence similarity scheme for question-answering robots regarding unpredictable outliers and non-orthogonal categories. Appl Intell, pp 1–17

  7. Li B, Yan J, Wu W, Zhu Z, Hu X (2018) High performance visual tracking with siamese region proposal network. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 8971–8980

  8. Girshick R, Donahue J, Darrell T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 580–587

  9. Lu X, Li B, Yue Y, Li Q, Yan J (2019) Grid r-cnn. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 7363–7372

  10. Bochkovskiy A, Wang C-Y, Liao H-Y M (2020) Yolov4: Optimal speed and accuracy of object detection. arXiv:2004.10934

  11. Lin T-Y, Maire M, Belongie S, Hays J, Perona P, Ramanan D, Dollár P, Zitnick CL (2014) Microsoft coco: Common objects in context. In: European conference on computer vision, Springer, pp 740–755

  12. Shao S, Zhao Z, Li B, Xiao T, Yu G, Zhang X, Sun J (2018) Crowdhuman: A benchmark for detecting human in a crowd. arXiv:1805.00123

  13. Xia S, Peng D, Meng D, Zhang C, Wang G, Giem E, Wei W, Chen Z (2020) A fast adaptive k-means with no bounds. IEEE Transactions on Pattern Analysis and Machine Intelligence

  14. Ahmed M, Seraj R, Islam SMS (2020) The k-means algorithm: A comprehensive survey and performance evaluation. Electronics 9(8):1295

    Article  Google Scholar 

  15. Mydhili SK, Periyanayagi S, Baskar S, Shakeel PM, Hariharan PR (2020) Machine learning based multi scale parallel k-means++ clustering for cloud assisted internet of things. Peer-to-Peer Networking and Applications 13(6):2023–2035

    Article  Google Scholar 

  16. Choo D, Grunau C, Portmann J, Rozhon V (2020) k-means++: few more steps yield constant approximation. In: International Conference on Machine Learning, PMLR, pp 1909–1917

  17. Abbas AW, Minallh N, Ahmad N, Abid SAR, Khan MAA (2016) K-means and isodata clustering algorithms for landcover classification using remote sensing. Sindh University Research Journal-SURJ (Science Series), 48(2)

  18. Le H, Nguyen M, Yan WQ (2020) Machine learning with synthetic data–a new way to learn and classify the pictorial augmented reality markers in real-time. In: 2020 35th International Conference on Image and Vision Computing New Zealand (IVCNZ), IEEE, pp 1–6

  19. He K, Zhang X, Ren S, Sun J (2015) Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE transactions on pattern analysis and machine intelligence 37(9):1904–1916

    Article  Google Scholar 

  20. Liu S, Qi L, Qin H, Shi J, Jia J (2018) Path aggregation network for instance segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 8759–8768

  21. Redmon J, Farhadi A (2018) Yolov3: An incremental improvement. arXiv:1804.02767

  22. Xu Z-F, Jia R-S, Sun H-M, Liu Q-M, Cui Z (2020) Light-yolov3: fast method for detecting green mangoes in complex scenes using picking robots. Appl Intell 50(12):4670–4687

    Article  Google Scholar 

  23. Yu J, Jiang Y, Wang Z, Cao Z, Huang T (2016) Unitbox: An advanced object detection network. In: Proceedings of the 24th ACM international conference on Multimedia, pp 516–520

  24. Rahman MA, Wang Y (2016) Optimizing intersection-over-union in deep neural networks for image segmentation. In: International symposium on visual computing, Springer, pp 234–244

  25. Zheng Z, Wang P, Liu W, Li J, Ye R, Ren D (2020) Distance-iou loss: Faster and better learning for bounding box regression.. In: AAAI, pp 12993–13000

  26. Rezatofighi H, Tsoi N, Gwak J, Sadeghian A, Reid I, Savarese S (2019) Generalized intersection over union: A metric and a loss for bounding box regression. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 658–666

  27. Erwin, Damayanti HR (2021) Supervised retinal vessel segmentation based average filter and iterative self organizing data analysis technique. Int J Comput Intell Appl 20(01):2150003

    Article  Google Scholar 

  28. Li L, Jiang Q, Zhang L, Ding G, Wang L, Zhang R, Zhang ZG, Li Q, Ewing JR, Kapke A et al (2006) Ischemic cerebral tissue response to subventricular zone cell transplantation measured by iterative self-organizing data analysis technique algorithm. Journal of Cerebral Blood Flow & Metabolism 26(11):1366–1377

    Article  Google Scholar 

  29. Zhang H, Hu Z, Hao R (2020) Joint information fusion and multi-scale network model for pedestrian detection. Vis Comput, pp 1–10

  30. Redmon J, Farhadi A (2017) Yolo9000: better, faster, stronger. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7263–7271

  31. Redmon J, Divvala S, Girshick R, Farhadi A (2016) You only look once: Unified, real-time object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 779–788

  32. Qiao T, Su H, Liu G, Wang M (2019) Object detection algorithm based on improved feature extraction network. Laser & Optoelectronics Progress 56(23):231008

    Article  Google Scholar 

  33. Tan M, Le Q (2019) Efficientnet: Rethinking model scaling for convolutional neural networks. In: International Conference on Machine Learning, PMLR, pp 6105–6114

  34. Zhang S, Chi C, Yao Y, Lei Z, Li SZ (2020) Bridging the gap between anchor-based and anchor-free detection via adaptive training sample selection. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 9759–9768

  35. Lee Y, Park J (2020) Centermask: Real-time anchor-free instance segmentation. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 13906–13915

Download references

Acknowledgments

The authors would like to thank the editor and anonymous reviewers for their helpful comments and suggestions to improve the quality of this paper. This work was supported by National Natural Science Foundation of China (No.62003296), the Natural Science Foundation of Hebei (No.F2020203031), the College Students’ Innovative Entrepreneurial Training Plan Program (No.202010216014), Science and Technology Project of Hebei Education Department (No.QN2020225). Finally, the authors would like to thank EfficientDet [33], ATSS [34], and CenterMask [35] for their inspiration.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Xiaofang Yang.

Ethics declarations

Conflict of Interests

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhang, H., Yang, X., Hu, Z. et al. High-density pedestrian detection algorithm based on deep information fusion. Appl Intell 52, 15483–15495 (2022). https://doi.org/10.1007/s10489-022-03354-1

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10489-022-03354-1

Keywords

Navigation