Skip to main content

Advertisement

Log in

Visual detection and tracking algorithms for human motion

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

In dense scenes, a large number of individuals can introduce serious complications for motion detection, such as blurred vision, chaotic scenes, and complex behaviours. For low-density pedestrian detection and tracking algorithms, the accuracy is greatly reduced for both detection and tracking. High-density detection or tracking fails too when these problems are encountered in high-density scenes. In light of the above problems, a detection algorithm and a tracking algorithm based on the human head and shoulder model are proposed. A support vector machine is used to train the classifier by machine learning. The detection algorithm proposed in this paper achieves a detection accuracy of 94% by using the MIT and INRIA datasets. The average accuracy of pedestrian tracking in high-density scenes is approximately 95%.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13

Similar content being viewed by others

Data availability

The data that support the findings of this study are available on request from the corresponding author. The data are not publicly available due to their containing information that could compromise the privacy of research participants.

References

  1. Bilal M, Hanif MS (2020) Benchmark revision for HOG-SVM pedestrian detector through reinvigorated training and evaluation methodologies. IEEE Trans Intell Transp Syst 21(3):1277–1287

    Article  Google Scholar 

  2. Chen K, Song X, Zhai X, Zhang B, Hou B, Wang Yi (2019) An integrated deep learning framework for occluded pedestrian tracking[J]. IEEE Access 7:26060–26072

    Article  Google Scholar 

  3. Chu Y, Fei J, Hou S (2020) Adaptive global sliding-mode control for dynamic systems using double hidden layer recurrent neural network structure[J]. IEEE Transactions on Neural Networks and Learning Systems 31(4):1297–1309

    Article  MathSciNet  Google Scholar 

  4. Hasan I, Setti F, Tsesmelis T, Del Bue A, Galasso F, Cristani M (2018) MX-LSTM: mixing tracklets and vislets to jointly forecast trajectories and head poses[C]. IEEE/CVF conference on computer vision and pattern recognition, pp 6067–6076

  5. Hasan I, Setti F, Tsesmelis T, Belagiannis V, Amin S, Del Bue A, Cristani M, Galasso F (2021) Forecasting people trajectories and head poses by jointly reasoning on tracklets and vislets[J]. IEEE Trans Pattern Anal Mach Intell 43(4):1267–1278

    Article  Google Scholar 

  6. INRIA Person Dataset[DB/OL]. http://pascal.inrialpes.fr/data/human. 2022-04-11

  7. Kim JU, Park S, Ro YM (2022) Uncertainty-guided cross-modal learning for robust multispectral pedestrian detection[J]. IEEE Trans Circuits Syst Video Technol 32(3):1510–1523

    Article  Google Scholar 

  8. Lee HS, Cho SI (2022) Luminance level of histogram-based scene-change detection for frame rate up-conversion[J]. IEEE Access 10:15968–15977

    Article  Google Scholar 

  9. Leng J, Liu Y, Dawei Du, Zhang T, Quan P (2020) Robust obstacle detection and recognition for driver assistance systems[J]. IEEE Trans Intell Transp Syst 21(4):1560–1571

    Article  Google Scholar 

  10. Li T, Chang H, Wang M et al (2015) Crowded scene analysis: a survey[J]. IEEE Trans Circuits Syst Video Technol 25(3):367–386

    Article  Google Scholar 

  11. Li T, Ma Y, Shen H, Endoh T (2020) FPGA implementation of real-time pedestrian detection using normalization-based validation of adaptive features clustering[J]. IEEE Trans Veh Technol 69(9):9330–9341

    Article  Google Scholar 

  12. Li S, Jing Wu, Long C, Lin Y-B (2021) A full-process optimization-based background subtraction for moving object detection on general-purpose embedded devices[J]. IEEE Trans Consum Electron 67(2):129–140

    Article  Google Scholar 

  13. MIT Pedestrian Data[DB/OL]. http://cbcl.mit.edu/software-datasets/PedestrianData.html. 2022-04-1

  14. Qin Z, Shelton CR (2016) Social grouping for multi-target tracking and head pose estimation in video[J]. IEEE Trans Pattern Anal Mach Intell 38(10):2082–2095

    Article  Google Scholar 

  15. Radman A, Suandi SA (2018) Robust face pseudo-sketch synthesis and recognition using morphological-arithmetic operations and HOG-PCA[J]. Multimed Tools Appl 77:25311–25332

    Article  Google Scholar 

  16. Sanberg WP, Dubbelman G, de With PHN (2021) ASTEROIDS: a stixel tracking extrapolation-based relevant obstacle impact detection system [J]. IEEE Trans Intell Veh 6(1):34–46

    Article  Google Scholar 

  17. Sun Z, Chen J, Chao L, Ruan W, Mukherjee M (2021) A survey of multiple pedestrian tracking based on tracking-by-detection framework. IEEE Trans Circuits Syst Video Technol 31(5):1819–1833

    Article  Google Scholar 

  18. Sundararaman R, De Almeida Braga C, Marchand E, Pettré J (2021) Tracking pedestrian heads in dense crowd[C]. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp 3864–3874

  19. Zhang G, Weizhe Xu, Zhao W, Huang C, Yk EN, Chen Y, Jian Su (2021) A Multiscale attention network for remote sensing scene images classification[J]. IEEE J Sel Top Appl Earth Obs Remote Sens 14:9530–9545

    Article  Google Scholar 

  20. Zhang J, Zhang Xi, Zhang Y, Duan Y, Li Y, Pan Z (2021) Meta-knowledge learning and domain adaptation for unseen background subtraction[J]. IEEE Trans Image Process 30:9058–9068

    Article  Google Scholar 

Download references

Acknowledgements

This research was financially supported by the Major Scientific Research Project for Universities of Guangdong Province (2020ZDZX3058); Guangdong Provincial Special Funds Project for Discipline Construction (No. 2013 WYXM0122); Science and Technology Projects of Zhuhai in the Field of Social Development (2220004000066); Key Laboratory of Intelligent Multimedia Technology (201762005)

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ge Yang.

Ethics declarations

Conflict of interest

We declare that we have no financial and personal relationships with other people or organizations that can inappropriately influence our work, and there is no professional or other personal interest of any nature or kind in any product, service and/or company that could be construed as influencing the position presented in, or the review of, the manuscript entitled.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Yang, G., Chen, S. Visual detection and tracking algorithms for human motion. Multimed Tools Appl 82, 47165–47188 (2023). https://doi.org/10.1007/s11042-023-15231-1

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-023-15231-1

Keyword

Navigation