Skip to main content
Log in

CNA-DeepSORT algorithm for multi-target tracking

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

In recent years, multi-target tracking algorithms have been developed rapidly. However, in multi-target tracking, mutual occlusion and cross between targets and sudden disappearance and reappearance of targets in videos can easily occur, which could only result in missed detection, false detection, and wrong ID switching. To address the above problems, the CenterNet attention DeepSORT algorithm (CNA-DeepSORT) proposed in this paper incorporates a CenterNet network with channel attention mechanism in the original detection part of the DeepSORT algorithm instead of Faster R-CNN, and designs a multi-scale feature extraction module with the DeepSORT algorithm in the multi-scale feature extraction module and designed a pedestrian recognition network combined with the DeepSORT algorithm. These improvements lead to a 3.7% improvement in MOTA metric, 1.6% improvement in MOTP metric, 238 fewer false ID switches, 2627 fewer FP metrics, 3943 fewer FN metrics, a decrease in run speed, and a 4 Hz reduction in frame rate compared to the original DeepSORT algorithm. improved by 3. 7, and there is some improvement in handling the occlusion problem of multi-target tracking, and the false and missed detection of targets during ID switching is reduced.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17

Similar content being viewed by others

Data availability

Not applicable for that section.

References

  1. Arulampalam MS, Maskell S, Gordon N et al (2002) A Tutorial on Particle Filters for Online Nonlinear/Non-Gaussian Bayesian Tracking[J]. IEEE Trans Signal Process 50(2):174–188

    Article  Google Scholar 

  2. Bewley A, Ge Z, Ott L, et al. Simple Online and Realtime Tracking[C]// 2016 IEEE International Conference on Image Processing (ICIP). IEEE, 2016.

  3. Bhatti UA, et al. 2022. Local Similarity-Based Spatia–Spectral Fusion Hyperspectral Image Classification With Deep CNN and Gabor Filtering," in IEEE Transactions on Geoscience and Remote Sensing, vol. 60, pp. 1–15, Art no. 5514215. https://doi.org/10.1109/TGRS.2021.3090410.

  4. Bhatti UA et al (2021) Time Series Analysis and Forecasting of Air Pollution Particulate Matter (PM2.5): An SARIMA and Factor Analysis Approach. IEEE Access 9:41019–41031. https://doi.org/10.1109/ACCESS.2021.3060744

    Article  Google Scholar 

  5. Bhatti UA, Huang M, Di Wu, Zhang Yu, Mehmood A, Han H (2019) Recommendation system using feature extraction and pattern recognition in clinical care systems. Ent Inform Syst 13(3):329–351. https://doi.org/10.1080/17517575.2018.1557256

    Article  Google Scholar 

  6. Bhatti UA, Huang M, Hao Wang Yu, Zhang AM, Di Wu (2018) Recommendation system for immunization coverage and monitoring. Hum Vaccin Immunother 14(1):165–171. https://doi.org/10.1080/21645515.2017.1379639

    Article  Google Scholar 

  7. Blackman SS (2004) Multiple hypothesis tracking for multiple target tracking[J]. IEEE Aerosp Electron Syst Mag 19(1):5–18

    Article  Google Scholar 

  8. Bossard L, Guillaumin. 2014. Gool L V . Food-101 – Mining Discriminative Components with Random Forests[J]. Springer International Publishing

  9. Choi W. (2016) Near-Online Multi-target Tracking with Aggregated Local Flow Descriptor. IEEE Int Conf Comp Vis IEEE.

  10. Everingham M, Winn J. (2011) The pascal visual object classes challenge 2012 (voc2012) development kit[J]. Pattern Analysis, Statistical Modelling and Computational Learning, Tech. Rep. 8: 5.

  11. Fukunaga K, Hostetler L (1975) The estimation of the gradient of a density function, with applications in pattern recognition[J]. IEEE Trans Inf Theory 21(1):32–40

    Article  MathSciNet  Google Scholar 

  12. Girshick R. 2015. Fast R-CNN[J]. Computer Science

  13. Hu J, Shen L, Sun G. (2018) Squeeze-and-excitation networks[C]//Proceedings of the IEEE Conf Comp Vis Pattern Recog. 7132–7141.

  14. Kuhn HW (2010) The Hungarian method for the assignment problem[J]. Nav Res Logist 52(1–2):7–21

    Google Scholar 

  15. Lan W. (2020) Target tracking and risk avoidance system for intelligent driving system based on 5G signal anomaly detection[J]. International Journal of Communication Systems.

  16. Leal-Taixé L, Milan A, Reid I, et al. (2015) Motchallenge 2015: Towards a benchmark for multi-target tracking[J]. arXiv preprint arXiv:1504.01942.

  17. Lee B, Erdenee E, Jin S et al (2016) Multi-Class Multi-Object Tracking using Changing Point Detection[J]. Springer, Cham

    Book  Google Scholar 

  18. Liang Z, Zhi B, Sun Y et al (2016) MARS: A Video Benchmark for Large-Scale Person Re-Identification[J]. Springer, Cham

    Google Scholar 

  19. Lin T Y, Goyal P , Girshick R, et al. (2017). Focal Loss for Dense Object Detection[J]. IEEE Trans Pattern Anal Mach Intell. (99):2999–3007.

  20. Liu W, Anguelov D, Erhan D et al (2016) SSD: Single Shot MultiBox Detector[J]. Springer, Cham

    Google Scholar 

  21. Luo W, Xing J, Zhang X , et al. Multiple Object Tracking: A Literature Review[J]. Eprint Arxiv, 2015.

  22. Milan A, Leal-Taixe L, Reid I, et al. 2016. MOT16: A Benchmark for Multi-Object Tracking[J].

  23. Redmon J, Divvala S , Girshick R , et al. (2016) You Only Look Once: Unified, Real-Time Object Detection[J]. IEEE.

  24. Redmon J, Farhadi A. (2017). YOLO9000: Better, Faster, Stronger[J]. IEEE Conference on Computer Vision & Pattern Recognition. 6517–6525.

  25. Redmon J , Farhadi A. (2018). YOLOv3: An Incremental Improvement[J]. arXiv e-prints.

  26. Ren S, He K, Girshick R et al (2017) Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks[J]. IEEE Trans Pattern Anal Mach Intell 39(6):1137–1149

    Article  Google Scholar 

  27. Roccetti M, Delnevo G, Casini L, et al. (2019) Is bigger always better? A controversial journey to the center of machine learning design, with uses and misuses of big data for predicting water meter failures[J]. J Big Data. 6(1).

  28. Roecker JA (1994) A class of near optimal JPDA algorithms[J]. IEEE Trans Aerosp Electron Syst 30(2):504–510

    Article  Google Scholar 

  29. Roecker JA (1994) A class of near optimal JPDA algorithms[J]. Aerospace Electr Syst IEEE Trans 30(2):504–510

    Article  Google Scholar 

  30. Rui L A , Bz A , Zhu T A , et al. (2022) An End-to-End Identity Association Network based on Geometry Refinement for Multi-Object Tracking[J]. Pattern Recognition.

  31. Sanchez-Matilla R, Poiesi F, Cavallaro A. (2016) Online Multi-target Tracking with Strong and Weak Detections. Springer Int Pub. Springer International Publishing.

  32. Simonyan K, Zisserman A. (2014) Very deep convolutional networks for large-scale image recognition[J]. arXiv preprint arXiv:1409.1556.

  33. Smiatek J, Jung A, Bluhmki E. (2021) Validation Is Not Verification: Precise Terminology and Scientific Methods in Bioprocess Modeling, Trends in Biotechnology, 39(11): (1117–1119).

  34. Strippoli S. Personal Digital Assistant (PDA)[J]. 2002.

  35. Strippoli S. (2002) Personal Digital Assistant (PDA)[J].

  36. Sun S, Akhtar N, Song H, et al. 2019. Deep Affinity Network for Multiple Object Tracking[J]. IEEE Trans on Pattern Anal Mach Intell. PP(99).

  37. Xiang Y, Alahi A, Savarese S. (2015) Learning to track: Online multi-object tracking by decision making[C]. Proc IEEE Int Conf Comp Vis. 4705–4713.

  38. Yang B, Yan J, Lei Z, et al. 2014. Aggregate channel features for multi-view face detection[C]. IEEE Int Joint Conf Biomet. IEEE. 1–8.

  39. Yu F, Li W, Li Q, et al. (2016) Poi: Multiple object tracking with high performance detection and appearance feature. Eur Conf Comp Vis. Springer, Cham. 36–42.

  40. Yu F, Wang D, Shelhamer E , et al. (2017). Deep Layer Aggregation[J]. arXiv

  41. Zagoruyko S. (2016) Komodakis N . Wide Residual Networks[J].

  42. Zeiler M D, Fergus R. (2014) Visualizing and understanding convolutional networks[C]. Eur Conf Comp Vis. Springer, Cham. 818–833.

  43. Zheng Z, Zheng L, Yang Y. 2017. Pedestrian Alignment Network for Large-scale Person Re-identification[J]. IEEE Trans Circuits Syst Vid Tech. PP(99).

  44. Zhou X, Wang D, Krähenbühl P. 2019. Objects as points[J]. arXiv preprint arXiv:1904.07850.

Download references

Author information

Authors and Affiliations

Authors

Contributions

Formal analysis, Kaili Feng and Wenxiao Huo; Methodology, Kaili Feng and Wenxiao Huo; Supervision, Wenhao Xu and Tianping Li; Writing—original draft, Kaili Feng and Wenxiao Huo; Writing—review & editing, Kaili Feng and Meng Li.

Corresponding author

Correspondence to Tianping Li.

Ethics declarations

Ethical approval

This paper is an original research achievement obtained by the author, and no one draft is submitted more than once; The content of the paper does not involve state secrets; It has not been published in any form in any language at home and abroad; The content of the paper shall not infringe upon the copyright and other rights of others. In case of multiple submissions, infringement, disclosure and other problems, the author of the paper will bear full responsibility.

Consent to participate

All authors consent to participate.

Consent for publication

All authors consent for publication.

Conflicts of interest

The authors declare no conflict of interest. The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Feng, K., Huo, W., Xu, W. et al. CNA-DeepSORT algorithm for multi-target tracking. Multimed Tools Appl 83, 4731–4755 (2024). https://doi.org/10.1007/s11042-023-15813-z

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-023-15813-z

Keywords

Navigation