Skip to main content

Cascaded-Scoring Tracklet Matching for Multi-object Tracking

  • Conference paper
  • First Online:
Pattern Recognition and Computer Vision (PRCV 2023)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 14434))

Included in the following conference series:

  • 402 Accesses

Abstract

Multi-object tracking (MOT) aims at locating the object of interest in a successive video sequence and associating the same moving object frame by frame. Most existing approaches to MOT lack the integration of both motion and appearance information, which limits the effectiveness of tracklet association. The conventional approaches for tracklet association often struggle when dealing with scenarios involving multiple objects with indistinguishable appearances and irregular motions, leading to suboptimal performance. In this paper, we introduce an appearance-assisted feature warper (AFW) module and a motion-guided based target aware (MTA) module to efficiently utilize the appearance and motion information. Additionally, we introduce a cascaded-scoring tracklet matching (CSTM) strategy that seamlessly integrates the two modules, combining appearance features with motion information. Our proposed online MOT tracker is called CSTMTrack. Through extensive quantitative and qualitative results, we demonstrate that our tracker achieves efficient and favorable performance compared to several other state-of-the-art trackers on the MOTChallenge benchmark.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 59.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 79.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Bergstra, J., Bengio, Y.: Random search for hyper-parameter optimization. J. Mach. Learn. Res. 13(2) (2012)

    Google Scholar 

  2. Bewley, A., Ge, Z., Ott, L., Ramos, F., Upcroft, B.: Simple online and realtime tracking. In: ICIP, pp. 3464–3468 (2016)

    Google Scholar 

  3. Cao, J., Pang, J., Weng, X., Khirodkar, R., Kitani, K.: Observation-centric sort: rethinking sort for robust multi-object tracking. In: CVPR, pp. 9686–9696 (2023)

    Google Scholar 

  4. Cheng, Y., Li, L., Xu, Y., Li, X., Yang, Z., Wang, W., Yang, Y.: Segment and track anything. arXiv preprint arXiv:2305.06558 (2023)

  5. Gao, P., Ma, Y., Yuan, R., Xiao, L., Wang, F.: Learning cascaded siamese networks for high performance visual tracking. In: ICIP, pp. 3078–3082 (2019)

    Google Scholar 

  6. Ge, Z., Liu, S., Wang, F., Li, Z., Sun, J.: Yolox: exceeding yolo series in 2021. arXiv preprint arXiv:2107.08430 (2021)

  7. Guo, S., Wang, J., Wang, X., Tao, D.: Online multiple object tracking with cross-task synergy. In: CVPR, pp. 8136–8145 (2021)

    Google Scholar 

  8. Huang, K., Chu, J., Qin, P.: Two-stage object tracking based on similarity measurement for fused features of positive and negative samples. In: PRCV, pp. 621–632 (2022)

    Google Scholar 

  9. Liang, C., Zhang, Z., Zhou, X., Li, B., Zhu, S., Hu, W.: Rethinking the competition between detection and reid in multiobject tracking. IEEE Trans. Image Process. 31, 3182–3196 (2022)

    Article  Google Scholar 

  10. Liu, Y., Yang, W., Yu, H., Feng, L., Kong, Y., Liu, S.: Background suppressed and motion enhanced network for weakly supervised video anomaly detection. In: PRCV, pp. 678–690 (2022)

    Google Scholar 

  11. Pang, B., Li, Y., Zhang, Y., Li, M., Lu, C.: Tubetk: adopting tubes to track multi-object in a one-step training model. In: CVPR, pp. 6308–6318 (2020)

    Google Scholar 

  12. Pang, J., et al.: Quasi-dense similarity learning for multiple object tracking. In: CVPR, pp. 164–173 (2021)

    Google Scholar 

  13. Peng, J., et al.: Chained-tracker: chaining paired attentive regression results for end-to-end joint multiple-object detection and tracking. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12349, pp. 145–161. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58548-8_9

    Chapter  Google Scholar 

  14. Saleh, F., Aliakbarian, S., Rezatofighi, H., Salzmann, M., Gould, S.: Probabilistic tracklet scoring and inpainting for multiple object tracking. In: CVPR, pp. 14329–14339 (2021)

    Google Scholar 

  15. Sommer, L., Krüger, W.: Usage of vehicle re-identification models for improved persistent multiple object tracking in wide area motion imagery. In: ICIP, pp. 331–335 (2022)

    Google Scholar 

  16. Stadler, D., Beyerer, J.: Modelling ambiguous assignments for multi-person tracking in crowds. In: WACV, pp. 133–142 (2022)

    Google Scholar 

  17. Sun, P., et al.: Transtrack: multiple object tracking with transformer. arXiv preprint arXiv:2012.15460 (2020)

  18. Tokmakov, P., Li, J., Burgard, W., Gaidon, A.: Learning to track with object permanence. In: ICCV, pp. 10860–10869 (2021)

    Google Scholar 

  19. Wang, L., Hui, L., Xie, J.: Facilitating 3D object tracking in point clouds with image semantics and geometry. In: PRCV, pp. 589–601 (2021)

    Google Scholar 

  20. Wang, S., Sheng, H., Zhang, Y., Wu, Y., Xiong, Z.: A general recurrent tracking framework without real data. In: ICCV, pp. 13219–13228 (2021)

    Google Scholar 

  21. Wang, Y., Kitani, K., Weng, X.: Joint object detection and multi-object tracking with graph neural networks. In: ICRA, pp. 13708–13715 (2021)

    Google Scholar 

  22. Wang, Y., Li, C., Tang, J.: Learning soft-consistent correlation filters for RGB-T object tracking. In: PRCV, pp. 295–306 (2018)

    Google Scholar 

  23. Wang, Z., Zheng, L., Liu, Y., Li, Y., Wang, S.: Towards real-time multi-object tracking. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12356, pp. 107–122. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58621-8_7

    Chapter  Google Scholar 

  24. Wojke, N., Bewley, A., Paulus, D.: Simple online and realtime tracking with a deep association metric. In: ICIP, pp. 3645–3649 (2017)

    Google Scholar 

  25. Wu, D., Han, W., Wang, T., Dong, X., Zhang, X., Shen, J.: Referring multi-object tracking. In: CVPR, pp. 14633–14642 (2023)

    Google Scholar 

  26. Xu, J., Cao, Y., Zhang, Z., Hu, H.: Spatial-temporal relation networks for multi-object tracking. In: ICCV, pp. 3988–3998 (2019)

    Google Scholar 

  27. Yang, F., Chang, X., Sakti, S., Wu, Y., Nakamura, S.: Remot: a model-agnostic refinement for multiple object tracking. Image Vis. Comput. 106, 104091 (2021)

    Article  Google Scholar 

  28. Yin, J., Wang, W., Meng, Q., Yang, R., Shen, J.: A unified object motion and affinity model for online multi-object tracking. In: CVPR, pp. 6768–6777 (2020)

    Google Scholar 

  29. Yu, E., Li, Z., Han, S., Wang, H.: Relationtrack: relation-aware multiple object tracking with decoupled representation. IEEE Trans. Multimedia (2022)

    Google Scholar 

  30. Zeng, F., Dong, B., Zhang, Y., Wang, T., Zhang, X., Wei, Y.: MOTR: end-to-end multiple-object tracking with transformer. In: Avidan, S., Brostow, G., Cissé, M., Farinella, G.M., Hassner, T. (eds.) ECCV 2022. LNCS, vol. 13687, pp. 659–675. Springer, Cham (2022). https://doi.org/10.1007/978-3-031-19812-0_38

    Chapter  Google Scholar 

  31. Zhang, M., Pan, Z., Feng, J., Zhou, J.: 3D multi-object detection and tracking with sparse stationary lidar. In: PRCV, pp. 16–28 (2021)

    Google Scholar 

  32. Zhang, Y., et al.: Bytetrack: multi-object tracking by associating every detection box. In: Avidan, S., Brostow, G., Cissé, M., Farinella, G.M., Hassner, T. (eds.) ECCV 2022. LNCS, vol. 13682, pp. 1–21. Springer, Cham (2022). https://doi.org/10.1007/978-3-031-20047-2_1

    Chapter  Google Scholar 

  33. Zhang, Y., Wang, C., Wang, X., Zeng, W., Liu, W.: FairMOT: on the fairness of detection and re-identification in multiple object tracking. Int. J. Comput. Vision 129, 3069–3087 (2021)

    Article  Google Scholar 

  34. Zhao, K., et al.: Driver behavior decision making based on multi-action deep Q network in dynamic traffic scenes. In: PRCV, pp. 174–186 (2022)

    Google Scholar 

  35. Zhou, X., Koltun, V., Krähenbühl, P.: Tracking objects as points. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12349, pp. 474–490. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58548-8_28

    Chapter  Google Scholar 

Download references

Acknowledgements

This work was supported by the National Natural Science Foundation of China under Grant U21A20514, 62002302, by the FuXiaQuan National Independent Innovation Demonstration Zone Collaborative Innovation Platform Project under Grant 3502ZCQXT2022008, and by the China Fundamental Research Funds for the Central Universities under Grants 20720230038.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yang Lu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Xie, Y., Wang, H., Lu, Y. (2024). Cascaded-Scoring Tracklet Matching for Multi-object Tracking. In: Liu, Q., et al. Pattern Recognition and Computer Vision. PRCV 2023. Lecture Notes in Computer Science, vol 14434. Springer, Singapore. https://doi.org/10.1007/978-981-99-8549-4_14

Download citation

  • DOI: https://doi.org/10.1007/978-981-99-8549-4_14

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-99-8548-7

  • Online ISBN: 978-981-99-8549-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics