Abstract
Multi-object tracking (MOT) aims at locating the object of interest in a successive video sequence and associating the same moving object frame by frame. Most existing approaches to MOT lack the integration of both motion and appearance information, which limits the effectiveness of tracklet association. The conventional approaches for tracklet association often struggle when dealing with scenarios involving multiple objects with indistinguishable appearances and irregular motions, leading to suboptimal performance. In this paper, we introduce an appearance-assisted feature warper (AFW) module and a motion-guided based target aware (MTA) module to efficiently utilize the appearance and motion information. Additionally, we introduce a cascaded-scoring tracklet matching (CSTM) strategy that seamlessly integrates the two modules, combining appearance features with motion information. Our proposed online MOT tracker is called CSTMTrack. Through extensive quantitative and qualitative results, we demonstrate that our tracker achieves efficient and favorable performance compared to several other state-of-the-art trackers on the MOTChallenge benchmark.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Bergstra, J., Bengio, Y.: Random search for hyper-parameter optimization. J. Mach. Learn. Res. 13(2) (2012)
Bewley, A., Ge, Z., Ott, L., Ramos, F., Upcroft, B.: Simple online and realtime tracking. In: ICIP, pp. 3464–3468 (2016)
Cao, J., Pang, J., Weng, X., Khirodkar, R., Kitani, K.: Observation-centric sort: rethinking sort for robust multi-object tracking. In: CVPR, pp. 9686–9696 (2023)
Cheng, Y., Li, L., Xu, Y., Li, X., Yang, Z., Wang, W., Yang, Y.: Segment and track anything. arXiv preprint arXiv:2305.06558 (2023)
Gao, P., Ma, Y., Yuan, R., Xiao, L., Wang, F.: Learning cascaded siamese networks for high performance visual tracking. In: ICIP, pp. 3078–3082 (2019)
Ge, Z., Liu, S., Wang, F., Li, Z., Sun, J.: Yolox: exceeding yolo series in 2021. arXiv preprint arXiv:2107.08430 (2021)
Guo, S., Wang, J., Wang, X., Tao, D.: Online multiple object tracking with cross-task synergy. In: CVPR, pp. 8136–8145 (2021)
Huang, K., Chu, J., Qin, P.: Two-stage object tracking based on similarity measurement for fused features of positive and negative samples. In: PRCV, pp. 621–632 (2022)
Liang, C., Zhang, Z., Zhou, X., Li, B., Zhu, S., Hu, W.: Rethinking the competition between detection and reid in multiobject tracking. IEEE Trans. Image Process. 31, 3182–3196 (2022)
Liu, Y., Yang, W., Yu, H., Feng, L., Kong, Y., Liu, S.: Background suppressed and motion enhanced network for weakly supervised video anomaly detection. In: PRCV, pp. 678–690 (2022)
Pang, B., Li, Y., Zhang, Y., Li, M., Lu, C.: Tubetk: adopting tubes to track multi-object in a one-step training model. In: CVPR, pp. 6308–6318 (2020)
Pang, J., et al.: Quasi-dense similarity learning for multiple object tracking. In: CVPR, pp. 164–173 (2021)
Peng, J., et al.: Chained-tracker: chaining paired attentive regression results for end-to-end joint multiple-object detection and tracking. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12349, pp. 145–161. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58548-8_9
Saleh, F., Aliakbarian, S., Rezatofighi, H., Salzmann, M., Gould, S.: Probabilistic tracklet scoring and inpainting for multiple object tracking. In: CVPR, pp. 14329–14339 (2021)
Sommer, L., Krüger, W.: Usage of vehicle re-identification models for improved persistent multiple object tracking in wide area motion imagery. In: ICIP, pp. 331–335 (2022)
Stadler, D., Beyerer, J.: Modelling ambiguous assignments for multi-person tracking in crowds. In: WACV, pp. 133–142 (2022)
Sun, P., et al.: Transtrack: multiple object tracking with transformer. arXiv preprint arXiv:2012.15460 (2020)
Tokmakov, P., Li, J., Burgard, W., Gaidon, A.: Learning to track with object permanence. In: ICCV, pp. 10860–10869 (2021)
Wang, L., Hui, L., Xie, J.: Facilitating 3D object tracking in point clouds with image semantics and geometry. In: PRCV, pp. 589–601 (2021)
Wang, S., Sheng, H., Zhang, Y., Wu, Y., Xiong, Z.: A general recurrent tracking framework without real data. In: ICCV, pp. 13219–13228 (2021)
Wang, Y., Kitani, K., Weng, X.: Joint object detection and multi-object tracking with graph neural networks. In: ICRA, pp. 13708–13715 (2021)
Wang, Y., Li, C., Tang, J.: Learning soft-consistent correlation filters for RGB-T object tracking. In: PRCV, pp. 295–306 (2018)
Wang, Z., Zheng, L., Liu, Y., Li, Y., Wang, S.: Towards real-time multi-object tracking. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12356, pp. 107–122. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58621-8_7
Wojke, N., Bewley, A., Paulus, D.: Simple online and realtime tracking with a deep association metric. In: ICIP, pp. 3645–3649 (2017)
Wu, D., Han, W., Wang, T., Dong, X., Zhang, X., Shen, J.: Referring multi-object tracking. In: CVPR, pp. 14633–14642 (2023)
Xu, J., Cao, Y., Zhang, Z., Hu, H.: Spatial-temporal relation networks for multi-object tracking. In: ICCV, pp. 3988–3998 (2019)
Yang, F., Chang, X., Sakti, S., Wu, Y., Nakamura, S.: Remot: a model-agnostic refinement for multiple object tracking. Image Vis. Comput. 106, 104091 (2021)
Yin, J., Wang, W., Meng, Q., Yang, R., Shen, J.: A unified object motion and affinity model for online multi-object tracking. In: CVPR, pp. 6768–6777 (2020)
Yu, E., Li, Z., Han, S., Wang, H.: Relationtrack: relation-aware multiple object tracking with decoupled representation. IEEE Trans. Multimedia (2022)
Zeng, F., Dong, B., Zhang, Y., Wang, T., Zhang, X., Wei, Y.: MOTR: end-to-end multiple-object tracking with transformer. In: Avidan, S., Brostow, G., Cissé, M., Farinella, G.M., Hassner, T. (eds.) ECCV 2022. LNCS, vol. 13687, pp. 659–675. Springer, Cham (2022). https://doi.org/10.1007/978-3-031-19812-0_38
Zhang, M., Pan, Z., Feng, J., Zhou, J.: 3D multi-object detection and tracking with sparse stationary lidar. In: PRCV, pp. 16–28 (2021)
Zhang, Y., et al.: Bytetrack: multi-object tracking by associating every detection box. In: Avidan, S., Brostow, G., Cissé, M., Farinella, G.M., Hassner, T. (eds.) ECCV 2022. LNCS, vol. 13682, pp. 1–21. Springer, Cham (2022). https://doi.org/10.1007/978-3-031-20047-2_1
Zhang, Y., Wang, C., Wang, X., Zeng, W., Liu, W.: FairMOT: on the fairness of detection and re-identification in multiple object tracking. Int. J. Comput. Vision 129, 3069–3087 (2021)
Zhao, K., et al.: Driver behavior decision making based on multi-action deep Q network in dynamic traffic scenes. In: PRCV, pp. 174–186 (2022)
Zhou, X., Koltun, V., Krähenbühl, P.: Tracking objects as points. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12349, pp. 474–490. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58548-8_28
Acknowledgements
This work was supported by the National Natural Science Foundation of China under Grant U21A20514, 62002302, by the FuXiaQuan National Independent Innovation Demonstration Zone Collaborative Innovation Platform Project under Grant 3502ZCQXT2022008, and by the China Fundamental Research Funds for the Central Universities under Grants 20720230038.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Xie, Y., Wang, H., Lu, Y. (2024). Cascaded-Scoring Tracklet Matching for Multi-object Tracking. In: Liu, Q., et al. Pattern Recognition and Computer Vision. PRCV 2023. Lecture Notes in Computer Science, vol 14434. Springer, Singapore. https://doi.org/10.1007/978-981-99-8549-4_14
Download citation
DOI: https://doi.org/10.1007/978-981-99-8549-4_14
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-8548-7
Online ISBN: 978-981-99-8549-4
eBook Packages: Computer ScienceComputer Science (R0)