skip to main content
10.1145/3654823.3654838acmotherconferencesArticle/Chapter ViewAbstractPublication PagescacmlConference Proceedingsconference-collections
research-article

Regalignsort: Multi-Object Tracking Based on Location Regression and Local Feature Association

Published: 29 May 2024 Publication History

Abstract

Recently, multi-object tracking (MOT) has attracted more attention, and it has made significant progress with the development of various object detection methods as well as various REID models. However, motion models in most recent tracking algorithms usually assume that the object does linear motion in a tiny time window, so these methods are sensitive to nonlinear motion and irregular random motion, resulting in tracking accuracy being easily disturbed by occlusion. In addition, not powerful enough global appearance feature makes it difficult to recover the trajectory from the lost state, which leads to a large number of ID switches in the tracking process. In view of the above two problems, we propose a new multi-object tracking (MOT) algorithm named RegAlignSORT based on the DeepSORT framework. Aiming at the occlusion or irregular movement that linear motion models are susceptible to, we use the regression head of Faster-RCNN to realize more stable motion estimation, avoiding the tracking uncertainty of the linear motion model. Aiming at the problem of limited matching accuracy of global appearance features, we use our own previously proposed LocalOptReid network to extract local features, and the local optimal alignment strategy is used to calculate the alignment appearance loss between trajectories and detections to achieve more accurate data association. The experimental results show that our proposed RegAlignSORT achieve state-of-the-art HOTA and IDF1 on MOT17 and MOT20 datasets.

References

[1]
Bewley A, Ge Z, Ott L, Simple online and realtime tracking[C]//2016 IEEE international conference on image processing (ICIP). IEEE, 2016: 3464-3468
[2]
Wojke N, Bewley A, Paulus D. Simple online and realtime tracking with a deep association metric[C]//2017 IEEE international conference on image processing (ICIP). IEEE, 2017: 3645-3649.
[3]
Bochinski E, Senst T, Sikora T. Extending IOU based multi-object tracking by visual information[C]//2018 15th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS). IEEE, 2018: 1-6.
[4]
Zhu, Z. (2018) “End-to-end flow correlation tracking with spatial-temporal attention,” 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition [Preprint]. Available at: https://doi.org/10.1109/cvpr.2018.00064.
[5]
Bergmann P, Meinhardt T, Leal-Taixe L. Tracking without bells andwhistles[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision. 2019: 941-951.
[6]
Y. Du, "StrongSORT: Make DeepSORT Great Again," in IEEE Transactions on Multimedia.
[7]
Erich L Lehmann and George Casella. Theory of point estimation. Springer Science & Business Media, 2006.
[8]
Rudolf Emil Kalman Contributions to the theory of optimal control. Bol. soc. mat. mexicana, 5(2):102–119, 1960.
[9]
Gerald L Smith, Stanley F Schmidt, and Leonard A McGee. Application of statistical filter theory to the optimal estimation of position and velocity on board a circumlunar vehicle. National Aeronautics and Space Administration, 1962.
[10]
Simon J Julier and Jeffrey K Uhlmann. New extension of the kalman filter to nonlinear systems. In Signal processing, sensor fusion, and target recognition VI, volume 3068, pages 182–193. International Society for Optics and Photonics, 1997. 2, 15
[11]
Stefano Pellegrini, Andreas Ess, Konrad Schindler, and Luc Van Gool. You'll never walk alone: modeling social behavior for multi- target tracking. IEEE International Conference on Computer Vision (ICCV), pages 261–268, 2009.
[12]
Paul Scovanner and Marshall F. Tappen. Learning pedestrian dynamics from the real world. IEEE International Conference on Computer Vision (ICCV), pages 381–388, 2009.
[13]
Boyu Chen, Dong Wang, Peixia Li, Shuang Wang, and Huchuan Lu. Real-time ’actor-critic’ tracking. In The European Conference on Computer Vision (ECCV), September 2018
[14]
J. Shao, Z. Xiong, H. Wu and Y. Zhang, "LocalOptReID: Person re- identification using optimal alignment of local features," 2022 IEEE International Conference on Unmanned Systems (ICUS), Guangzhou, China, 2022, pp. 1310-1319.
[15]
Zhang, Y., Sun, P., Jiang, Y., Yu, D., Weng, F., Yuan, Z., Luo, P., Liu, W., Wang, X.: Bytetrack: Multi-object tracking by associating every detection box. In: Computer Vision–ECCV 2022: 17th European Conference, Tel Aviv, Israel, October 23–27, 2022, Proceedings, Part XXII. pp. 1–21. Springer (2022).
[16]
Zhou, X., Koltun, V., Krahenbuhl, P. Tracking objects as points. In: European Conference on Computer Vision. pp. 474–490. Springer (2020).
[17]
Zhang, Y., Wang, C., Wang, X., Zeng, W., Liu, W. Fairmot: On the fairness of detection and re-identifification in multiple object tracking. International Journal of Computer Vision 129(11), 3069– 3087 (2021).
[18]
Sun, P., Cao, J., Jiang, Y., Zhang, R., Xie, E., Yuan, Z., Wang, C., Luo, P.: Transtrack: Multiple object tracking with transformer. arXiv preprint arXiv:2012.15460 (2020).
[19]
Fu, Z., Angelini, F., Chambers, J., Naqvi, S.M.: Multilevel cooperative fusion of gm-phd filters for online multiple human tracking. IEEE Transactions on Multimedia 21(9), 2277–2291 (2019).
[20]
Stadler, D., Beyerer, J.: On the performance of crowd specifific detectors in multi-pedestrian tracking. In: 2021 17th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS). pp. 1–12. IEEE (2021).
[21]
Wang, Q., Zheng, Y., Pan, P., Xu, Y.: Multiple object tracking with correlation learning. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. pp. 3876–3886 (2021).
[22]
Cao, J., Weng, X., Khirodkar, R., Pang, J., Kitani, K. Observation- centric sort: Rethinking sort for robust multi-object tracking. arXiv preprint arXiv:2203.14360 (2022).
[23]
Yu, E., Li, Z., Han, S., Wang, H.: Relationtrack: Relation-aware multiple object tracking with decoupled representation. IEEE Transactions on Multimedia pp. 1–1 (2022). https://doi.org/10.1109/TMM.2022.3150169.

Index Terms

  1. Regalignsort: Multi-Object Tracking Based on Location Regression and Local Feature Association

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    CACML '24: Proceedings of the 2024 3rd Asia Conference on Algorithms, Computing and Machine Learning
    March 2024
    478 pages
    ISBN:9798400716416
    DOI:10.1145/3654823
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 29 May 2024

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. DeepSORT
    2. Faster-RCNN
    3. LocalOptReid
    4. MOT
    5. Regressor

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    CACML 2024

    Acceptance Rates

    Overall Acceptance Rate 93 of 241 submissions, 39%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 8
      Total Downloads
    • Downloads (Last 12 months)8
    • Downloads (Last 6 weeks)3
    Reflects downloads up to 20 Jan 2025

    Other Metrics

    Citations

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format.

    HTML Format

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media