A generic MOT boosting framework by combining cues from SOT, tracklet and re-identification

Liang, Tianyi; Lan, Long; Zhang, Xiang; Luo, Zhigang

doi:10.1007/s10115-021-01576-2

A generic MOT boosting framework by combining cues from SOT, tracklet and re-identification

Regular Paper
Published: 23 June 2021

Volume 63, pages 2109–2127, (2021)
Cite this article

Knowledge and Information Systems Aims and scope Submit manuscript

Tianyi Liang¹,
Long Lan ORCID: orcid.org/0000-0002-4238-8985^1,2,3,
Xiang Zhang² &
…
Zhigang Luo¹

456 Accesses
1 Altmetric
Explore all metrics

Abstract

In this paper, we propose a generic boosting framework for multiple object tracking (MOT). Unlike other works tracking objects from zero, our framework uses their results (tracklets) and makes further optimizations. The motivation of us derives from the observation that most modern MOT trackers have been acceptable performance and can yield relatively reliable tracklets; accordingly, we straight focus on the tracklet-level re-identification, which is the most challenging issue in this case. To achieve that goal, we simultaneously utilize the techniques of single object tracking, tracking fragment (tracklets) and re-identification mechanism through casting them into a multi-label energy optimization and then innovatively solving it using the $\alpha -$expansion with label costs algorithm. All these techniques inspire recent MOT a lot to mitigate the occlusion problem, but to our knowledge, by far few works explore to reasonably combine them all like us. Furthermore, we introduce a spatial attention to improve the appearance model and a hierarchical clustering as post-process to progressively improve the tracking consistency. Finally, testing results on the most used benchmarks demonstrate the significant effectiveness and generality of our framework, and the importance of each contribution is also verified through ablative studies.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

BoostTrack: boosting the similarity measure and detection confidence for improved multiple object tracking

Article Open access 12 April 2024

EnhanceCenter for improving point based tracking and rich feature representation

Article Open access 03 March 2025

Online Multi-object Tracking Using Single Object Tracker and Markov Clustering

Notes

Codes are available at: https://vision.cs.uwaterloo.ca/files/gco-v3.0.zip.
https://github.com/xstgavin/amilan-motchallenge-devkit.

References

Ajaeiya GA, Elhajj IH, Chehab A, Kayssi AI, Kneppers M (2018) Mobile apps identification based on network flows. Knowl Inf Syst 55(3):771–796
Article Google Scholar
Babaee M, Athar A, Rigoll G (2018) Multiple people tracking using hierarchical deep tracklet re-identification. arXiv preprint arXiv:1811.04091
Bergmann P, Meinhardt T, Leal-Taixe L (2019) Tracking without bells and whistles. In: 2019 IEEE/CVF international conference on computer vision (ICCV), pp 941–951
Bernardin K, Stiefelhagen R (2008) Evaluating multiple object tracking performance: the clear mot metrics. Eurasip J Image and Video Process 1:1–10
Article Google Scholar
Bewley A, Ge Z, Ott L, Ramos F, Upcroft B (2016) Simple online and realtime tracking. In: 2016 IEEE international conference on image processing (ICIP), pp 3464–3468
Bochinski E, Eiselein V, Sikora T (2017) High-speed tracking-by-detection without using image information. In: 2017 14th IEEE international conference on advanced video and signal based surveillance (AVSS), pp 1–6
Boykov Y, Veksler O, Zabih R (2001) Fast approximate energy minimization via graph cuts. IEEE Trans Pattern Anal Mach Intell 23(11):1222–1239
Article Google Scholar
Chen J, Sheng H, Zhang Y, Xiong Z (2017) Enhancing detection model for multiple hypothesis tracking. In: CVPR workshops pp 2143–2152
Chen L, Ai H, Zhuang Z, Shang C (2018) Real-time multiple people tracking with deeply learned candidate selection and person re-identification. In: 2018 IEEE international conference on multimedia and expo (ICME), pp 1–6
Chu Q, Ouyang W, Li H, Wang X, Liu B, Yu N (2017) Online multi-object tracking using CNN-based single object tracker with spatial-temporal attention mechanism. In: Proceedings of the IEEE international conference on computer vision, pp 4846–4855
Chu Q, Ouyang W, Liu B, Zhu F, Yu N (2020) Dasot: A unified framework integrating data association and single object tracking for online multi-object tracking. In: AAAI 2020: the thirty-fourth AAAI conference on artificial intelligence, vol 34, pp 10672–10679
Dai J, Li Y, He K, Sun J (2016) R-fcn: Object detection via region-based fully convolutional networks. In: Proceedings of the 30th international conference on neural information processing systems, pp 379–387
Delong A, Osokin A, Isack HN, Boykov Y (2012) Fast approximate energy minimization with label costs. Int J Comput Vision 96(1):1–27
Article MathSciNet Google Scholar
Felzenszwalb FP, Girshick BR, McAllester D, Ramanan D (2010) Object detection with discriminatively trained part-based models. IEEE Trans Pattern Anal Mach Intell pp 1627–1645
Feng W, Hu Z, Wu W, Yan J, Ouyang W (2019) Multi-object tracking with multiple cues and switcher-aware classification. arXiv preprint arXiv:1901.06129
Feng W, Lan L, Luo Y, Yu Y, Zhang X, Luo Z (2020a) Near-online multi-pedestrian tracking via combining multiple consistent appearance cues. IEEE Trans Circuits Syst Video Technol pp 1–14
Feng W, Lan L, Zhang X, Luo Z (2020b) Learning sequence-to-sequence affinity metric for near-online multi-object tracking. Knowl Inf Syst 62(10):3911–3930
Article Google Scholar
Fu Z, Angelini F, Chambers J, Naqvi MS (2019) Multi-level cooperative fusion of gm-phd filters for online multiple human tracking. IEEE Transactions on Multimedia pp 1–14
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: 2016 IEEE conference on computer vision and pattern recognition (CVPR), pp 770–778
He K, Gkioxari G, Dollar P, Girshick R (2020) Mask r-cnn. IEEE Trans Pattern Anal Mach Intell 42(2):386–397
Article Google Scholar
Jaderberg M, Simonyan K, Zisserman A, Kavukcuoglu K (2015) Spatial transformer networks. In: NIPS’15 proceedings of the 28th international conference on neural information processing systems, pp 2017–2025
Kim C, Li F, Rehg MJ (2018) Multi-object tracking with neural gating using bilinear lstm. In: ECCV pp 208–224
Lan L, Tao D, Gong C, Guan N, Luo Z (2016) Online multi-object tracking by quadratic pseudo-boolean optimization. In: IJCAI’16 proceedings of the twenty-fifth international joint conference on artificial intelligence, pp 3396–3402
Lan L, Wang X, Hua G, Huang ST, Tao D (2020) Semi-online multi-people tracking by re-identification. Int J Comput Vis pp 1–19
Leal-Taixe L, Canton-Ferrer C, Schindler K (2016) Learning by tracking: Siamese cnn for robust target association. In: 2016 IEEE conference on computer vision and pattern recognition workshops (CVPRW), vol 1, pp 418–425
Li B, Yan J, Wu W, Zhu Z, Hu X (2018) High performance visual tracking with siamese region proposal network. In: CVPR, pp 8971–8980
Li W, Zhao R, Xiao T, Wang X (2014) Deepreid: Deep filter pairing neural network for person re-identification. In: CVPR ’14 proceedings of the 2014 IEEE conference on computer vision and pattern recognition, pp 152–159
Lin TY, Dollar P, Girshick R, He K, Hariharan B, Belongie S (2017) Feature pyramid networks for object detection. In: 2017 IEEE conference on computer vision and pattern recognition (CVPR), pp 936–944
Liu H, Cocea M (2019a) Granular computing-based approach of rule learning for binary classification. Granul Comput 4(2):275–283
Article Google Scholar
Liu H, Cocea M (2019b) Nature-inspired framework of ensemble learning for collaborative classification in granular computing context. Granul Comput 4(4):715–724
Article Google Scholar
Liu W, Anguelov D, Erhan D, Szegedy C, Reed SE, Fu CY, Berg AC (2016) Ssd: Single shot multibox detector. In: European conference on computer vision pp 21–37
Long L, Wang X, Zhang S, Tao D, Wen G, Huang TS (2018) Interacting tracklets for multi-object tracking. IEEE Trans Image Process 27(9):4585–4597
Article MathSciNet Google Scholar
Lukezic A, Vojir T, Zajc LC, Matas J, Kristan M (2017) Discriminative correlation filter with channel and spatial reliability. In: 2017 IEEE conference on computer vision and pattern recognition (CVPR), pp 4847–4856
Milan A, Leal-Taixé L, Reid ID, Roth S, Schindler K (2016) Mot16: A benchmark for multi-object tracking. arXiv preprint arXiv:1603.00831
Milan A, Rezatofighi HS, Dick A, Reid I, Schindler K (2016) Online multi-target tracking using recurrent neural networks. In: National conference on artificial intelligence, 4225–4232
Ren S, He K, Girshick BR, Sun J (2017) Faster r-cnn: Towards real-time object detection with region proposal networks. IEEE Trans Pattern Anal Mach Intell pp 1137–1149
Sadeghian A, Alahi A, Savarese S (2017) Tracking the untrackable: Learning to track multiple cues with long-term dependencies. In: 2017 IEEE international conference on computer vision (ICCV), pp 300–311
Schulter S, Vernaza P, Choi W, Chandraker M (2017) Deep network flow for multi-object tracking. In: 2017 IEEE conference on computer vision and pattern recognition (CVPR), pp 2730–2739
Shen H, Huang L, Huang C, Xu W (2018) Tracklet association tracker: an end-to-end learning-based association approach for multi-object tracking. arXiv preprint arXiv:1808.01562
Soh CW, Njilla LL, Kwiat KK, Kamhoua CA (2020) Learning quasi-identifiers for privacy-preserving exchanges: a rough set theory approach. Granul Comput 5(1):1–14
Article Google Scholar
Son J, Baek M, Cho M, Han B (2017) Multi-object tracking with quadruplet convolutional neural networks. In: 2017 IEEE conference on computer vision and pattern recognition (CVPR), pp 3786–3795
Sun S, Akhtar N, Song H, Mian AS, Shah M (2019) Deep affinity network for multiple object tracking. IEEE Trans Pattern Anal Mach Intell 43:104–119
Google Scholar
Sun Y, Zheng L, Yang Y, Tian Q, Wang S (2018) Beyond part models: person retrieval with refined part pooling (and a strong convolutional baseline). In: ECCV pp 501–518
Tang S, Andres B, Andriluka M, Schiele B (2016) Multi-person tracking by multicut and deep matching. In: European conference on computer vision, W11 benchmarking multi-target tracking: MOTChallenge, pp 100–111
Tang S, Andriluka M, Andres B, Schiele B (2017) Multiple people tracking by lifted multicut and person re-identification. In: 2017 IEEE conference on computer vision and pattern recognition (CVPR), pp 3701–3710
Wang S, Zeng Y, Liu X, Zhu E, Yin J, Xu C, Kloft M (2019) Effective end-to-end unsupervised outlier detection via inlier priority of discriminative network. In: NeurIPS pp 5960–5973
Wu J, Hong Z, Pan S, Zhu X, Cai Z, Zhang C (2016) Multi-graph-view subgraph mining for graph classification. Knowl Inf Syst 48(1):29–54
Article Google Scholar
Yang F, Choi W, Lin Y (2016) Exploit all the layers: Fast and accurate cnn object detector with scale dependent pooling and cascaded rejection classifiers. In: CVPR pp 2129–2137
Zheng L, Shen L, Tian L, Wang S, Wang J, Tian Q (2015) Scalable person re-identification: A benchmark. In: 2015 IEEE international conference on computer vision (ICCV), pp 1116–1124
Zhu J, Yang H, Liu N, Kim M, Zhang W, Yang MH (2018) Online multi-object tracking with dual matching attention networks. In: Proceedings of the European conference on computer vision (ECCV), pp 379–396

Download references

Acknowledgements

This work was partially supported by the National Natural Science Foundation of China (No. 61906210).

Author information

Authors and Affiliations

Science and Technology on Parallel and Distributed Processing, National University of Defense Technology, Changsha, 410073, China
Tianyi Liang, Long Lan & Zhigang Luo
Institute for Quantum Information & State Key Laboratory of High Performance Computing, National University of Defense Technology, Changsha, 410073, China
Long Lan & Xiang Zhang
Peng cheng Laboratory, Shenzhen, China
Long Lan

Authors

Tianyi Liang
View author publications
You can also search for this author inPubMed Google Scholar
Long Lan
View author publications
You can also search for this author inPubMed Google Scholar
Xiang Zhang
View author publications
You can also search for this author inPubMed Google Scholar
Zhigang Luo
View author publications
You can also search for this author inPubMed Google Scholar

Corresponding author

Correspondence to Long Lan.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Liang, T., Lan, L., Zhang, X. et al. A generic MOT boosting framework by combining cues from SOT, tracklet and re-identification. Knowl Inf Syst 63, 2109–2127 (2021). https://doi.org/10.1007/s10115-021-01576-2

Download citation

Received: 25 May 2020
Revised: 22 April 2021
Accepted: 26 April 2021
Published: 23 June 2021
Issue Date: August 2021
DOI: https://doi.org/10.1007/s10115-021-01576-2

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A generic MOT boosting framework by combining cues from SOT, tracklet and re-identification

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

BoostTrack: boosting the similarity measure and detection confidence for improved multiple object tracking

EnhanceCenter for improving point based tracking and rich feature representation

Online Multi-object Tracking Using Single Object Tracker and Markov Clustering

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now