Multiple Object Tracking Based on Temporal Local Slice Representation of Sub-regions

Mannepalli, Sripranav; Satti, Ravi Venkata Sai Maheswara Reddy; Shakya, Rohit; Yeturu, Kalidas

doi:10.1007/978-3-031-31417-9_53

Part of the book series: Communications in Computer and Information Science (CCIS, volume 1777)

Included in the following conference series:

International Conference on Computer Vision and Image Processing

Abstract

Multiple object tracking (MOT) involves consistent labeling of objects in a given scene. A scene consists of multiple frames, and within each frame rectangular subregions are specified as objects of interest. The task is to label the same object across frames with the same identifier. Challenges in this setting include changes in the posture of the object, mild background changes in the object region, occlusion, lighting changes, speed of movement, and other such critical parameters. MOT is important because of its applications in mobile robots, autonomous driving, and video surveillance analysis. There are a number of neural-network-based methods that add modules based on a property of interest, such as a sub-network for velocity, a network for physical motion characteristics, and networks based on pixel and edge information. However, they have difficulty dealing with long-duration occlusions and suffer from generalization issues due to millions of parameters and implicit overfitting. We present a new idea called Temporal Local Slicing (TLS), which obtains local information across frames for a given subregion in the object vectorization step. The vectorization involves histograms of pixel intensities for the red, blue, and green channels of the subregion. We performed a total of five experiments and observed the effectiveness of TLS, and also of a new idea, Gossip vectorization, in multiple object tracking. The object recognition accuracy of TLS vectors is 99.5%, with an mAP score of 99.1%, on a train and test partition of a video scene. However, the MOT-specific scores are MOTA 56%, IDF1 72%, Recall 56.7%, Precision 98.5%, and LOCA 91.9%. These are non-trivial scores, indicating potential value in the idea of TLS vectorization.
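
The vectorization step described in the abstract can be illustrated with a minimal sketch. The abstract states only that the vector is built from per-channel intensity histograms of a subregion, using local information across frames; everything else here is an assumption made for illustration, not the authors' implementation. In particular, the function names (`channel_histograms`, `tls_vector`), the bin count, the temporal window `radius`, and the choice to average histograms over the window (rather than, say, concatenating them) are hypothetical.

```python
import numpy as np


def channel_histograms(patch, bins=16):
    """Concatenate normalized intensity histograms of the three color channels.

    patch: H x W x 3 uint8 array (the rectangular subregion of one frame).
    The bin count is an illustrative assumption.
    """
    feats = []
    for c in range(3):
        hist, _ = np.histogram(patch[..., c], bins=bins, range=(0, 256))
        # Normalize so the descriptor does not depend on the box size.
        feats.append(hist / max(hist.sum(), 1))
    return np.concatenate(feats)


def tls_vector(frames, box, t, radius=1, bins=16):
    """Hypothetical temporal-slice descriptor for one subregion.

    Averages the histogram vectors of the same box over frames
    t - radius .. t + radius (clipped to the sequence bounds).
    Averaging is an assumption; the abstract does not specify
    how the per-frame information is combined.
    """
    x, y, w, h = box
    lo = max(0, t - radius)
    hi = min(len(frames) - 1, t + radius)
    vecs = [
        channel_histograms(frames[k][y:y + h, x:x + w], bins)
        for k in range(lo, hi + 1)
    ]
    return np.mean(vecs, axis=0)


# Usage: three synthetic frames and one 4 x 4 box given as (x, y, w, h).
frames = [np.full((10, 10, 3), 100, dtype=np.uint8) for _ in range(3)]
vec = tls_vector(frames, box=(2, 2, 4, 4), t=1)
print(vec.shape)
```

Vectors of this kind could then be passed to any standard classifier or distance-based matcher to assign identifiers across frames; which one the paper uses is not stated in the abstract.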

Author information

Authors and Affiliations

Indian Institute of Technology Tirupati, Tirupati, India
Sripranav Mannepalli, Ravi Venkata Sai Maheswara Reddy Satti, Rohit Shakya & Kalidas Yeturu

Corresponding author

Correspondence to Sripranav Mannepalli.

Editor information

Editors and Affiliations

Visvesvaraya National Institute of Technology Nagpur, Nagpur, India
Deep Gupta
Visvesvaraya National Institute of Technology Nagpur, Nagpur, India
Kishor Bhurchandi
Indian Institute of Technology Ropar, Rupnagar, India
Subrahmanyam Murala
Indian Institute of Technology Roorkee, Roorkee, India
Balasubramanian Raman
Indian Institute of Technology Roorkee, Roorkee, India
Sanjeev Kumar

1 Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (pdf 210 KB)

About this paper

Cite this paper

Mannepalli, S., Satti, R.V.S.M.R., Shakya, R., Yeturu, K. (2023). Multiple Object Tracking Based on Temporal Local Slice Representation of Sub-regions. In: Gupta, D., Bhurchandi, K., Murala, S., Raman, B., Kumar, S. (eds) Computer Vision and Image Processing. CVIP 2022. Communications in Computer and Information Science, vol 1777. Springer, Cham. https://doi.org/10.1007/978-3-031-31417-9_53

DOI: https://doi.org/10.1007/978-3-031-31417-9_53
Published: 07 May 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-31416-2
Online ISBN: 978-3-031-31417-9
eBook Packages: Computer Science (R0)