Vehicle ReID: Learning Robust Feature Using Vision Transformer and Gradient Accumulation for Vehicle Re-identification

Kishore, Rishi; Aslam, Nazia; Kolekar, Maheshkumar H.

doi:10.1007/978-3-031-31417-9_8

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1777))

Included in the following conference series:

International Conference on Computer Vision and Image Processing

453 Accesses
1 Citations

Abstract

Vehicle re-identification involves searching for images of identical vehicles across different cameras. For intelligent traffic control, re-identification of vehicles is very important. Convolutional Neural Networks (CNN) have succeeded in re-identification, but CNN-based methods process only one neighbourhood at a time and information is lost during pooling operation. To mitigate this shortcoming of CNN, We have proposed a novel vehicle re-identification framework (Vehicle ReID) based on vision transformer with gradient accumulation. The training images are split into different overlapped patches, and each patch is flattened into a 1D vector. Positional, camera and view embeddings are added to the patch embeddings and given as input to the vision transformer to generate a global feature. After that, this global feature is fed to three branches: ID, colour and type classification. For ID branch, triplet and cross-entropy losses are used. For colour branch and type branch, only cross-entropy loss is used. Gradient accumulation is employed at the training time to accumulate the gradient during each iteration in an epoch, and the neural network weights get updated only when the number of iterations reaches a predefined step size. This allows the model to work like being trained with a greater batch size without upgrading GPUs. To validate the effectiveness of the proposed framework, mean average precision (mAP), Rank-1, and Rank-5 hit rate have been computed on the VeRi dataset.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 149.00; Price excludes VAT (USA)

Softcover Book: USD 199.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

He, S., Luo, H., Wang, P., Wang, F., Li, H., Jiang, W.: Transreid: transformer-based object re-identification. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 15013–15022 (2021)
Google Scholar
Liu, X., Liu, W., Mei, T., Ma, H.: Provid: progressive and multimodal vehicle reidentification for large-scale urban surveillance. IEEE Trans. Multimed. 20(3), 645–658 (2017). https://doi.org/10.10007/1234567890
Wen, Y., Lu, Y., Yan, J., Zhou, Z., von Deneen, K.M., Shi, P.: An algorithm for license plate recognition applied to intelligent transportation system. IEEE Trans. Intell. Transp. Syst. 12(3), 830–845 (2011). https://doi.org/10.1109/TITS.2011.2114346
Article Google Scholar
Tang, Z., et al.: Pamtri: pose-aware multi-task learning for vehicle re-identification using highly randomized synthetic data. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 211–220 (2019)
Google Scholar
Liu, H., Tian, Y., Wang, Y., Pang, L., Huang, T.: Deep relative distance learning: tell the difference between similar vehicles. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2016, pp. 2167–2175 (2016). https://doi.org/10.1109/CVPR.2016.238
Dosovitskiy, A., et al.: An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929 (2020)
Vaswani, A., et al.: Attention is all you need. Advances in neural information processing systems 30 (2017)
Google Scholar
Xiong, Z., Li, M., Ma, Y., Xinkai, W.: Vehicle re-identification with image processing and car-following model using multiple surveillance cameras from urban arterials. IEEE Trans. Intell. Transp. Syst. 22(12), 7619–7630 (2020)
Article Google Scholar
Zapletal, D., Herout, A.: Vehicle re-identification for automatic video traffic surveillance. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp. 25–31 (2016)
Google Scholar
Sanchez, R.O., Flores, C., Horowitz, R., Rajagopal, R., Varaiya, P.: Arterial travel time estimation based on vehicle re-identification using magnetic sensors: performance analysis. In: 2011 14th International IEEE Conference on Intelligent Transportation Systems (ITSC), pp. 997–1002 (2011). https://doi.org/10.1109/ITSC.2011.6083003
Sun, C.C., Ritchie, S.G., Joyce Tsai, K., Jayakrishnan, R.: Use of vehicle signature analysis and lexicographic optimization for vehicle reidentification on freeways. Transp. Res. Part C-emerging Technol. 7, 167–185 (1999)
Google Scholar
Luo, H., Gu, Y., Liao, X., Lai, S., Jiang, W.: Bag of tricks and a strong baseline for deep person re-identification. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2019, pp. 1487–1495 (2019). https://doi.org/10.1109/CVPRW.2019.00190
Li, Y., Liu, K., Jin, Y., Wang, T., Lin, W.: VARID: viewpoint-aware re-identification of vehicle based on triplet loss. IEEE Trans. Intell. Transp. Syst. (2020)
Google Scholar
Liu, X., Zhang, S., Wang, X., Hong, R., Tian, Q.: Group-group loss-based global-regional feature learning for vehicle re-identification. IEEE Trans. Image Process. 29, 2638–2652 (2020). https://doi.org/10.1109/TIP.2019.2950796
Article MATH Google Scholar
Zhou, Y., Liu, L., Shao, L.: Vehicle re-identification by deep hidden multi-view inference. IEEE Trans. Image Process. 27(7), 3275–3287 (2018). https://doi.org/10.1109/TIP.2018.2819820
Article MathSciNet Google Scholar
Aslam, N., Rai, P.K., Kolekar, M.H.: A3N: attention-based adversarial autoencoder network for detecting anomalies in video sequence. J. Visual Commun. Image Representation 87, 103598 (2022)
Google Scholar
Liu, X., Liu, W., Mei, T., Ma, H.: A deep learning-based approach to progressive vehicle re-identification for urban surveillance. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9906, pp. 869–884. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46475-6_53
Chapter Google Scholar
Zhou, Y., Shao, L.: Cross-view GAN based vehicle generation for re-identification. In: BMVC, vol. 1, pp. 1–12 (September 2017)
Google Scholar
Zhouy, Y., Shao, L.: Viewpoint-aware attentive multi-view inference for vehicle re-identification. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition 2018, pp. 6489–6498 (2018). https://doi.org/10.1109/CVPR.2018.00679
Wang, Z., et al.: Orientation invariant feature embedding and spatial temporal regularization for vehicle re-identification. In: IEEE International Conference on Computer Vision (ICCV) 2017, pp. 379–387 (2017). https://doi.org/10.1109/ICCV.2017.49
Luo, W., Li, Y., Urtasun, R., Zemel, R.: Understanding the effective receptive field in deep convolutional neural networks. Advances in neural information processing systems 29 (2016)
Google Scholar
Chu, R., Sun, Y., Li, Y., Liu, Z., Zhang, C., Wei, Y.: Vehicle re-identification with viewpoint-aware metric learning. In: IEEE/CVF International Conference on Computer Vision (ICCV) 2019, pp. 8281–8290 (2019). https://doi.org/10.1109/ICCV.2019.00837
Aslam, N., Kolekar, M.H.: Unsupervised anomalous event detection in videos using spatio-temporal inter-fused autoencoder. Multimedia Tools and Applications, pp. 1–26 (2022)
Google Scholar
Chen, T.-S., Liu, C.-T., Wu, C.-W., Chien, S.-Y.: Orientation-aware vehicle re-identification with semantics-guided part attention network. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12347, pp. 330–346. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58536-5_20
Chapter Google Scholar
Lou, Y., Bai, Y., Liu, J., Wang, S., Duan, L.: VERI-wild: a large dataset and a new method for vehicle re-identification in the wild. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019, pp. 3230–3238 (2019). https://doi.org/10.1109/CVPR.2019.00335
Khorramshahi, P., Kumar, A., Peri, N., Rambhatla, S.S., Chen, J.-C., Chellappa, R.: A dual-path model with adaptive attention for vehicle re-identification. In: IEEE/CVF International Conference on Computer Vision (ICCV) 2019, pp. 6131–6140 (2019). https://doi.org/10.1109/ICCV.2019.00623
Liu, X., Zhang, S., Huang, Q., Gao, W.: RAM: a region-aware deep model for vehicle re-identification. In: IEEE International Conference on Multimedia and Expo (ICME) 2018, pp. 1–6 (2018). https://doi.org/10.1109/ICME.2018.8486589

Download references

Author information

Authors and Affiliations

Video Surveillance Lab, Department of Electrical Engineering, Indian Institute of Technology Patna, Daulatpur, Bihar, India
Rishi Kishore & Nazia Aslam
Department of Electrical Engineering, Indian Institute of Technology Patna, Daulatpur, Bihar, India
Maheshkumar H. Kolekar

Authors

Rishi Kishore
View author publications
You can also search for this author in PubMed Google Scholar
Nazia Aslam
View author publications
You can also search for this author in PubMed Google Scholar
Maheshkumar H. Kolekar
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Nazia Aslam .

Editor information

Editors and Affiliations

Visvesvaraya National Institute of Technology Nagpur, Nagpur, India
Deep Gupta
Visvesvaraya National Institute of Technology Nagpur, Nagpur, India
Kishor Bhurchandi
Indian Institute of Technology Ropar, Rupnagar, India
Subrahmanyam Murala
Indian Institute of Technology Roorkee, Roorkee, India
Balasubramanian Raman
Indian Institute of Technology Roorkee, Roorkee, India
Sanjeev Kumar

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kishore, R., Aslam, N., Kolekar, M.H. (2023). Vehicle ReID: Learning Robust Feature Using Vision Transformer and Gradient Accumulation for Vehicle Re-identification. In: Gupta, D., Bhurchandi, K., Murala, S., Raman, B., Kumar, S. (eds) Computer Vision and Image Processing. CVIP 2022. Communications in Computer and Information Science, vol 1777. Springer, Cham. https://doi.org/10.1007/978-3-031-31417-9_8

Download citation

DOI: https://doi.org/10.1007/978-3-031-31417-9_8
Published: 07 May 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-31416-2
Online ISBN: 978-3-031-31417-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Vehicle ReID: Learning Robust Feature Using Vision Transformer and Gradient Accumulation for Vehicle Re-identification