Skip to main content

Vehicle ReID: Learning Robust Feature Using Vision Transformer and Gradient Accumulation for Vehicle Re-identification

  • Conference paper
  • First Online:
Computer Vision and Image Processing (CVIP 2022)

Abstract

Vehicle re-identification involves searching for images of identical vehicles across different cameras. For intelligent traffic control, re-identification of vehicles is very important. Convolutional Neural Networks (CNN) have succeeded in re-identification, but CNN-based methods process only one neighbourhood at a time and information is lost during pooling operation. To mitigate this shortcoming of CNN, We have proposed a novel vehicle re-identification framework (Vehicle ReID) based on vision transformer with gradient accumulation. The training images are split into different overlapped patches, and each patch is flattened into a 1D vector. Positional, camera and view embeddings are added to the patch embeddings and given as input to the vision transformer to generate a global feature. After that, this global feature is fed to three branches: ID, colour and type classification. For ID branch, triplet and cross-entropy losses are used. For colour branch and type branch, only cross-entropy loss is used. Gradient accumulation is employed at the training time to accumulate the gradient during each iteration in an epoch, and the neural network weights get updated only when the number of iterations reaches a predefined step size. This allows the model to work like being trained with a greater batch size without upgrading GPUs. To validate the effectiveness of the proposed framework, mean average precision (mAP), Rank-1, and Rank-5 hit rate have been computed on the VeRi dataset.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 149.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 199.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. He, S., Luo, H., Wang, P., Wang, F., Li, H., Jiang, W.: Transreid: transformer-based object re-identification. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 15013–15022 (2021)

    Google Scholar 

  2. Liu, X., Liu, W., Mei, T., Ma, H.: Provid: progressive and multimodal vehicle reidentification for large-scale urban surveillance. IEEE Trans. Multimed. 20(3), 645–658 (2017). https://doi.org/10.10007/1234567890

  3. Wen, Y., Lu, Y., Yan, J., Zhou, Z., von Deneen, K.M., Shi, P.: An algorithm for license plate recognition applied to intelligent transportation system. IEEE Trans. Intell. Transp. Syst. 12(3), 830–845 (2011). https://doi.org/10.1109/TITS.2011.2114346

    Article  Google Scholar 

  4. Tang, Z., et al.: Pamtri: pose-aware multi-task learning for vehicle re-identification using highly randomized synthetic data. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 211–220 (2019)

    Google Scholar 

  5. Liu, H., Tian, Y., Wang, Y., Pang, L., Huang, T.: Deep relative distance learning: tell the difference between similar vehicles. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2016, pp. 2167–2175 (2016). https://doi.org/10.1109/CVPR.2016.238

  6. Dosovitskiy, A., et al.: An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929 (2020)

  7. Vaswani, A., et al.: Attention is all you need. Advances in neural information processing systems 30 (2017)

    Google Scholar 

  8. Xiong, Z., Li, M., Ma, Y., Xinkai, W.: Vehicle re-identification with image processing and car-following model using multiple surveillance cameras from urban arterials. IEEE Trans. Intell. Transp. Syst. 22(12), 7619–7630 (2020)

    Article  Google Scholar 

  9. Zapletal, D., Herout, A.: Vehicle re-identification for automatic video traffic surveillance. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp. 25–31 (2016)

    Google Scholar 

  10. Sanchez, R.O., Flores, C., Horowitz, R., Rajagopal, R., Varaiya, P.: Arterial travel time estimation based on vehicle re-identification using magnetic sensors: performance analysis. In: 2011 14th International IEEE Conference on Intelligent Transportation Systems (ITSC), pp. 997–1002 (2011). https://doi.org/10.1109/ITSC.2011.6083003

  11. Sun, C.C., Ritchie, S.G., Joyce Tsai, K., Jayakrishnan, R.: Use of vehicle signature analysis and lexicographic optimization for vehicle reidentification on freeways. Transp. Res. Part C-emerging Technol. 7, 167–185 (1999)

    Google Scholar 

  12. Luo, H., Gu, Y., Liao, X., Lai, S., Jiang, W.: Bag of tricks and a strong baseline for deep person re-identification. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2019, pp. 1487–1495 (2019). https://doi.org/10.1109/CVPRW.2019.00190

  13. Li, Y., Liu, K., Jin, Y., Wang, T., Lin, W.: VARID: viewpoint-aware re-identification of vehicle based on triplet loss. IEEE Trans. Intell. Transp. Syst. (2020)

    Google Scholar 

  14. Liu, X., Zhang, S., Wang, X., Hong, R., Tian, Q.: Group-group loss-based global-regional feature learning for vehicle re-identification. IEEE Trans. Image Process. 29, 2638–2652 (2020). https://doi.org/10.1109/TIP.2019.2950796

    Article  MATH  Google Scholar 

  15. Zhou, Y., Liu, L., Shao, L.: Vehicle re-identification by deep hidden multi-view inference. IEEE Trans. Image Process. 27(7), 3275–3287 (2018). https://doi.org/10.1109/TIP.2018.2819820

    Article  MathSciNet  Google Scholar 

  16. Aslam, N., Rai, P.K., Kolekar, M.H.: A3N: attention-based adversarial autoencoder network for detecting anomalies in video sequence. J. Visual Commun. Image Representation 87, 103598 (2022)

    Google Scholar 

  17. Liu, X., Liu, W., Mei, T., Ma, H.: A deep learning-based approach to progressive vehicle re-identification for urban surveillance. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9906, pp. 869–884. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46475-6_53

    Chapter  Google Scholar 

  18. Zhou, Y., Shao, L.: Cross-view GAN based vehicle generation for re-identification. In: BMVC, vol. 1, pp. 1–12 (September 2017)

    Google Scholar 

  19. Zhouy, Y., Shao, L.: Viewpoint-aware attentive multi-view inference for vehicle re-identification. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition 2018, pp. 6489–6498 (2018). https://doi.org/10.1109/CVPR.2018.00679

  20. Wang, Z., et al.: Orientation invariant feature embedding and spatial temporal regularization for vehicle re-identification. In: IEEE International Conference on Computer Vision (ICCV) 2017, pp. 379–387 (2017). https://doi.org/10.1109/ICCV.2017.49

  21. Luo, W., Li, Y., Urtasun, R., Zemel, R.: Understanding the effective receptive field in deep convolutional neural networks. Advances in neural information processing systems 29 (2016)

    Google Scholar 

  22. Chu, R., Sun, Y., Li, Y., Liu, Z., Zhang, C., Wei, Y.: Vehicle re-identification with viewpoint-aware metric learning. In: IEEE/CVF International Conference on Computer Vision (ICCV) 2019, pp. 8281–8290 (2019). https://doi.org/10.1109/ICCV.2019.00837

  23. Aslam, N., Kolekar, M.H.: Unsupervised anomalous event detection in videos using spatio-temporal inter-fused autoencoder. Multimedia Tools and Applications, pp. 1–26 (2022)

    Google Scholar 

  24. Chen, T.-S., Liu, C.-T., Wu, C.-W., Chien, S.-Y.: Orientation-aware vehicle re-identification with semantics-guided part attention network. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12347, pp. 330–346. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58536-5_20

    Chapter  Google Scholar 

  25. Lou, Y., Bai, Y., Liu, J., Wang, S., Duan, L.: VERI-wild: a large dataset and a new method for vehicle re-identification in the wild. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019, pp. 3230–3238 (2019). https://doi.org/10.1109/CVPR.2019.00335

  26. Khorramshahi, P., Kumar, A., Peri, N., Rambhatla, S.S., Chen, J.-C., Chellappa, R.: A dual-path model with adaptive attention for vehicle re-identification. In: IEEE/CVF International Conference on Computer Vision (ICCV) 2019, pp. 6131–6140 (2019). https://doi.org/10.1109/ICCV.2019.00623

  27. Liu, X., Zhang, S., Huang, Q., Gao, W.: RAM: a region-aware deep model for vehicle re-identification. In: IEEE International Conference on Multimedia and Expo (ICME) 2018, pp. 1–6 (2018). https://doi.org/10.1109/ICME.2018.8486589

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Nazia Aslam .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Kishore, R., Aslam, N., Kolekar, M.H. (2023). Vehicle ReID: Learning Robust Feature Using Vision Transformer and Gradient Accumulation for Vehicle Re-identification. In: Gupta, D., Bhurchandi, K., Murala, S., Raman, B., Kumar, S. (eds) Computer Vision and Image Processing. CVIP 2022. Communications in Computer and Information Science, vol 1777. Springer, Cham. https://doi.org/10.1007/978-3-031-31417-9_8

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-31417-9_8

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-31416-2

  • Online ISBN: 978-3-031-31417-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics