Abstract
Vehicle re-identification involves searching for images of identical vehicles across different cameras. For intelligent traffic control, re-identification of vehicles is very important. Convolutional Neural Networks (CNN) have succeeded in re-identification, but CNN-based methods process only one neighbourhood at a time and information is lost during pooling operation. To mitigate this shortcoming of CNN, We have proposed a novel vehicle re-identification framework (Vehicle ReID) based on vision transformer with gradient accumulation. The training images are split into different overlapped patches, and each patch is flattened into a 1D vector. Positional, camera and view embeddings are added to the patch embeddings and given as input to the vision transformer to generate a global feature. After that, this global feature is fed to three branches: ID, colour and type classification. For ID branch, triplet and cross-entropy losses are used. For colour branch and type branch, only cross-entropy loss is used. Gradient accumulation is employed at the training time to accumulate the gradient during each iteration in an epoch, and the neural network weights get updated only when the number of iterations reaches a predefined step size. This allows the model to work like being trained with a greater batch size without upgrading GPUs. To validate the effectiveness of the proposed framework, mean average precision (mAP), Rank-1, and Rank-5 hit rate have been computed on the VeRi dataset.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
He, S., Luo, H., Wang, P., Wang, F., Li, H., Jiang, W.: Transreid: transformer-based object re-identification. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 15013–15022 (2021)
Liu, X., Liu, W., Mei, T., Ma, H.: Provid: progressive and multimodal vehicle reidentification for large-scale urban surveillance. IEEE Trans. Multimed. 20(3), 645–658 (2017). https://doi.org/10.10007/1234567890
Wen, Y., Lu, Y., Yan, J., Zhou, Z., von Deneen, K.M., Shi, P.: An algorithm for license plate recognition applied to intelligent transportation system. IEEE Trans. Intell. Transp. Syst. 12(3), 830–845 (2011). https://doi.org/10.1109/TITS.2011.2114346
Tang, Z., et al.: Pamtri: pose-aware multi-task learning for vehicle re-identification using highly randomized synthetic data. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 211–220 (2019)
Liu, H., Tian, Y., Wang, Y., Pang, L., Huang, T.: Deep relative distance learning: tell the difference between similar vehicles. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2016, pp. 2167–2175 (2016). https://doi.org/10.1109/CVPR.2016.238
Dosovitskiy, A., et al.: An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929 (2020)
Vaswani, A., et al.: Attention is all you need. Advances in neural information processing systems 30 (2017)
Xiong, Z., Li, M., Ma, Y., Xinkai, W.: Vehicle re-identification with image processing and car-following model using multiple surveillance cameras from urban arterials. IEEE Trans. Intell. Transp. Syst. 22(12), 7619–7630 (2020)
Zapletal, D., Herout, A.: Vehicle re-identification for automatic video traffic surveillance. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp. 25–31 (2016)
Sanchez, R.O., Flores, C., Horowitz, R., Rajagopal, R., Varaiya, P.: Arterial travel time estimation based on vehicle re-identification using magnetic sensors: performance analysis. In: 2011 14th International IEEE Conference on Intelligent Transportation Systems (ITSC), pp. 997–1002 (2011). https://doi.org/10.1109/ITSC.2011.6083003
Sun, C.C., Ritchie, S.G., Joyce Tsai, K., Jayakrishnan, R.: Use of vehicle signature analysis and lexicographic optimization for vehicle reidentification on freeways. Transp. Res. Part C-emerging Technol. 7, 167–185 (1999)
Luo, H., Gu, Y., Liao, X., Lai, S., Jiang, W.: Bag of tricks and a strong baseline for deep person re-identification. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2019, pp. 1487–1495 (2019). https://doi.org/10.1109/CVPRW.2019.00190
Li, Y., Liu, K., Jin, Y., Wang, T., Lin, W.: VARID: viewpoint-aware re-identification of vehicle based on triplet loss. IEEE Trans. Intell. Transp. Syst. (2020)
Liu, X., Zhang, S., Wang, X., Hong, R., Tian, Q.: Group-group loss-based global-regional feature learning for vehicle re-identification. IEEE Trans. Image Process. 29, 2638–2652 (2020). https://doi.org/10.1109/TIP.2019.2950796
Zhou, Y., Liu, L., Shao, L.: Vehicle re-identification by deep hidden multi-view inference. IEEE Trans. Image Process. 27(7), 3275–3287 (2018). https://doi.org/10.1109/TIP.2018.2819820
Aslam, N., Rai, P.K., Kolekar, M.H.: A3N: attention-based adversarial autoencoder network for detecting anomalies in video sequence. J. Visual Commun. Image Representation 87, 103598 (2022)
Liu, X., Liu, W., Mei, T., Ma, H.: A deep learning-based approach to progressive vehicle re-identification for urban surveillance. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9906, pp. 869–884. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46475-6_53
Zhou, Y., Shao, L.: Cross-view GAN based vehicle generation for re-identification. In: BMVC, vol. 1, pp. 1–12 (September 2017)
Zhouy, Y., Shao, L.: Viewpoint-aware attentive multi-view inference for vehicle re-identification. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition 2018, pp. 6489–6498 (2018). https://doi.org/10.1109/CVPR.2018.00679
Wang, Z., et al.: Orientation invariant feature embedding and spatial temporal regularization for vehicle re-identification. In: IEEE International Conference on Computer Vision (ICCV) 2017, pp. 379–387 (2017). https://doi.org/10.1109/ICCV.2017.49
Luo, W., Li, Y., Urtasun, R., Zemel, R.: Understanding the effective receptive field in deep convolutional neural networks. Advances in neural information processing systems 29 (2016)
Chu, R., Sun, Y., Li, Y., Liu, Z., Zhang, C., Wei, Y.: Vehicle re-identification with viewpoint-aware metric learning. In: IEEE/CVF International Conference on Computer Vision (ICCV) 2019, pp. 8281–8290 (2019). https://doi.org/10.1109/ICCV.2019.00837
Aslam, N., Kolekar, M.H.: Unsupervised anomalous event detection in videos using spatio-temporal inter-fused autoencoder. Multimedia Tools and Applications, pp. 1–26 (2022)
Chen, T.-S., Liu, C.-T., Wu, C.-W., Chien, S.-Y.: Orientation-aware vehicle re-identification with semantics-guided part attention network. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12347, pp. 330–346. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58536-5_20
Lou, Y., Bai, Y., Liu, J., Wang, S., Duan, L.: VERI-wild: a large dataset and a new method for vehicle re-identification in the wild. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019, pp. 3230–3238 (2019). https://doi.org/10.1109/CVPR.2019.00335
Khorramshahi, P., Kumar, A., Peri, N., Rambhatla, S.S., Chen, J.-C., Chellappa, R.: A dual-path model with adaptive attention for vehicle re-identification. In: IEEE/CVF International Conference on Computer Vision (ICCV) 2019, pp. 6131–6140 (2019). https://doi.org/10.1109/ICCV.2019.00623
Liu, X., Zhang, S., Huang, Q., Gao, W.: RAM: a region-aware deep model for vehicle re-identification. In: IEEE International Conference on Multimedia and Expo (ICME) 2018, pp. 1–6 (2018). https://doi.org/10.1109/ICME.2018.8486589
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Kishore, R., Aslam, N., Kolekar, M.H. (2023). Vehicle ReID: Learning Robust Feature Using Vision Transformer and Gradient Accumulation for Vehicle Re-identification. In: Gupta, D., Bhurchandi, K., Murala, S., Raman, B., Kumar, S. (eds) Computer Vision and Image Processing. CVIP 2022. Communications in Computer and Information Science, vol 1777. Springer, Cham. https://doi.org/10.1007/978-3-031-31417-9_8
Download citation
DOI: https://doi.org/10.1007/978-3-031-31417-9_8
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-31416-2
Online ISBN: 978-3-031-31417-9
eBook Packages: Computer ScienceComputer Science (R0)