Skip to main content

Advertisement

Log in

Frequency transformer with local feature enhancement for improved vehicle re-identification

  • Published:
The Journal of Supercomputing Aims and scope Submit manuscript

Abstract

With the rapid development of intelligent security systems, the demand for vehicle re-identification has surged exponentially. Vehicle re-identification involves recognizing the same vehicle across different camera perspectives, necessitating robust local feature processing. While transformers have shown promising results in this field, their inherent self-attention mechanism tends to dilute high-frequency texture details, hindering local feature extraction. Additionally, challenges such as occlusion and misalignment can lead to information loss and noise introduction, reducing re-identification accuracy. To address these issues, we introduce the frequency transformer with local feature enhancement (LFFT). The proposed framework comprises a frequency layer and a jigsaw select patches module (JSPM). The frequency layer enhances the weights of high-frequency component features using fast Fourier transform to improve local feature extraction at the lower layers. Meanwhile, the attention layer at the higher layers continues to extract global features. The JSPM incorporates discriminative patches obtained from attention layers into randomly shuffled and reorganized groups, enhancing the global discriminative capability of local features. The method does not utilize additional information or auxiliary networks. Experimental evaluations on two vehicle re-identification datasets, VeRi-776 and VehicleID, demonstrate the effectiveness of our method compared to recent approaches. The code is available at https://github.com/xianghlin/LFFT, accompanied by detailed usage instructions.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

Data availibility

No datasets were generated or analyzed during the current study.

References

  1. Guo H, Zhu K, Tang M, Wang J (2019) Two-level attention network with multi-grain ranking loss for vehicle re-identification. IEEE Trans Image Process 28(9):4328–4338

    Article  MathSciNet  MATH  Google Scholar 

  2. Lou Y, Bai Y, Liu J, Wang S, Duan L-Y (2019) Embedding adversarial learning for vehicle re-identification. IEEE Trans Image Process 28(8):3794–3807

    Article  MathSciNet  MATH  Google Scholar 

  3. Zhou Y, Liu L, Shao L (2018) Vehicle re-identification by deep hidden multi-view inference. IEEE Trans Image Process 27(7):3275–3287

    Article  MathSciNet  MATH  Google Scholar 

  4. Liu X, Liu W, Ma H, Fu H (2016) Large-scale vehicle re-identification in urban surveillance videos. In: 2016 IEEE International Conference on Multimedia and Expo (ICME). IEEE, pp 1–6

  5. Liu X, Liu W, Mei T, Ma H (2017) Provid: progressive and multimodal vehicle reidentification for large-scale urban surveillance. IEEE Trans Multimed 20(3):645–658

    Article  MATH  Google Scholar 

  6. Ning E, Zhang C, Wang C, Ning X, Chen H, Bai X (2023) Pedestrian re-id based on feature consistency and contrast enhancement. Displays 79:102467

    Article  MATH  Google Scholar 

  7. Qu J, Zhang Y, Zhang Z (2023) Pma-net: a parallelly mixed attention network for person re-identification. Displays 78:102437

    Article  MATH  Google Scholar 

  8. Zheng W-S, Gong S, Xiang T (2012) Reidentification by relative distance comparison. IEEE Trans Pattern Anal Mach Intell 35(3):653–668

    Article  MATH  Google Scholar 

  9. Wang H, Peng J, Jiang G, Xu F, Fu X (2021) Discriminative feature and dictionary learning with part-aware model for vehicle re-identification. Neurocomputing 438:55–62

    Article  MATH  Google Scholar 

  10. Zhang L, Xiang T, Gong S (2016) Learning a discriminative null space for person re-identification. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 1239–1248

  11. Zhu K, Guo H, Zhang S, Wang Y, Liu J, Wang J, Tang M (2024) AAformer: auto-aligned transformer for person re-identification. IEEE Trans Neural Networks Learn Syst 35(12):17307–17317

    Article  Google Scholar 

  12. Shen F, Zhu J, Zhu X, Xie Y, Huang J (2021) Exploring spatial significance via hybrid pyramidal graph network for vehicle re-identification. IEEE Trans Intell Transp Syst 23(7):8793–8804

    Article  MATH  Google Scholar 

  13. Shen F, Xie Y, Zhu J, Zhu X, Zeng H (2023) Git: graph interactive transformer for vehicle re-identification. IEEE Trans Image Process 32:1039–1051

    Article  MATH  Google Scholar 

  14. Dosovitskiy A (2020) An image is worth 16x16 words: transformers for image recognition at scale. arXiv preprint arXiv:2010.11929

  15. He S, Luo H, Wang P, Wang F, Li H, Jiang W (2021) Transreid: transformer-based object re-identification. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp 15013–15022

  16. Zhu X, Su W, Lu L, Li B, Wang X, Dai J (2020) Deformable detr: deformable transformers for end-to-end object detection. arXiv preprint arXiv:2010.04159

  17. Chen J, Lu Y, Yu Q, Luo X, Adeli E, Wang Y, Lu L, Yuille AL, Zhou Y (2021) Transunet: transformers make strong encoders for medical image segmentation. arXiv preprint arXiv:2102.04306

  18. Touvron H, Cord M, Douze M, Massa F, Sablayrolles A, Jégou H (2021) Training data-efficient image transformers & distillation through attention. In: International Conference on Machine Learning. PMLR, pp 10347–10357

  19. Zhang G, Zhang Y, Zhang T, Li B, Pu S (2023) PHA: patch-wise high-frequency augmentation for transformer-based person re-identification. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 14133–14142

  20. Liu X, Liu W, Mei T, Ma H (2016) A deep learning-based approach to progressive vehicle re-identification for urban surveillance. In: Computer Vision—ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, October 11–14, 2016, Proceedings, Part II 14. Springer, pp 869–884

  21. Liu H, Tian Y, Yang Y, Pang L, Huang T (2016) Deep relative distance learning: tell the difference between similar vehicles. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 2167–2175

  22. Li J, Yu C, Shi J, Zhang C, Ke T (2022) Vehicle re-identification method based on swin-transformer network. Array 16:100255

    Article  MATH  Google Scholar 

  23. Wang T, Liu H, Song P, Guo T, Shi W (2022) Pose-guided feature disentangling for occluded person re-identification based on transformer. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol 36, pp 2540–2549

  24. Yu Z, Pei J, Zhu M, Zhang J, Li J (2022) Multi-attribute adaptive aggregation transformer for vehicle re-identification. Inf Process Manag 59(2):102868

    Article  MATH  Google Scholar 

  25. Lin W, Zhang J, Meng W, Liu X, Zhang X (2024) Hide: hierarchical iterative decoding enhancement for multi-view 3d human parameter regression. Comput Anim Virtual Worlds 35(3):2266

    Article  MATH  Google Scholar 

  26. Li Z, Zhang X, Tian C, Gao X, Gong Y, Wu J, Zhang G, Li J, Liu H (2023) Tvg-reid: transformer-based vehicle-graph re-identification. IEEE Trans Intel Veh 8(11):4644–4652

    Article  MATH  Google Scholar 

  27. Lee-Thorp J, Ainslie J, Eckstein I, Ontanon S (2021) Fnet: mixing tokens with Fourier transforms. arXiv preprint arXiv:2105.03824

  28. Rao Y, Zhao W, Zhu Z, Lu J, Zhou J (2021) Global filter networks for image classification. Adv Neural Inf Process Syst 34:980–993

    MATH  Google Scholar 

  29. Yao T, Pan Y, Li Y, Ngo C.-W, Mei T (2022) Wave-vit: unifying wavelet and transformers for visual representation learning. In: European Conference on Computer Vision. Springer, pp 328–345

  30. Patro BN, Namboodiri VP, Agneeswaran VS (2023) Spectformer: frequency and attention is what you need in a vision transformer. arXiv preprint arXiv:2304.06446

  31. Khorramshahi P, Kumar A, Peri N, Rambhatla SS, Chen J-C, Chellappa R (2019) A dual-path model with adaptive attention for vehicle re-identification. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp 6132–6141

  32. Zhu J, Zeng H, Huang J, Liao S, Lei Z, Cai C, Zheng L (2019) Vehicle re-identification using quadruple directional deep learning features. IEEE Trans Intell Transp Syst 21(1):410–420

    Article  Google Scholar 

  33. Liu X, Zhang S, Huang Q, Gao W (2018) Ram: a region-aware deep model for vehicle re-identification. In: 2018 IEEE International Conference on Multimedia and Expo (ICME). IEEE, pp 1–6

  34. He J, Chen J-N, Liu S, Kortylewski A, Yang C, Bai Y, Wang C (2022) Transfg: a transformer architecture for fine-grained recognition. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol 36, pp 852–860

  35. Zhang M, Tian X (2023) Transformer architecture based on mutual attention for image-anomaly detection. Virtual Real Intell Hardw 5(1):57–67

    Article  MathSciNet  MATH  Google Scholar 

  36. Zhu H, Ke W, Li D, Liu J, Tian L, Shan Y (2022) Dual cross-attention learning for fine-grained visual categorization and object re-identification. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 4692–4702

  37. Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser L, Polosukhin I (2017) Attention is all you need. Adv Neural Inf Proc Syst. https://doi.org/10.48550/ARXIV.1706.03762

    Article  Google Scholar 

  38. Zheng Z, Zheng L, Yang Y (2017) A discriminatively learned CNN embedding for person reidentification. ACM Trans Multimed Comput Commun Appl 14(1):1–20

    Article  MATH  Google Scholar 

  39. Liu H, Feng J, Qi M, Jiang J, Yan S (2017) End-to-end comparative attention networks for person re-identification. IEEE Trans Image Process 26(7):3492–3506

    Article  MathSciNet  MATH  Google Scholar 

  40. Shen F, Lin L, Wei M, Liu J, Zhu J, Zeng H, Cai C, Zheng L (2019) A large benchmark for fabric image retrieval. In: 2019 IEEE 4th International Conference on Image, Vision and Computing (ICIVC). IEEE, pp 247–251

  41. Zheng L, Shen L, Tian L, Wang S, Wang J, Tian Q (2015) Scalable person re-identification: a benchmark. In: Proceedings of the IEEE International Conference on Computer Vision, pp 1116–1124

  42. Zhu J, Zeng H, Huang J, Zhu X, Lei Z, Cai C, Zheng L (2019) Body symmetry and part-locality-guided direct nonparametric deep feature enhancement for person reidentification. IEEE Internet Things J 7(3):2053–2065

    Article  MATH  Google Scholar 

  43. Paszke A, Gross S, Massa F, Lerer A, Bradbury J, Chanan G, Killeen T, Lin Z, Gimelshein N, Antiga L et al (2019) Pytorch: an imperative style, high-performance deep learning library. Adv Neural Inf Proc Syst. https://doi.org/10.48550/arXiv.1912.01703

    Article  Google Scholar 

  44. Zhong Z, Zheng L, Kang G, Li S, Yang Y (2020) Random erasing data augmentation. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol 34, pp 13001–13008

  45. Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. Adv Neural Inf Proc Syst. https://doi.org/10.1145/3065386

    Article  MATH  Google Scholar 

  46. Zhu J, Zeng H, Lei Z, Liao S, Zheng L, Cai C (2018) A shortly and densely connected convolutional neural network for vehicle re-identification. In: 2018 24th International Conference on Pattern Recognition (ICPR). IEEE, pp 3285–3290

  47. Chen T-S, Liu C-T, Wu C-W, Chien S-Y (2020) Orientation-aware vehicle re-identification with semantics-guided part attention network. In: Computer Vision—ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part II 16. Springer, pp 330–346

  48. Zhu R, Fang J, Xu H, Yu H, Xue J (2020) Dcdlearn: multi-order deep cross-distance learning for vehicle re-identification. arXiv preprint arXiv:2003.11315

  49. Zhang R, Zhong X, Wang X, Huang W, Liu W, Zhao S (2022) Graph-based structural attributes for vehicle re-identification. In: 2022 IEEE International Conference on Multimedia and Expo (ICME). IEEE, pp 1–6

  50. Qian J, Jiang W, Luo H, Yu H (2020) Stripe-based and attribute-aware network: a two-branch deep model for vehicle re-identification. Meas Sci Technol 31(9):095401

    Article  MATH  Google Scholar 

  51. Sun Z, Nie X, Xi X, Yin Y (2020) CFVMNet: a multi-branch network for vehicle re-identification based on common field of view. In: Proceedings of the 28th ACM International Conference on Multimedia, pp 3523–3531

  52. Liu X, Liu W, Zheng J, Yan C, Mei T (2020) Beyond the parts: learning multi-view cross-part correlation for vehicle re-identification. In: Proceedings of the 28th ACM International Conference on Multimedia, pp 907–915

  53. Meng D, Li L, Liu X, Li Y, Yang S, Zha Z-J, Gao X, Wang S, Huang Q (2020) Parsing-based view-aware embedding network for vehicle re-identification. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 7103–7112

  54. Khorramshahi P, Peri N, Chen J-C, Chellappa R (2020) The devil is in the details: self-supervised attention for vehicle re-identification. In: Computer Vision—ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part XIV 16. Springer, pp 369–386

  55. Peri N, Khorramshahi P, Rambhatla SS, Shenoy V, Rawat S, Chen J-C, Chellappa R (2020) Towards real-time systems for vehicle re-identification, multi-camera tracking, and anomaly detection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, pp 622–623

  56. Kishore R, Aslam N, Kolekar MH (2024) PATReId: pose apprise transformer network for vehicle re-identification. IEEE Trans Emerg Top Comput Intell 8(5):3691–3702

    Article  MATH  Google Scholar 

  57. Xu Z, Wei L, Lang C, Feng S, Wang T, Bors AG (2021) HSS-GCN: a hierarchical spatial structural graph convolutional network for vehicle re-identification. In: Pattern Recognition. ICPR International Workshops and Challenges: Virtual Event, January 10–15, 2021, Proceedings, Part V. Springer, pp 356–364

  58. Xu Y, Jiang N, Zhang L, Zhou Z, Wu W (2019) Multi-scale vehicle re-identification using self-adapting label smoothing regularization. In: ICASSP 2019—2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, pp 2117–2121

  59. Zhu Y, Zha Z-J, Zhang T, Liu J, Luo J (2020) A structured graph attention network for vehicle re-identification. In: Proceedings of the 28th ACM International Conference on Multimedia, pp 646–654

  60. He Y, Dong C, Wei Y (2019) Combination of appearance and license plate features for vehicle re-identification. In: 2019 IEEE International Conference on Image Processing (ICIP). IEEE, pp 3108–3112

  61. Yang X, Lang C, Peng P, Xing J (2019) Vehicle re-identification by multi-grain Learni. In: 2019 IEEE International Conference on Image Processing (ICIP). IEEE, pp 3113–3117

Download references

Author information

Authors and Affiliations

Authors

Contributions

HX conducted research method design, experimental design, etc., write the initial draft of the paper, and participate in discussions and revisions. JW and YS participate in research design, method selection, etc., participate in paper writing and revision, and participate in drawing pictures and tables. MY is responsible for the planning, design, and implementation of the entire study, responsible for supervising and coordinating the overall research, ensure the completeness and accuracy of the research, write and revise papers, and provide equipment and technical support.

Corresponding author

Correspondence to Ming Ye.

Ethics declarations

Conflict of interest

The authors declare that they have no known conflict of interest or personal relationships that could have appeared to influence the work reported in this paper.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Xiang, H., Wang, J., Sun, Y. et al. Frequency transformer with local feature enhancement for improved vehicle re-identification. J Supercomput 81, 570 (2025). https://doi.org/10.1007/s11227-025-07012-4

Download citation

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s11227-025-07012-4

Keywords