Abstract
With the rapid development of intelligent security systems, the demand for vehicle re-identification has surged exponentially. Vehicle re-identification involves recognizing the same vehicle across different camera perspectives, necessitating robust local feature processing. While transformers have shown promising results in this field, their inherent self-attention mechanism tends to dilute high-frequency texture details, hindering local feature extraction. Additionally, challenges such as occlusion and misalignment can lead to information loss and noise introduction, reducing re-identification accuracy. To address these issues, we introduce the frequency transformer with local feature enhancement (LFFT). The proposed framework comprises a frequency layer and a jigsaw select patches module (JSPM). The frequency layer enhances the weights of high-frequency component features using fast Fourier transform to improve local feature extraction at the lower layers. Meanwhile, the attention layer at the higher layers continues to extract global features. The JSPM incorporates discriminative patches obtained from attention layers into randomly shuffled and reorganized groups, enhancing the global discriminative capability of local features. The method does not utilize additional information or auxiliary networks. Experimental evaluations on two vehicle re-identification datasets, VeRi-776 and VehicleID, demonstrate the effectiveness of our method compared to recent approaches. The code is available at https://github.com/xianghlin/LFFT, accompanied by detailed usage instructions.









Similar content being viewed by others
Data availibility
No datasets were generated or analyzed during the current study.
References
Guo H, Zhu K, Tang M, Wang J (2019) Two-level attention network with multi-grain ranking loss for vehicle re-identification. IEEE Trans Image Process 28(9):4328–4338
Lou Y, Bai Y, Liu J, Wang S, Duan L-Y (2019) Embedding adversarial learning for vehicle re-identification. IEEE Trans Image Process 28(8):3794–3807
Zhou Y, Liu L, Shao L (2018) Vehicle re-identification by deep hidden multi-view inference. IEEE Trans Image Process 27(7):3275–3287
Liu X, Liu W, Ma H, Fu H (2016) Large-scale vehicle re-identification in urban surveillance videos. In: 2016 IEEE International Conference on Multimedia and Expo (ICME). IEEE, pp 1–6
Liu X, Liu W, Mei T, Ma H (2017) Provid: progressive and multimodal vehicle reidentification for large-scale urban surveillance. IEEE Trans Multimed 20(3):645–658
Ning E, Zhang C, Wang C, Ning X, Chen H, Bai X (2023) Pedestrian re-id based on feature consistency and contrast enhancement. Displays 79:102467
Qu J, Zhang Y, Zhang Z (2023) Pma-net: a parallelly mixed attention network for person re-identification. Displays 78:102437
Zheng W-S, Gong S, Xiang T (2012) Reidentification by relative distance comparison. IEEE Trans Pattern Anal Mach Intell 35(3):653–668
Wang H, Peng J, Jiang G, Xu F, Fu X (2021) Discriminative feature and dictionary learning with part-aware model for vehicle re-identification. Neurocomputing 438:55–62
Zhang L, Xiang T, Gong S (2016) Learning a discriminative null space for person re-identification. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 1239–1248
Zhu K, Guo H, Zhang S, Wang Y, Liu J, Wang J, Tang M (2024) AAformer: auto-aligned transformer for person re-identification. IEEE Trans Neural Networks Learn Syst 35(12):17307–17317
Shen F, Zhu J, Zhu X, Xie Y, Huang J (2021) Exploring spatial significance via hybrid pyramidal graph network for vehicle re-identification. IEEE Trans Intell Transp Syst 23(7):8793–8804
Shen F, Xie Y, Zhu J, Zhu X, Zeng H (2023) Git: graph interactive transformer for vehicle re-identification. IEEE Trans Image Process 32:1039–1051
Dosovitskiy A (2020) An image is worth 16x16 words: transformers for image recognition at scale. arXiv preprint arXiv:2010.11929
He S, Luo H, Wang P, Wang F, Li H, Jiang W (2021) Transreid: transformer-based object re-identification. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp 15013–15022
Zhu X, Su W, Lu L, Li B, Wang X, Dai J (2020) Deformable detr: deformable transformers for end-to-end object detection. arXiv preprint arXiv:2010.04159
Chen J, Lu Y, Yu Q, Luo X, Adeli E, Wang Y, Lu L, Yuille AL, Zhou Y (2021) Transunet: transformers make strong encoders for medical image segmentation. arXiv preprint arXiv:2102.04306
Touvron H, Cord M, Douze M, Massa F, Sablayrolles A, Jégou H (2021) Training data-efficient image transformers & distillation through attention. In: International Conference on Machine Learning. PMLR, pp 10347–10357
Zhang G, Zhang Y, Zhang T, Li B, Pu S (2023) PHA: patch-wise high-frequency augmentation for transformer-based person re-identification. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 14133–14142
Liu X, Liu W, Mei T, Ma H (2016) A deep learning-based approach to progressive vehicle re-identification for urban surveillance. In: Computer Vision—ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, October 11–14, 2016, Proceedings, Part II 14. Springer, pp 869–884
Liu H, Tian Y, Yang Y, Pang L, Huang T (2016) Deep relative distance learning: tell the difference between similar vehicles. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 2167–2175
Li J, Yu C, Shi J, Zhang C, Ke T (2022) Vehicle re-identification method based on swin-transformer network. Array 16:100255
Wang T, Liu H, Song P, Guo T, Shi W (2022) Pose-guided feature disentangling for occluded person re-identification based on transformer. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol 36, pp 2540–2549
Yu Z, Pei J, Zhu M, Zhang J, Li J (2022) Multi-attribute adaptive aggregation transformer for vehicle re-identification. Inf Process Manag 59(2):102868
Lin W, Zhang J, Meng W, Liu X, Zhang X (2024) Hide: hierarchical iterative decoding enhancement for multi-view 3d human parameter regression. Comput Anim Virtual Worlds 35(3):2266
Li Z, Zhang X, Tian C, Gao X, Gong Y, Wu J, Zhang G, Li J, Liu H (2023) Tvg-reid: transformer-based vehicle-graph re-identification. IEEE Trans Intel Veh 8(11):4644–4652
Lee-Thorp J, Ainslie J, Eckstein I, Ontanon S (2021) Fnet: mixing tokens with Fourier transforms. arXiv preprint arXiv:2105.03824
Rao Y, Zhao W, Zhu Z, Lu J, Zhou J (2021) Global filter networks for image classification. Adv Neural Inf Process Syst 34:980–993
Yao T, Pan Y, Li Y, Ngo C.-W, Mei T (2022) Wave-vit: unifying wavelet and transformers for visual representation learning. In: European Conference on Computer Vision. Springer, pp 328–345
Patro BN, Namboodiri VP, Agneeswaran VS (2023) Spectformer: frequency and attention is what you need in a vision transformer. arXiv preprint arXiv:2304.06446
Khorramshahi P, Kumar A, Peri N, Rambhatla SS, Chen J-C, Chellappa R (2019) A dual-path model with adaptive attention for vehicle re-identification. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp 6132–6141
Zhu J, Zeng H, Huang J, Liao S, Lei Z, Cai C, Zheng L (2019) Vehicle re-identification using quadruple directional deep learning features. IEEE Trans Intell Transp Syst 21(1):410–420
Liu X, Zhang S, Huang Q, Gao W (2018) Ram: a region-aware deep model for vehicle re-identification. In: 2018 IEEE International Conference on Multimedia and Expo (ICME). IEEE, pp 1–6
He J, Chen J-N, Liu S, Kortylewski A, Yang C, Bai Y, Wang C (2022) Transfg: a transformer architecture for fine-grained recognition. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol 36, pp 852–860
Zhang M, Tian X (2023) Transformer architecture based on mutual attention for image-anomaly detection. Virtual Real Intell Hardw 5(1):57–67
Zhu H, Ke W, Li D, Liu J, Tian L, Shan Y (2022) Dual cross-attention learning for fine-grained visual categorization and object re-identification. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 4692–4702
Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser L, Polosukhin I (2017) Attention is all you need. Adv Neural Inf Proc Syst. https://doi.org/10.48550/ARXIV.1706.03762
Zheng Z, Zheng L, Yang Y (2017) A discriminatively learned CNN embedding for person reidentification. ACM Trans Multimed Comput Commun Appl 14(1):1–20
Liu H, Feng J, Qi M, Jiang J, Yan S (2017) End-to-end comparative attention networks for person re-identification. IEEE Trans Image Process 26(7):3492–3506
Shen F, Lin L, Wei M, Liu J, Zhu J, Zeng H, Cai C, Zheng L (2019) A large benchmark for fabric image retrieval. In: 2019 IEEE 4th International Conference on Image, Vision and Computing (ICIVC). IEEE, pp 247–251
Zheng L, Shen L, Tian L, Wang S, Wang J, Tian Q (2015) Scalable person re-identification: a benchmark. In: Proceedings of the IEEE International Conference on Computer Vision, pp 1116–1124
Zhu J, Zeng H, Huang J, Zhu X, Lei Z, Cai C, Zheng L (2019) Body symmetry and part-locality-guided direct nonparametric deep feature enhancement for person reidentification. IEEE Internet Things J 7(3):2053–2065
Paszke A, Gross S, Massa F, Lerer A, Bradbury J, Chanan G, Killeen T, Lin Z, Gimelshein N, Antiga L et al (2019) Pytorch: an imperative style, high-performance deep learning library. Adv Neural Inf Proc Syst. https://doi.org/10.48550/arXiv.1912.01703
Zhong Z, Zheng L, Kang G, Li S, Yang Y (2020) Random erasing data augmentation. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol 34, pp 13001–13008
Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. Adv Neural Inf Proc Syst. https://doi.org/10.1145/3065386
Zhu J, Zeng H, Lei Z, Liao S, Zheng L, Cai C (2018) A shortly and densely connected convolutional neural network for vehicle re-identification. In: 2018 24th International Conference on Pattern Recognition (ICPR). IEEE, pp 3285–3290
Chen T-S, Liu C-T, Wu C-W, Chien S-Y (2020) Orientation-aware vehicle re-identification with semantics-guided part attention network. In: Computer Vision—ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part II 16. Springer, pp 330–346
Zhu R, Fang J, Xu H, Yu H, Xue J (2020) Dcdlearn: multi-order deep cross-distance learning for vehicle re-identification. arXiv preprint arXiv:2003.11315
Zhang R, Zhong X, Wang X, Huang W, Liu W, Zhao S (2022) Graph-based structural attributes for vehicle re-identification. In: 2022 IEEE International Conference on Multimedia and Expo (ICME). IEEE, pp 1–6
Qian J, Jiang W, Luo H, Yu H (2020) Stripe-based and attribute-aware network: a two-branch deep model for vehicle re-identification. Meas Sci Technol 31(9):095401
Sun Z, Nie X, Xi X, Yin Y (2020) CFVMNet: a multi-branch network for vehicle re-identification based on common field of view. In: Proceedings of the 28th ACM International Conference on Multimedia, pp 3523–3531
Liu X, Liu W, Zheng J, Yan C, Mei T (2020) Beyond the parts: learning multi-view cross-part correlation for vehicle re-identification. In: Proceedings of the 28th ACM International Conference on Multimedia, pp 907–915
Meng D, Li L, Liu X, Li Y, Yang S, Zha Z-J, Gao X, Wang S, Huang Q (2020) Parsing-based view-aware embedding network for vehicle re-identification. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 7103–7112
Khorramshahi P, Peri N, Chen J-C, Chellappa R (2020) The devil is in the details: self-supervised attention for vehicle re-identification. In: Computer Vision—ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part XIV 16. Springer, pp 369–386
Peri N, Khorramshahi P, Rambhatla SS, Shenoy V, Rawat S, Chen J-C, Chellappa R (2020) Towards real-time systems for vehicle re-identification, multi-camera tracking, and anomaly detection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, pp 622–623
Kishore R, Aslam N, Kolekar MH (2024) PATReId: pose apprise transformer network for vehicle re-identification. IEEE Trans Emerg Top Comput Intell 8(5):3691–3702
Xu Z, Wei L, Lang C, Feng S, Wang T, Bors AG (2021) HSS-GCN: a hierarchical spatial structural graph convolutional network for vehicle re-identification. In: Pattern Recognition. ICPR International Workshops and Challenges: Virtual Event, January 10–15, 2021, Proceedings, Part V. Springer, pp 356–364
Xu Y, Jiang N, Zhang L, Zhou Z, Wu W (2019) Multi-scale vehicle re-identification using self-adapting label smoothing regularization. In: ICASSP 2019—2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, pp 2117–2121
Zhu Y, Zha Z-J, Zhang T, Liu J, Luo J (2020) A structured graph attention network for vehicle re-identification. In: Proceedings of the 28th ACM International Conference on Multimedia, pp 646–654
He Y, Dong C, Wei Y (2019) Combination of appearance and license plate features for vehicle re-identification. In: 2019 IEEE International Conference on Image Processing (ICIP). IEEE, pp 3108–3112
Yang X, Lang C, Peng P, Xing J (2019) Vehicle re-identification by multi-grain Learni. In: 2019 IEEE International Conference on Image Processing (ICIP). IEEE, pp 3113–3117
Author information
Authors and Affiliations
Contributions
HX conducted research method design, experimental design, etc., write the initial draft of the paper, and participate in discussions and revisions. JW and YS participate in research design, method selection, etc., participate in paper writing and revision, and participate in drawing pictures and tables. MY is responsible for the planning, design, and implementation of the entire study, responsible for supervising and coordinating the overall research, ensure the completeness and accuracy of the research, write and revise papers, and provide equipment and technical support.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no known conflict of interest or personal relationships that could have appeared to influence the work reported in this paper.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Xiang, H., Wang, J., Sun, Y. et al. Frequency transformer with local feature enhancement for improved vehicle re-identification. J Supercomput 81, 570 (2025). https://doi.org/10.1007/s11227-025-07012-4
Accepted:
Published:
DOI: https://doi.org/10.1007/s11227-025-07012-4