Frequency transformer with local feature enhancement for improved vehicle re-identification

Xiang, Honglin; Wang, Jiahao; Sun, Yulong; Ye, Ming

doi:10.1007/s11227-025-07012-4

Frequency transformer with local feature enhancement for improved vehicle re-identification

Published: 03 March 2025

Volume 81, article number 570, (2025)
Cite this article

The Journal of Supercomputing Aims and scope Submit manuscript

Honglin Xiang¹,
Jiahao Wang¹,
Yulong Sun¹ &
…
Ming Ye¹

177 Accesses
Explore all metrics

Abstract

With the rapid development of intelligent security systems, the demand for vehicle re-identification has surged exponentially. Vehicle re-identification involves recognizing the same vehicle across different camera perspectives, necessitating robust local feature processing. While transformers have shown promising results in this field, their inherent self-attention mechanism tends to dilute high-frequency texture details, hindering local feature extraction. Additionally, challenges such as occlusion and misalignment can lead to information loss and noise introduction, reducing re-identification accuracy. To address these issues, we introduce the frequency transformer with local feature enhancement (LFFT). The proposed framework comprises a frequency layer and a jigsaw select patches module (JSPM). The frequency layer enhances the weights of high-frequency component features using fast Fourier transform to improve local feature extraction at the lower layers. Meanwhile, the attention layer at the higher layers continues to extract global features. The JSPM incorporates discriminative patches obtained from attention layers into randomly shuffled and reorganized groups, enhancing the global discriminative capability of local features. The method does not utilize additional information or auxiliary networks. Experimental evaluations on two vehicle re-identification datasets, VeRi-776 and VehicleID, demonstrate the effectiveness of our method compared to recent approaches. The code is available at https://github.com/xianghlin/LFFT, accompanied by detailed usage instructions.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A vehicle re-identification framework based on the improved multi-branch feature fusion network

Article Open access 12 October 2021

Vehicle Re-identification Using Joint Pyramid Feature Representation Network

Joint Pyramid Feature Representation Network for Vehicle Re-identification

Article 13 June 2020

Data availibility

No datasets were generated or analyzed during the current study.

References

Guo H, Zhu K, Tang M, Wang J (2019) Two-level attention network with multi-grain ranking loss for vehicle re-identification. IEEE Trans Image Process 28(9):4328–4338
Article MathSciNet MATH Google Scholar
Lou Y, Bai Y, Liu J, Wang S, Duan L-Y (2019) Embedding adversarial learning for vehicle re-identification. IEEE Trans Image Process 28(8):3794–3807
Article MathSciNet MATH Google Scholar
Zhou Y, Liu L, Shao L (2018) Vehicle re-identification by deep hidden multi-view inference. IEEE Trans Image Process 27(7):3275–3287
Article MathSciNet MATH Google Scholar
Liu X, Liu W, Ma H, Fu H (2016) Large-scale vehicle re-identification in urban surveillance videos. In: 2016 IEEE International Conference on Multimedia and Expo (ICME). IEEE, pp 1–6
Liu X, Liu W, Mei T, Ma H (2017) Provid: progressive and multimodal vehicle reidentification for large-scale urban surveillance. IEEE Trans Multimed 20(3):645–658
Article MATH Google Scholar
Ning E, Zhang C, Wang C, Ning X, Chen H, Bai X (2023) Pedestrian re-id based on feature consistency and contrast enhancement. Displays 79:102467
Article MATH Google Scholar
Qu J, Zhang Y, Zhang Z (2023) Pma-net: a parallelly mixed attention network for person re-identification. Displays 78:102437
Article MATH Google Scholar
Zheng W-S, Gong S, Xiang T (2012) Reidentification by relative distance comparison. IEEE Trans Pattern Anal Mach Intell 35(3):653–668
Article MATH Google Scholar
Wang H, Peng J, Jiang G, Xu F, Fu X (2021) Discriminative feature and dictionary learning with part-aware model for vehicle re-identification. Neurocomputing 438:55–62
Article MATH Google Scholar
Zhang L, Xiang T, Gong S (2016) Learning a discriminative null space for person re-identification. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 1239–1248
Zhu K, Guo H, Zhang S, Wang Y, Liu J, Wang J, Tang M (2024) AAformer: auto-aligned transformer for person re-identification. IEEE Trans Neural Networks Learn Syst 35(12):17307–17317
Article Google Scholar
Shen F, Zhu J, Zhu X, Xie Y, Huang J (2021) Exploring spatial significance via hybrid pyramidal graph network for vehicle re-identification. IEEE Trans Intell Transp Syst 23(7):8793–8804
Article MATH Google Scholar
Shen F, Xie Y, Zhu J, Zhu X, Zeng H (2023) Git: graph interactive transformer for vehicle re-identification. IEEE Trans Image Process 32:1039–1051
Article MATH Google Scholar
Dosovitskiy A (2020) An image is worth 16x16 words: transformers for image recognition at scale. arXiv preprint arXiv:2010.11929
He S, Luo H, Wang P, Wang F, Li H, Jiang W (2021) Transreid: transformer-based object re-identification. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp 15013–15022
Zhu X, Su W, Lu L, Li B, Wang X, Dai J (2020) Deformable detr: deformable transformers for end-to-end object detection. arXiv preprint arXiv:2010.04159
Chen J, Lu Y, Yu Q, Luo X, Adeli E, Wang Y, Lu L, Yuille AL, Zhou Y (2021) Transunet: transformers make strong encoders for medical image segmentation. arXiv preprint arXiv:2102.04306
Touvron H, Cord M, Douze M, Massa F, Sablayrolles A, Jégou H (2021) Training data-efficient image transformers & distillation through attention. In: International Conference on Machine Learning. PMLR, pp 10347–10357
Zhang G, Zhang Y, Zhang T, Li B, Pu S (2023) PHA: patch-wise high-frequency augmentation for transformer-based person re-identification. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 14133–14142
Liu X, Liu W, Mei T, Ma H (2016) A deep learning-based approach to progressive vehicle re-identification for urban surveillance. In: Computer Vision—ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, October 11–14, 2016, Proceedings, Part II 14. Springer, pp 869–884
Liu H, Tian Y, Yang Y, Pang L, Huang T (2016) Deep relative distance learning: tell the difference between similar vehicles. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 2167–2175
Li J, Yu C, Shi J, Zhang C, Ke T (2022) Vehicle re-identification method based on swin-transformer network. Array 16:100255
Article MATH Google Scholar
Wang T, Liu H, Song P, Guo T, Shi W (2022) Pose-guided feature disentangling for occluded person re-identification based on transformer. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol 36, pp 2540–2549
Yu Z, Pei J, Zhu M, Zhang J, Li J (2022) Multi-attribute adaptive aggregation transformer for vehicle re-identification. Inf Process Manag 59(2):102868
Article MATH Google Scholar
Lin W, Zhang J, Meng W, Liu X, Zhang X (2024) Hide: hierarchical iterative decoding enhancement for multi-view 3d human parameter regression. Comput Anim Virtual Worlds 35(3):2266
Article MATH Google Scholar
Li Z, Zhang X, Tian C, Gao X, Gong Y, Wu J, Zhang G, Li J, Liu H (2023) Tvg-reid: transformer-based vehicle-graph re-identification. IEEE Trans Intel Veh 8(11):4644–4652
Article MATH Google Scholar
Lee-Thorp J, Ainslie J, Eckstein I, Ontanon S (2021) Fnet: mixing tokens with Fourier transforms. arXiv preprint arXiv:2105.03824
Rao Y, Zhao W, Zhu Z, Lu J, Zhou J (2021) Global filter networks for image classification. Adv Neural Inf Process Syst 34:980–993
MATH Google Scholar
Yao T, Pan Y, Li Y, Ngo C.-W, Mei T (2022) Wave-vit: unifying wavelet and transformers for visual representation learning. In: European Conference on Computer Vision. Springer, pp 328–345
Patro BN, Namboodiri VP, Agneeswaran VS (2023) Spectformer: frequency and attention is what you need in a vision transformer. arXiv preprint arXiv:2304.06446
Khorramshahi P, Kumar A, Peri N, Rambhatla SS, Chen J-C, Chellappa R (2019) A dual-path model with adaptive attention for vehicle re-identification. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp 6132–6141
Zhu J, Zeng H, Huang J, Liao S, Lei Z, Cai C, Zheng L (2019) Vehicle re-identification using quadruple directional deep learning features. IEEE Trans Intell Transp Syst 21(1):410–420
Article Google Scholar
Liu X, Zhang S, Huang Q, Gao W (2018) Ram: a region-aware deep model for vehicle re-identification. In: 2018 IEEE International Conference on Multimedia and Expo (ICME). IEEE, pp 1–6
He J, Chen J-N, Liu S, Kortylewski A, Yang C, Bai Y, Wang C (2022) Transfg: a transformer architecture for fine-grained recognition. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol 36, pp 852–860
Zhang M, Tian X (2023) Transformer architecture based on mutual attention for image-anomaly detection. Virtual Real Intell Hardw 5(1):57–67
Article MathSciNet MATH Google Scholar
Zhu H, Ke W, Li D, Liu J, Tian L, Shan Y (2022) Dual cross-attention learning for fine-grained visual categorization and object re-identification. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 4692–4702
Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser L, Polosukhin I (2017) Attention is all you need. Adv Neural Inf Proc Syst. https://doi.org/10.48550/ARXIV.1706.03762
Article Google Scholar
Zheng Z, Zheng L, Yang Y (2017) A discriminatively learned CNN embedding for person reidentification. ACM Trans Multimed Comput Commun Appl 14(1):1–20
Article MATH Google Scholar
Liu H, Feng J, Qi M, Jiang J, Yan S (2017) End-to-end comparative attention networks for person re-identification. IEEE Trans Image Process 26(7):3492–3506
Article MathSciNet MATH Google Scholar
Shen F, Lin L, Wei M, Liu J, Zhu J, Zeng H, Cai C, Zheng L (2019) A large benchmark for fabric image retrieval. In: 2019 IEEE 4th International Conference on Image, Vision and Computing (ICIVC). IEEE, pp 247–251
Zheng L, Shen L, Tian L, Wang S, Wang J, Tian Q (2015) Scalable person re-identification: a benchmark. In: Proceedings of the IEEE International Conference on Computer Vision, pp 1116–1124
Zhu J, Zeng H, Huang J, Zhu X, Lei Z, Cai C, Zheng L (2019) Body symmetry and part-locality-guided direct nonparametric deep feature enhancement for person reidentification. IEEE Internet Things J 7(3):2053–2065
Article MATH Google Scholar
Paszke A, Gross S, Massa F, Lerer A, Bradbury J, Chanan G, Killeen T, Lin Z, Gimelshein N, Antiga L et al (2019) Pytorch: an imperative style, high-performance deep learning library. Adv Neural Inf Proc Syst. https://doi.org/10.48550/arXiv.1912.01703
Article Google Scholar
Zhong Z, Zheng L, Kang G, Li S, Yang Y (2020) Random erasing data augmentation. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol 34, pp 13001–13008
Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. Adv Neural Inf Proc Syst. https://doi.org/10.1145/3065386
Article MATH Google Scholar
Zhu J, Zeng H, Lei Z, Liao S, Zheng L, Cai C (2018) A shortly and densely connected convolutional neural network for vehicle re-identification. In: 2018 24th International Conference on Pattern Recognition (ICPR). IEEE, pp 3285–3290
Chen T-S, Liu C-T, Wu C-W, Chien S-Y (2020) Orientation-aware vehicle re-identification with semantics-guided part attention network. In: Computer Vision—ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part II 16. Springer, pp 330–346
Zhu R, Fang J, Xu H, Yu H, Xue J (2020) Dcdlearn: multi-order deep cross-distance learning for vehicle re-identification. arXiv preprint arXiv:2003.11315
Zhang R, Zhong X, Wang X, Huang W, Liu W, Zhao S (2022) Graph-based structural attributes for vehicle re-identification. In: 2022 IEEE International Conference on Multimedia and Expo (ICME). IEEE, pp 1–6
Qian J, Jiang W, Luo H, Yu H (2020) Stripe-based and attribute-aware network: a two-branch deep model for vehicle re-identification. Meas Sci Technol 31(9):095401
Article MATH Google Scholar
Sun Z, Nie X, Xi X, Yin Y (2020) CFVMNet: a multi-branch network for vehicle re-identification based on common field of view. In: Proceedings of the 28th ACM International Conference on Multimedia, pp 3523–3531
Liu X, Liu W, Zheng J, Yan C, Mei T (2020) Beyond the parts: learning multi-view cross-part correlation for vehicle re-identification. In: Proceedings of the 28th ACM International Conference on Multimedia, pp 907–915
Meng D, Li L, Liu X, Li Y, Yang S, Zha Z-J, Gao X, Wang S, Huang Q (2020) Parsing-based view-aware embedding network for vehicle re-identification. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 7103–7112
Khorramshahi P, Peri N, Chen J-C, Chellappa R (2020) The devil is in the details: self-supervised attention for vehicle re-identification. In: Computer Vision—ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part XIV 16. Springer, pp 369–386
Peri N, Khorramshahi P, Rambhatla SS, Shenoy V, Rawat S, Chen J-C, Chellappa R (2020) Towards real-time systems for vehicle re-identification, multi-camera tracking, and anomaly detection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, pp 622–623
Kishore R, Aslam N, Kolekar MH (2024) PATReId: pose apprise transformer network for vehicle re-identification. IEEE Trans Emerg Top Comput Intell 8(5):3691–3702
Article MATH Google Scholar
Xu Z, Wei L, Lang C, Feng S, Wang T, Bors AG (2021) HSS-GCN: a hierarchical spatial structural graph convolutional network for vehicle re-identification. In: Pattern Recognition. ICPR International Workshops and Challenges: Virtual Event, January 10–15, 2021, Proceedings, Part V. Springer, pp 356–364
Xu Y, Jiang N, Zhang L, Zhou Z, Wu W (2019) Multi-scale vehicle re-identification using self-adapting label smoothing regularization. In: ICASSP 2019—2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, pp 2117–2121
Zhu Y, Zha Z-J, Zhang T, Liu J, Luo J (2020) A structured graph attention network for vehicle re-identification. In: Proceedings of the 28th ACM International Conference on Multimedia, pp 646–654
He Y, Dong C, Wei Y (2019) Combination of appearance and license plate features for vehicle re-identification. In: 2019 IEEE International Conference on Image Processing (ICIP). IEEE, pp 3108–3112
Yang X, Lang C, Peng P, Xing J (2019) Vehicle re-identification by multi-grain Learni. In: 2019 IEEE International Conference on Image Processing (ICIP). IEEE, pp 3113–3117

Download references

Author information

Authors and Affiliations

College of Artificial Intelligence, Southwest University, Chongqing, 400715, China
Honglin Xiang, Jiahao Wang, Yulong Sun & Ming Ye

Authors

Honglin Xiang
View author publications
You can also search for this author inPubMed Google Scholar
Jiahao Wang
View author publications
You can also search for this author inPubMed Google Scholar
Yulong Sun
View author publications
You can also search for this author inPubMed Google Scholar
Ming Ye
View author publications
You can also search for this author inPubMed Google Scholar

Contributions

HX conducted research method design, experimental design, etc., write the initial draft of the paper, and participate in discussions and revisions. JW and YS participate in research design, method selection, etc., participate in paper writing and revision, and participate in drawing pictures and tables. MY is responsible for the planning, design, and implementation of the entire study, responsible for supervising and coordinating the overall research, ensure the completeness and accuracy of the research, write and revise papers, and provide equipment and technical support.

Corresponding author

Correspondence to Ming Ye.

Ethics declarations

Conflict of interest

The authors declare that they have no known conflict of interest or personal relationships that could have appeared to influence the work reported in this paper.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Xiang, H., Wang, J., Sun, Y. et al. Frequency transformer with local feature enhancement for improved vehicle re-identification. J Supercomput 81, 570 (2025). https://doi.org/10.1007/s11227-025-07012-4

Download citation

Accepted: 30 January 2025
Published: 03 March 2025
DOI: https://doi.org/10.1007/s11227-025-07012-4

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Frequency transformer with local feature enhancement for improved vehicle re-identification

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

A vehicle re-identification framework based on the improved multi-branch feature fusion network

Vehicle Re-identification Using Joint Pyramid Feature Representation Network

Joint Pyramid Feature Representation Network for Vehicle Re-identification

Data availibility

References

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now