Skip to main content

Advertisement

Log in

UnifiedSC: a unified framework via collaborative optimization for multi-task person re-identification

  • Published:
Applied Intelligence Aims and scope Submit manuscript

Abstract

Person re-identification (ReID) encompasses two independent study branches, i.e., single-modality and cross-modality identifications. Since single-modality and cross-modality pedestrian data have completely different properties, it is hard to accomplish both tasks at once. However, studying either of the two tasks alone limits the application of person ReID. Therefore, we first explore the relationship between single-modality and cross-modality person ReID and attempt to solve the multi-task optimization problem. To this end, we propose a unified framework, termed UnifiedSC, to mine identity-invariant discriminative features for multi-task person ReID. To effectively optimize the deep model, we construct a collaborative optimization strategy to simultaneously train visible and infrared images from two aspects. On the one hand, two independent classifiers are designed to separately perform single-modality and cross-modality pedestrian identification. On the other hand, we handle the identity-aware feature discrepancy problem at both the feature and classifier levels. At the feature level, we introduce a verification model to distinguish positive/negative sample pairs and employ the weighted regularization triplet to constrain the relative feature distribution. Meanwhile, at the classifier level, we create a shared-weight classifier to map pedestrian features from different domains into a similar feature space. With the promotion of collaborative optimization, the proposed UnifiedSC framework could perceive different pedestrian information and mine identity-invariant features. Our method achieves a mean rank-1 of = 84.7% on the Market1501 and SYSU-MM01 databases, while it also achieves a mean rank-1 of = 78.9% on the DukeMTMC-reID and SYSU-MM01 databases. Abundant experiments adequately demonstrate that UnifiedSC achieves state-of-the-art performance in both tasks and is valuable for person ReID.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4

Similar content being viewed by others

Explore related subjects

Discover the latest articles, news and stories from top researchers in related subjects.

Data availability

The datasets generated during and/or analysed during the current study are available from the corresponding author on reasonable request.

References

  1. Bai X, Yang M, Huang T, Dou Z, Yu R, Xu Y (2020) Deep-person: Learning discriminative deep features for person re-identification. Pattern Recogn 98:107036

    Article  Google Scholar 

  2. Chen D, Wu P, Jia T, Xu F (2022) Hob-net: high-order block network via deep metric learning for person re-identification. Appl Intell 52:4844–4857

    Article  Google Scholar 

  3. Chen S, Jiang K, Liu X, Yang K, Lei Z (2023) Tgas-reid: Efficient architecture search for person re-identification via greedy decisions with topological order. Appl Intell 53(7):7343–7354

    Article  Google Scholar 

  4. Chen W, Chen X, Zhang J, Huang K (2017) Beyond triplet loss: A deep quadruplet network for person re-identification. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp 403–412

  5. Chen X, Zheng X, Lu X (2021) Bidirectional interaction network for person re-identification. IEEE Trans Image Process 30:1935–1948

    Article  Google Scholar 

  6. Chen X, Zheng X, Lu X (2023) Identity feature disentanglement for visible-infrared person re-identification. ACM Transactions on Multimedia Computing, Communications and Applications

    Book  Google Scholar 

  7. Chen Y, Wan L, Li Z, Jing Q, Sun Z (2021b). Neural feature search for rgb-infrared person re-identification. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp 587–597

  8. Fu C, Hu Y, Wu X, Shi H, Mei T, He R (2021) Cm-nas: Cross-modality neural architecture search for visible-infrared person re-identification. In Proceedings of the IEEE international conference on computer vision, pp 11823–11832

  9. Gao X, He F, Zhang S, Luo J, Fan B (2023) A fast nondominated sorting-based moea with convergence and diversity adjusted adaptively. J Supercomputing, pp 1–38

  10. Gao Y, Liang T, Jin Y, Gu X, Liu W, Li Y, Lang C (2021) Mso: Multi-feature space joint optimization network for rgb-infrared person re-identification. In Proceedings of the ACM international conference on multimedia, pp 5257–5265

  11. Gong S, Cheng J, Hou Z (2020) Faster person re-identification. In Proceedings of the European conference on computer vision, pp 275–292

  12. Guo C, Zhao X, Zou Q (2022) Relation network based on multi-granular hypergraphs for person re-identification. Appl Intell 52(10):11394–11406

    Article  Google Scholar 

  13. He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778

  14. He L, Liu W (2020) Guided saliency feature learning for person re-identification in crowded scenes. In: Proceedings of the European conference on computer vision, pp 357–373

  15. He S, Luo H, Wang P, Wang F, Li H, Jiang W (2021) Transreid: Transformer-based object re-identification. In: Proceedings of the IEEE international conference on computer vision, pp 15013–15022

  16. Jia M, Cheng X, Lu S, Zhang J (2022) Learning disentangled representation implicitly via transformer for occluded person re-identification. IEEE Trans Multimed

  17. Li D, Wei X, Hong X, Gong Y (2020a) Infrared-visible cross-modal person re-identification with an x modality. In: Proceedings of the association for the advance of artificial intelligence, pp 4610–4617

  18. Li H, Wu G, Zheng W (2021) Combined depth space based architecture search for person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 6729–6738

  19. Li P, Wang Y, Si T, Ullah K, Han W, Wang L (2023) Dsfa: cross-scene domain style and feature adaptation for landslide detection from high spatial resolution images. International Journal of Digital Earth 16(1):2426–2447

    Article  Google Scholar 

  20. Li P, Wang Y, Xu G, Wang L (2023) Landslidecl: towards robust landslide analysis guided by contrastive learning. Landslides 20:461–474

    Article  Google Scholar 

  21. Li S, Yu H, Hu R (2020) Attributes-aided part detection and refinement for person re-identification. Pattern Recogn 97:107016

    Article  Google Scholar 

  22. Liang T, Jin Y, Liu W, Li Y (2023) Cross-modality transformer with modality mining for visible-infrared person re-identification. IEEE Trans Multimed

  23. Liu C, Chang X, Shen Y (2020) Unity style transfer for person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 6887–6896

  24. Liu J, Song W, Chen C, Liu F (2022) Cross-modality person re-identification via channel-based partition network. Appl Intell 52:2423–2435

    Article  Google Scholar 

  25. Lu Z, Lin R, Hu H (2023) Tri-level modality-information disentanglement for visible-infrared person re-identification. IEEE Trans Multimed

  26. Luo H, Jiang W, Gu Y, Liu F, Liao X, Lai S, Gu J (2020) A strong baseline and batch normalization neck for deep person re-identification. IEEE Trans Multimedia 22(10):2597–2609

    Article  Google Scholar 

  27. Nguyen BX, Nguyen BD, Do T, Tjiputra E, Tran QD, Nguyen A (2021) Graph-based person signature for person re-identifications. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 3492–3501

  28. Park H, Lee S, Lee J, Ham B (2021) Learning by aligning: Visible-infrared person re-identification using cross-modal correspondences. In: Proceedings of the IEEE international conference on computer vision, pp 12046–12055

  29. Qi J, Liang T, Liu W, Li Y, Jin Y (2023) A generative-based image fusion strategy for visible-infrared person re-identification. IEEE Trans Circ Syst Vid Technol

  30. Rao Y, Chen G, Lu J, Zhou J (2021) Counterfactual attention learning for fine-grained visual categorization and re-identification. In: Proceedings of the IEEE international conference on computer vision, pp 1025–1034

  31. Ristani E, Solera F, Zou R, Cucchiara R, Tomasi C (2016) Performance measures and a data set for multi-target, multi-camera tracking. In: Proceedings of the European conference on computer vision, pp 17–35

  32. Serbetci A, Akgul YS (2020) End-to-end training of cnn ensembles for person re-identification. Pattern Recogn 104:107319

    Article  Google Scholar 

  33. Si T, He F, Li P, Gao X (2023) Tri-modality consistency optimization with heterogeneous augmented images for visible-infrared person re-identification. Neurocomputing 523:170–181

    Article  Google Scholar 

  34. Si T, He F, Li P, Song Y, Fan L (2023) Diversity feature constraint based on heterogeneous data for unsupervised person re-identification. Information Processing & Management 60(3):103304

    Article  Google Scholar 

  35. Si T, He F, Li P, Ye M (2023c) Homogeneous and heterogeneous optimization for unsupervised cross-modality person re-identification in visual internet of things. IEEE Internet of Things Journal

  36. Si T, He F, Wu H, Duan Y (2022) Spatial-driven features based on image dependencies for person re-identification. Pattern Recogn 124:108462

    Article  Google Scholar 

  37. Si T, He F, Zhang Z, Duan Y (2022) Hybrid contrastive learning for unsupervised person re-identification. IEEE Trans Multimedia 25:4323–4334

    Article  Google Scholar 

  38. Si T, Zhang Z, Liu S (2019) Compact triplet loss for person re-identification in camera sensor networks. Ad Hoc Netw 95:101984

    Article  Google Scholar 

  39. Tian X, Zhang Z, Lin S, Qu Y, Xie Y, Ma L (2021) Farewell to mutual information: Variational distillation for cross-modal person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1522–1531

  40. Wan L, Sun Z, Jing Q, Chen Y, Lu L, Li Z (2023) G2da: Geometry-guided dual-alignment learning for rgb-infrared person re-identification. Pattern Recogn 135:109150

    Article  Google Scholar 

  41. Wang G, Yuan Y, Chen X, Li J, Zhou X (2018) Learning discriminative features with multiple granularities for person re-identification. In: ACM International conference on multimedia, pp 274–282

  42. Wang G, Zhang T, Yang Y, Cheng J, Chang J, Liang X, Hou Z (2020) Cross-modality paired-images generation for rgb-infrared person re-identification. In: Proceedings of the association for the advance of artificial intelligence, pp 12144–12151

  43. Wang Z, Jiang J, Wu Y, Ye M, Bai X, Satoh S (2019) Learning sparse and identity-preserved hidden attributes for person re-identification. IEEE Trans Image Process 29:2013–2025

    Article  Google Scholar 

  44. Wei Z, Yang X, Wang N, Gao X (2021) Syncretic modality collaborative learning for visible infrared person re-identification. In: Proceedings of the IEEE international conference on computer vision, pp 225–234

  45. Wu A, Zheng W, Yu H, Gong S, Lai J (2017) Rgb-infrared cross-modality person re-identification. In: Proceedings of the IEEE international conference on computer vision, pp 5380–5389

  46. Wu G, Zhu X, Gong S (2021) Learning hybrid ranking representation for person re-identification. Pattern Recogn 121:108239

    Article  Google Scholar 

  47. Wu G, Zhu X, Gong S (2022) Learning hybrid ranking representation for person re-identification. Pattern Recogn 121:108239

    Article  Google Scholar 

  48. Wu H, He F, Duan Y, Yan S (2023) Perceptual metric-guided human image generation. Integr Comput-Aided Engineer. 2022, 29(2): 141–151

  49. Ye M, Chen C, Shen J, Shao L (2022) Dynamic tri-level relation mining with attentive graph for visible infrared re-identification. IEEE Trans Inf Forensics Secur 17:386–398

    Article  Google Scholar 

  50. Ye M, Ruan W, Du B, Shou MZ (2021) Channel augmented joint learning for visible-infrared recognition. In: Proceedings of the IEEE International conference on computer vision, pp 13567–13576

  51. Ye M, Shen J, J Crandall D, Shao L, Luo J (2020a) Dynamic dual-attentive aggregation learning for visible-infrared person re-identification. In: Proceedings of the European conference on computer vision, pp 229–247

  52. Ye M, Shen J, Lin G, Xiang T, Shao L, Hoi SC (2022) Deep learning for person re-identification: A survey and outlook. IEEE Trans Pattern Anal Mach Intell 44(6):2872–2893

    Article  Google Scholar 

  53. Ye M, Shen J, Shao L (2020) Visible-infrared person re-identification via homogeneous augmented tri-modal learning. IEEE Trans Inf Forensics Secur 16:728–739

    Article  Google Scholar 

  54. Zhang D, Zhang Z, Ju Y, Wang C, Xie Y, Qu Y (2022) Dual mutual learning for cross-modality person re-identification. IEEE Trans Circuits Syst Video Technol 32(8):5361–5373

    Article  Google Scholar 

  55. Zhang L, Guo H, Zhu K, Qiao H, Huang G, Zhang S, Zhang H, Sun J, Wang J (2022) Hybrid modality metric learning for visible-infrared person re-identification. ACM Trans Multimed Comput Commun Appl 18(1):1–15

    Google Scholar 

  56. Zhang Q, Lai C, Liu J, Huang N, Han J (2022c) Fmcnet: Feature-level modality compensation for visible-infrared person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7349–7358

  57. Zhang Q, Lai J, Xie X (2021) Learning modal-invariant angular metric by cyclic projection network for vis-nir person re-identification. IEEE Trans Image Process 30:8019–8033

    Article  MathSciNet  Google Scholar 

  58. Zhang T, Sun X, Li X, Yi Z (2021) Image generation and constrained two-stage feature fusion for person re-identification. Appl Intell 51:7679–7689

    Article  Google Scholar 

  59. Zheng L, Huang Y, Lu H, Yang Y (2019) Pose-invariant embedding for deep person re-identification. IEEE Trans Image Process 28(9):4500–4509

    Article  MathSciNet  Google Scholar 

  60. Zheng L, Shen L, Tian L, Wang S, Wang J, Tian Q (2015) Scalable person re-identification: A benchmark. In: Proceedings of the IEEE international conference on computer vision, pp 1116–1124

  61. Zheng Z, Yang X, Yu Z, Zheng L, Yang Y, Kautz J (2019b) Joint discriminative and generative learning for person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp2138–2147

  62. Zheng Z, Zheng L, Yang Y (2017) A discriminatively learned cnn embedding for person reidentification. ACM Trans Multimed Comput Commun Appl 14(1):1–20

    Article  Google Scholar 

  63. Zhong X, Lu T, Huang W, Ye M, Jia X, Lin C (2022) Grayscale enhancement colorization network for visible-infrared person re-identification. IEEE Trans Circuits Syst Video Technol 32:1418–1430

    Article  Google Scholar 

  64. Zhong Z, Zheng L, Zheng Z, Li S, Yang Y (2018) Camera style adaptation for person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 5157–5166

  65. Zhou K, Yang Y, Cavallaro A, Xiang T (2021) Learning generalisable omni-scale representations for person re-identification. IEEE Trans Pattern Anal Mach Intell 44(9):5056–5069

    Google Scholar 

  66. Zhou Q, Zhong B, Lan X, Sun G, Zhang Y, Zhang B, Ji R (2020) Fine-grained spatial alignment model for person re-identification with focal triplet loss. IEEE Trans Image Process 29:7578–7589

    Article  Google Scholar 

  67. Zhu Y, Yang Z, Wang L, Zhao S, Hu X, Tao D (2020) Hetero-center loss for cross-modality person re-identification. Neurocomputing 386:97–109

    Article  Google Scholar 

Download references

Acknowledgements

This work is supported by the National Natural Science Foundation of China under Grant No. 62072348, the New Introduced Talents Program of University of Jinan under Grant No. 1009569. The numerical calculations in this paper have been done on the supercomputing system in the Supercomputing Center of Wuhan University. The authors would also like to thank the Editor and anonymous reviewers for their valuable comments and suggestions, which significantly improved the quality of this article.

Author information

Authors and Affiliations

Authors

Contributions

Tongzhen Si: Conceptualization, Methodology, Software, Investigation, Validation, Writing - original draft, Writing - revision. Fazhi He: Writing - review, Supervision, Project administration, Funding acquisition. Penglei Li: Methodology, Writing - review, Validation.

Corresponding author

Correspondence to Fazhi He.

Ethics declarations

Ethical approval

This article does not contain any studies with human participants performed by any of the authors

Conflict of Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Si, T., He, F. & Li, P. UnifiedSC: a unified framework via collaborative optimization for multi-task person re-identification. Appl Intell 54, 2962–2975 (2024). https://doi.org/10.1007/s10489-024-05333-0

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10489-024-05333-0

Keywords