skip to main content
research-article

GreyReID: A Novel Two-stream Deep Framework with RGB-grey Information for Person Re-identification

Published: 16 April 2021 Publication History

Abstract

In this article, we observe that most false positive images (i.e., different identities with query images) in the top ranking list usually have the similar color information with the query image in person re-identification (Re-ID). Meanwhile, when we use the greyscale images generated from RGB images to conduct the person Re-ID task, some hard query images can obtain better performance compared with using RGB images. Therefore, RGB and greyscale images seem to be complementary to each other for person Re-ID. In this article, we aim to utilize both RGB and greyscale images to improve the person Re-ID performance. To this end, we propose a novel two-stream deep neural network with RGB-grey information, which can effectively fuse RGB and greyscale feature representations to enhance the generalization ability of Re-ID. First, we convert RGB images to greyscale images in each training batch. Based on these RGB and greyscale images, we train the RGB and greyscale branches, respectively. Second, to build up connections between RGB and greyscale branches, we merge the RGB and greyscale branches into a new joint branch. Finally, we concatenate the features of all three branches as the final feature representation for Re-ID. Moreover, in the training process, we adopt the joint learning scheme to simultaneously train each branch by the independent loss function, which can enhance the generalization ability of each branch. Besides, a global loss function is utilized to further fine-tune the final concatenated feature. The extensive experiments on multiple benchmark datasets fully show that the proposed method can outperform the state-of-the-art person Re-ID methods. Furthermore, using greyscale images can indeed improve the person Re-ID performance in the proposed deep framework.

References

[1]
Binghui Chen, Weihong Deng, and Jiani Hu. 2019. Mixed high-order attention network for person re-identification. In IEEE International Conference on Computer Vision (ICCV’19).
[2]
Weihua Chen, Xiaotang Chen, Jianguo Zhang, and Kaiqi Huang. 2017. Beyond triplet loss: A deep quadruplet network for person re-identification. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR’17).
[3]
De Cheng, Yihong Gong, Sanping Zhou, Jinjun Wang, and Nanning Zheng. 2016. Person re-identification by multi-channel parts-based CNN with improved triplet loss function. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR’16).
[4]
Zuozhuo Dai, Mingqiang Chen, Siyu Zhu, and Ping Tan. 2018. Batch feature erasing for person re-identification and beyond. In IEEE International Conference on Computer Vision (ICCV'19).
[5]
Jia Deng, Wei Dong, Richard Socher, Li-Jia Li, Kai Li, and Fei-Fei Li. 2009. ImageNet: A large-scale hierarchical image database. In IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’09).
[6]
Hehe Fan, Liang Zheng, Chenggang Yan, and Yi Yang. 2018. Unsupervised person re-identification: Clustering and fine-tuning. ACM Trans. Multimedia Comput. Commun. Applic. 14, 4 (2018).
[7]
Jianyuan Guo, Yuhui Yuan, Lang Huang, Chao Zhang, Jin-Ge Yao, and Kai Han. 2019. Beyond human parts: Dual part-aligned representations for person re-identification. In IEEE International Conference on Computer Vision (ICCV’19).
[8]
Raia Hadsell, Sumit Chopra, and Yann LeCun. 2006. Dimensionality reduction by learning an invariant mapping. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR’06).
[9]
Kai Han, Jianyuan Guo, Chao Zhang, and Mingjian Zhu. 2018. Attribute-aware attention model for fine-grained representation learning. In ACM Multimedia Conference on Multimedia Conference (ACM MM’18).
[10]
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep residual learning for image recognition. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR’16).
[11]
Alexander Hermans, Lucas Beyer, and Bastian Leibe. 2017. In defense of the triplet loss for person re-identification. arXiv (2017).
[12]
Ruibing Hou, Bingpeng Ma, Hong Chang, Xinqian Gu, Shiguang Shan, and Xilin Chen. 2019. Interaction-and-aggregation network for person re-identification. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR’19).
[13]
Jie Hu, Li Shen, and Gang Sun. 2018. Squeeze-and-excitation networks. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR’18).
[14]
Mahdi M. Kalayeh, Emrah Basaran, Muhittin Gökmen, Mustafa E. Kamasak, and Mubarak Shah. 2018. Human semantic parsing for person re-identification. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR’18).
[15]
Wei Li, Rui Zhao, Tong Xiao, and Xiaogang Wang. 2014. DeepReID: Deep filter pairing neural network for person re-identification. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR’14).
[16]
Wei Li, Xiatian Zhu, and Shaogang Gong. 2017. Person re-identification by deep joint learning of multi-loss classification. In Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence (IJCAI'17).
[17]
Hao Liu, Jiashi Feng, Meibin Qi, Jianguo Jiang, and Shuicheng Yan. 2017. End-to-end comparative attention networks for person re-identification. IEEE Trans. Image Proc. 26, 7 (2017).
[18]
Jinxian Liu, Bingbing Ni, Yichao Yan, Peng Zhou, Shuo Cheng, and Jianguo Hu. 2018. Pose transferrable person re-identification. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR’18).
[19]
Jiawei Liu, Zheng-Jun Zha, Hongtao Xie, Zhiwei Xiong, and Yongdong Zhang. 2018. CANet: Contextual-attentional attribute-appearance network for person re-identification. In ACM Multimedia Conference on Multimedia Conference (ACM MM’19).
[20]
Lei Qi, Jing Huo, Xiaocong Fan, Yinghuan Shi, and Yang Gao. 2018. Unsupervised joint subspace and dictionary learning for enhanced cross-domain person re-identification. IEEE J. Select. Topics Sig. Proc. 12, 6 (2018).
[21]
Lei Qi, Jing Huo, Lei Wang, Yinghuan Shi, and Yang Gao. 2018. MaskReID: A mask based deep ranking neural network for person re-identification. arXiv (2018).
[22]
Lei Qi, Lei Wang, Jing Huo, Luping Zhou, Yinghuan Shi, and Yang Gao. 2019. A novel unsupervised camera-aware domain adaptation framework for person re-identification. In IEEE International Conference on Computer Vision (ICCV'19).
[23]
Alec Radford, Luke Metz, and Soumith Chintala. 2016. Unsupervised representation learning with deep convolutional generative adversarial networks. In International Conference on Learning Representations (ICLR’16).
[24]
Ergys Ristani, Francesco Solera, Roger Zou, Rita Cucchiara, and Carlo Tomasi. 2016. Performance measures and a data set for multi-target, multi-camera tracking. In The European Conference on Computer Vision (ECCV’16).
[25]
Jianlou Si, Honggang Zhang, Chun-Guang Li, Jason Kuen, Xiangfei Kong, Alex C. Kot, and Gang Wang. 2018. Dual attention matching network for context-aware feature sequence based person re-identification. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR’18).
[26]
Chunfeng Song, Yan Huang, Wanli Ouyang, and Liang Wang. 2018. Mask-guided contrastive attention model for person re-identification. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR’18).
[27]
Sijie Song, Wei Zhang, Jiaying Liu, and Tao Mei. 2019. Unsupervised person image generation with semantic parsing transformation. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR’19).
[28]
Chi Su, Jianing Li, Shiliang Zhang, Junliang Xing, Wen Gao, and Qi Tian. 2017. Pose-driven deep convolutional model for person re-identification. In IEEE International Conference on Computer Vision (CVPR’17).
[29]
Yifan Sun, Liang Zheng, Yi Yang, Qi Tian, and Shengjin Wang. 2018. Beyond part models: Person retrieval with refined part pooling (and a strong convolutional baseline). In The European Conference on Computer Vision (ECCV’18).
[30]
Chiat-Pin Tay, Sharmili Roy, and Kim-Hui Yap. 2019. AANet: Attribute attention network for person re-identifications. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR’19).
[31]
Maoqing Tian, Shuai Yi, Hongsheng Li, Shihua Li, Xuesen Zhang, Jianping Shi, Junjie Yan, and Xiaogang Wang. 2018. Eliminating background-bias for robust person re-identification. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR’18).
[32]
Rahul Rama Varior, Bing Shuai, Jiwen Lu, Dong Xu, and Gang Wang. 2016. A Siamese long short-term memory architecture for human re-identification. In European Conference on Computer Vision (ECCV’16).
[33]
Cheng Wang, Qian Zhang, Chang Huang, Wenyu Liu, and Xinggang Wang. 2018. Mancs: A multi-task attentional network with curriculum sampling for person re-identification. In The European Conference on Computer Vision (ECCV’18).
[34]
Longhui Wei, Shiliang Zhang, Wen Gao, and Qi Tian. 2018. Person transfer GAN to bridge domain gap for person re-identification. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR’18).
[35]
Longhui Wei, Shiliang Zhang, Wen Gao, and Qi Tian. 2018. Person transfer GAN to bridge domain gap for person re-identification. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR’18).
[36]
Longhui Wei, Shiliang Zhang, Hantao Yao, Wen Gao, and Qi Tian. 2017. GLAD: Global-local-alignment descriptor for pedestrian retrieval. In ACM Multimedia Conference on Multimedia Conference (ACM MM’17).
[37]
Ancong Wu, Wei-Shi Zheng, Shaogang Gong, and Jianhuang Lai. 2020. RGB-IR person re-identification by cross-modality similarity preservation. Int. J. Comput. Vis. 128, 6 (2020).
[38]
Ancong Wu, Wei-Shi Zheng, Hong-Xing Yu, Shaogang Gong, and Jianhuang Lai. 2017. RGB-infrared cross-modality person re-identification. In IEEE International Conference on Computer Vision (ICCV’17).
[39]
Tong Xiao, Hongsheng Li, Wanli Ouyang, and Xiaogang Wang. 2016. Learning deep feature representations with domain guided dropout for person re-identification. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR’16).
[40]
Wenjie Yang, Houjing Huang, Zhang Zhang, Xiaotang Chen, Kaiqi Huang, and Shu Zhang. 2019. Towards rich feature discovery with class activation maps augmentation for person re-identification. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR’19).
[41]
Dong Yi, Zhen Lei, Shengcai Liao, and Stan Z. Li. 2014. Deep metric learning for person re-identification. In International Conference on Pattern Recognition (ICPR’14).
[42]
Quan Zhang, Haijie Cheng, Jianhuang Lai, and Xiaohua Xie. 2019. DHML: Deep heterogeneous metric learning for VIS-NIR person re-identification. In Chinese Conference on Biometric Recognition (CCBR’19).
[43]
Haiyu Zhao, Maoqing Tian, Shuyang Sun, Jing Shao, Junjie Yan, Shuai Yi, Xiaogang Wang, and Xiaoou Tang. 2017. Spindle Net: Person re-identification with human body region guided feature decomposition and fusion. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR’17).
[44]
Yiru Zhao, Xu Shen, Zhongming Jin, Hongtao Lu, and Xian-sheng Hua. 2019. Attribute-driven feature disentangling and temporal aggregation for video person re-identification. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR’19).
[45]
Liang Zheng, Liyue Shen, Lu Tian, Shengjin Wang, Jingdong Wang, and Qi Tian. 2015. Scalable person re-identification: A benchmark. In IEEE International Conference on Computer Vision (CVPR’15).
[46]
Meng Zheng, Srikrishna Karanam, Ziyan Wu, and Richard J. Radke. 2019. Re-identification with consistent attentive siamese networks. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR’19).
[47]
Zhedong Zheng, Xiaodong Yang, Zhiding Yu, Liang Zheng, Yi Yang, and Jan Kautz. 2019. Joint discriminative and generative learning for person re-identification. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR’19).
[48]
Zhedong Zheng, Liang Zheng, and Yi Yang. 2017. Unlabeled samples generated by GAN improve the person re-identification baseline in vitro. In IEEE International Conference on Computer Vision (ICCV’17).
[49]
Zhedong Zheng, Liang Zheng, and Yi Yang. 2018. A discriminatively learned CNN embedding for person reidentification. ACM Trans. Multimedia Comput. Commun. Applic. 14, 1 (2018).
[50]
Zhun Zhong, Liang Zheng, Donglin Cao, and Shaozi Li. 2017. Re-ranking person re-identification with k-reciprocal encoding. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR’17).
[51]
Zhun Zhong, Liang Zheng, Guoliang Kang, Shaozi Li, and Yi Yang. 2017. Random erasing data augmentation. In Proceedings of the AAAI Conference on Artificial Intelligence (AAAI'20).
[52]
Zhun Zhong, Liang Zheng, Shaozi Li, and Yi Yang. 2018. Generalizing a person retrieval model hetero-and homogeneously. In The European Conference on Computer Vision (ECCV’18).
[53]
Zhun Zhong, Liang Zheng, Zhedong Zheng, Shaozi Li, and Yi Yang. 2018. Camera style adaptation for person re-identification. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR’18).

Cited By

View all
  • (2024)Generalizable Metric Network for Cross-Domain Person Re-IdentificationIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2024.339541134:10_Part_1(9039-9052)Online publication date: 1-Oct-2024
  • (2023)A Feature Map is Worth a Video Frame: Rethinking Convolutional Features for Visible-Infrared Person Re-identificationACM Transactions on Multimedia Computing, Communications, and Applications10.1145/361737520:2(1-20)Online publication date: 24-Aug-2023
  • (2023)Dynamic Message Propagation Network for RGB-D and Video Salient Object DetectionACM Transactions on Multimedia Computing, Communications, and Applications10.1145/359761220:1(1-21)Online publication date: 19-May-2023
  • Show More Cited By

Index Terms

  1. GreyReID: A Novel Two-stream Deep Framework with RGB-grey Information for Person Re-identification

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Transactions on Multimedia Computing, Communications, and Applications
      ACM Transactions on Multimedia Computing, Communications, and Applications  Volume 17, Issue 1
      February 2021
      392 pages
      ISSN:1551-6857
      EISSN:1551-6865
      DOI:10.1145/3453992
      Issue’s Table of Contents
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 16 April 2021
      Accepted: 01 August 2020
      Revised: 01 April 2020
      Received: 01 August 2019
      Published in TOMM Volume 17, Issue 1

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. Person re-identification
      2. greyscale person images
      3. two-stream deep framework

      Qualifiers

      • Research-article
      • Refereed

      Funding Sources

      • Natural Science Foundation of China
      • Jiangsu Natural Science Foundation
      • National Key Research and Development Program of China
      • Science and Technology Innovation 2030-“New Generation Artificial Intelligence”

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)31
      • Downloads (Last 6 weeks)2
      Reflects downloads up to 03 Mar 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)Generalizable Metric Network for Cross-Domain Person Re-IdentificationIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2024.339541134:10_Part_1(9039-9052)Online publication date: 1-Oct-2024
      • (2023)A Feature Map is Worth a Video Frame: Rethinking Convolutional Features for Visible-Infrared Person Re-identificationACM Transactions on Multimedia Computing, Communications, and Applications10.1145/361737520:2(1-20)Online publication date: 24-Aug-2023
      • (2023)Dynamic Message Propagation Network for RGB-D and Video Salient Object DetectionACM Transactions on Multimedia Computing, Communications, and Applications10.1145/359761220:1(1-21)Online publication date: 19-May-2023
      • (2023)A Novel Mix-Normalization Method for Generalizable Multi-Source Person Re-IdentificationIEEE Transactions on Multimedia10.1109/TMM.2022.318339325(4856-4867)Online publication date: 1-Jan-2023
      • (2022)A Large-Scale Synthetic Gait Dataset Towards in-the-Wild Simulation and Comparison StudyACM Transactions on Multimedia Computing, Communications, and Applications10.1145/351719919:1(1-23)Online publication date: 21-Jul-2022
      • (2022)Clustering Matters: Sphere Feature for Fully Unsupervised Person Re-identificationACM Transactions on Multimedia Computing, Communications, and Applications10.1145/350140418:4(1-18)Online publication date: 15-Mar-2022

      View Options

      Login options

      Full Access

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      HTML Format

      View this article in HTML Format.

      HTML Format

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media