skip to main content
10.1145/3394171.3413607acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

A Structured Graph Attention Network for Vehicle Re-Identification

Published: 12 October 2020 Publication History

Abstract

Vehicle re-identification aims to identify the same vehicle across different surveillance cameras and plays an important role in public security. Existing approaches mainly focus on exploring informative regions or learning an appropriate distance metric. However, they not only neglect the inherent structured relationship between discriminative regions within an image, but also ignore the extrinsic structured relationship among images. The inherent and extrinsic structured relationships are crucial to learning effective vehicle representation. In this paper, we propose a Structured Graph ATtention network (SGAT) to fully exploit these relationships and allow the message propagation to update the features of graph nodes. SGAT creates two graphs for one probe image. One is an inherent structured graph based on the geometric relationship between the landmarks that can use features of their neighbors to enhance themselves. The other is an extrinsic structured graph guided by the attribute similarity to update image representations. Experimental results on two public vehicle re-identification datasets including VeRi-776 and VehicleID have shown that our proposed method achieves significant improvements over the state-of-the-art methods.

Supplementary Material

MP4 File (3394171.3413607.mp4)
In this paper, we proposed a novel Structured Graph ATtention network (SGAT) for vehicle re-identification by jointly exploiting the inherent structured relationship between landmarks and the extrinsic structured relationship between images. In particular, an inherent structured graph attention network based on the geometric relationship between landmarks is designed to enhance the landmark features by neighboring landmarks. An extrinsic structured graph attention network is designed with a gated attribute similarity matrix to improve vehicle feature by exploiting the relationship between images. We conducted extensive experiments to evaluate SGAT on two public vehicle re-identification datasets, i.e.,VeRi-776 and VehicleID, and reported significant performance improvements over the 17 state-of-the-art methods.

References

[1]
Yan Bai, Yihang Lou, Feng Gao, Shiqi Wang, Yuwei Wu, and Ling-Yu Duan. 2018. Group-Sensitive Triplet Embedding for Vehicle Re-Identification. IEEE Transactions on Multimedia, Vol. 20, 9 (2018), 2385--2399.
[2]
Joan Bruna, Wojciech Zaremba, Arthur Szlam, and Yann LeCun. 2014. Spectral Networks and Locally Connected Networks on Graphs. International Conference on Learning Representations (2014).
[3]
Michaël Defferrard, Xavier Bresson, and Pierre Vandergheynst. 2016. Convolutional Neural Networks on Graphs with Fast Localized Spectral Filtering. In Advances in Neural Information Processing Systems. 3844--3852.
[4]
Yan Em, Feng Gag, Yihang Lou, Shiqi Wang, Tiejun Huang, and Ling-Yu Duan. 2017. Incorporating Intra-Class Variance to Fine-Grained Visual Recognition. In International Conference on Multimedia and Expo. IEEE, 1452--1457.
[5]
Junyu Gao, Tianzhu Zhang, and Changsheng Xu. 2019. Graph Convolutional Tracking. In Computer Vision and Pattern Recognition. IEEE.
[6]
Haiyun Guo, Chaoyang Zhao, Zhiwei Liu, Jinqiao Wang, and Hanqing Lu. 2018. Learning Coarse-to-Fine Structured Feature Embedding for Vehicle Re-Identification. In AAAI Conference on Artificial Intelligence.
[7]
Mikael Henaff, Joan Bruna, and Yann LeCun. 2015. Deep Convolutional Networks on Graph-Structured Data. arXiv preprint arXiv:1506.05163 (2015).
[8]
Alexander Hermans, Lucas Beyer, and Bastian Leibe. 2017. In Defense of the Triplet Loss for Person Re-Identification. arXiv preprint arXiv:1703.07737 (2017).
[9]
Yukun Huang, Zheng-Jun Zha, Xueyang Fu, Richang Hong, and Liang Li. 2020. Real-World Person Re-Identification via Degradation Invariance Learning. In Computer Vision and Pattern Recognition. IEEE.
[10]
Aytacc Kanaci, Xiatian Zhu, and Shaogang Gong. 2017. Vehicle Re-Identification by Fine-Grained Cross-Level Deep Learning. In The British Machine Vision Conference Workshop, Vol. 2. 772--788.
[11]
Pirazh Khorramshahi, Amit Kumar, Neehar Peri, Sai Saketh Rambhatla, Jun-Cheng Chen, and Rama Chellappa. 2019. A Dual-Path Model With Adaptive Attention for Vehicle Re-Identification. In International Conference on Computer Vision. IEEE.
[12]
Thomas N Kipf and Max Welling. 2017. Semi-Supervised Classification with Graph Convolutional Networks. International Conference on Learning Representations (2017).
[13]
John Boaz Lee, Ryan Rossi, and Xiangnan Kong. 2018. Graph Classification Using Structural Attention. In International Conference on Knowledge Discovery and Data Mining. ACM, 1666--1674.
[14]
Yuqi Li, Yanghao Li, Hongfei Yan, and Jiaying Liu. 2017. Deep Joint Discriminative Learning for Vehicle Re-Identification and Retrieval. In International Conference on Image Processing. IEEE, 395--399.
[15]
Shengcai Liao, Yang Hu, Xiangyu Zhu, and Stan Z Li. 2015. Person Re-Identification by Local Maximal Occurrence Representation and Metric Learning. In Computer Vision and Pattern Recognition. IEEE, 2197--2206.
[16]
Hongye Liu, Yonghong Tian, Yaowei Yang, Lu Pang, and Tiejun Huang. 2016c. Deep Relative Distance Learning: Tell the Difference between Similar Vehicles. In Computer Vision and Pattern Recognition. IEEE, 2167--2175.
[17]
Jiawei Liu, Zheng-Jun Zha, Di Chen, Richang Hong, and Meng Wang. 2019 a. Adaptive Transfer Network for Cross-Domain Person Re-Identification. In Computer Vision and Pattern Recognition. IEEE, 7202--7211.
[18]
Jiawei Liu, Zheng-Jun Zha, Xuejin Chen, Zilei Wang, and Yongdong Zhang. 2019 b. Dense 3d-Convolutional Neural Network for Person Re-identification in Videos. ACM Transactions on Multimedia Computing, Communications, and Applications, Vol. 15, 1s (2019), 8.
[19]
Jiawei Liu, Zheng-Jun Zha, QI Tian, Dong Liu, Ting Yao, Qiang Ling, and Tao Mei. 2016 d. Multi-Scale Triplet CNN for Person Re-Identification. In ACM international conference on Multimedia. ACM, 192--196.
[20]
Jiawei Liu, Zheng-Jun Zha, Hongtao Xie, Zhiwei Xiong, and Yongdong Zhang. 2018a. CA3Net: Contextual-Attentional Attribute-Appearance Network for Person Re-Identification. In ACM International Conference on Multimedia. ACM, 737--745.
[21]
Xinchen Liu, Wu Liu, Huadong Ma, and Huiyuan Fu. 2016a. Large-scale Vehicle Re-Identification in Urban Surveillance Videos. In International Conference on Multimedia and Expo. IEEE, 1--6.
[22]
Xinchen Liu, Wu Liu, Tao Mei, and Huadong Ma. 2016b. A Deep Learning-Based Approach to Progressive Vehicle Re-Identification for Urban Surveillance. In European Conference on Computer Vision. Springer, 869--884.
[23]
Xinchen Liu, Wu Liu, Tao Mei, and Huadong Ma. 2017. Provid: Progressive and Multimodal Vehicle Re-Identification for Large-Scale Urban Surveillance. IEEE Transactions on Multimedia, Vol. 20, 3 (2017), 645--658.
[24]
Xiaobin Liu, Shiliang Zhang, Qingming Huang, and Wen Gao. 2018b. RAM: A Region-Aware Deep Model for Vehicle Re-Identification. In International Conference on Multimedia and Expo. IEEE, 1--6.
[25]
Yihang Lou, Yan Bai, Jun Liu, Shiqi Wang, and Ling-Yu Duan. 2019. Embedding Adversarial Learning for Vehicle Re-Identification. IEEE Transactions on Image Processing (2019).
[26]
Kenneth Marino, Ruslan Salakhutdinov, and Abhinav Gupta. 2017. The More You Know: Using Knowledge Graphs for Image Classification. (2017), 2673--2681.
[27]
Dechao Meng, Liang Li, Xuejing Liu, Yadong Li, Shijie Yang, Zheng-Jun Zha, Xingyu Gao, Shuhui Wang, and Qingming Huang. 2020 a. Parsing-Based View-Aware Embedding Network for Vehicle Re-Identification. In Computer Vision and Pattern Recognition. IEEE, 7103--7112.
[28]
Dechao Meng, Liang Li, Shuhui Wang, Zheng-Jun Zha, Xingyu Gao, and Qingming Huang. 2020 b. Fine-Grained Feature Alignment with Part Perspective Transformation for Vehicle ReID. In ACM International Conference on Multimedia. ACM.
[29]
Will Norcliffe-Brown, Stathis Vafeias, and Sarah Parisot. 2018. Learning Conditioned Graph Structures for Interpretable Visual Question Answering. In Advances in Neural Information Processing Systems. 8334--8343.
[30]
Adam Paszke, Sam Gross, Soumith Chintala, Gregory Chanan, Edward Yang, Zachary DeVito, Zeming Lin, Alban Desmaison, Luca Antiga, and Adam Lerer. 2017. Automatic Differentiation in Pytorch. (2017).
[31]
Yantao Shen, Tong Xiao, Hongsheng Li, Shuai Yi, and Xiaogang Wang. 2017. Learning Deep Neural Networks for Vehicle Re-Id with Visual-Spatio-Temporal Path Proposals. In International Conference on Computer Vision. IEEE, 1900--1909.
[32]
Karen Simonyan and Andrew Zisserman. 2015. Very Deep Convolutional Networks for Large-Scale Image Recognition. International Conference on Learning Representations (2015).
[33]
Yong Tang, Congzhe Zhang, Renshu Gu, Peng Li, and Bin Yang. 2017. Vehicle Detection and Recognition for Intelligent Traffic Surveillance System. Multimedia Tools and Applications, Vol. 76, 4 (2017), 5817--5832.
[34]
Shangzhi Teng, Xiaobin Liu, Shiliang Zhang, and Qingming Huang. 2018. SCAN: Spatial and Channel Attention Network for Vehicle Re-Identification. In Pacific Rim Conference on Multimedia. Springer, 350--361.
[35]
Petar Velivc ković, Guillem Cucurull, Arantxa Casanova, Adriana Romero, Pietro Lio, and Yoshua Bengio. 2018. Graph Attention Networks. International Conference on Learning Representations (2018).
[36]
Xiaolong Wang and Abhinav Gupta. 2018. Videos as Space-Time Region Graphs. In European Conference on Computer Vision. Springer, 399--417.
[37]
Yue Wang, Yongbin Sun, Ziwei Liu, Sanjay E Sarma, Michael M Bronstein, and Justin M Solomon. 2019. Dynamic graph cnn for learning on point clouds. ACM Transactions On Graphics, Vol. 38, 5 (2019), 1--12.
[38]
Zheng Wang, Ruimin Hu, Chao Liang, Yi Yu, Junjun Jiang, Mang Ye, Jun Chen, and Qingming Leng. 2015. Zero-Shot Person Re-Identification via Cross-View Consistency. IEEE Transactions on Multimedia, Vol. 18, 2 (2015), 260--272.
[39]
Zhongdao Wang, Luming Tang, Xihui Liu, Zhuliang Yao, Shuai Yi, Jing Shao, Junjie Yan, Shengjin Wang, Hongsheng Li, and Xiaogang Wang. 2017. Orientation Invariant Feature Embedding and Spatial Temporal Regularization for Vehicle Re-Identification. In International Conference on Computer Vision. IEEE, 379--387.
[40]
Xiu-Shen Wei, Chen-Lin Zhang, Lingqiao Liu, Chunhua Shen, and Jianxin Wu. 2018. Coarse-to-Fine: A RNN-based Hierarchical Attention Model for Vehicle Re-Identification. Asian Conference on Computer Vision (2018).
[41]
Lin Wu, Yang Wang, Junbin Gao, and Xue Li. 2018. Where-and-When to Look: Deep Siamese Attention Networks for Video-Based Person Re-Identification. IEEE Transactions on Multimedia, Vol. 21, 6 (2018), 1412--1424.
[42]
Bin Xiao, Haiping Wu, and Yichen Wei. 2018. Simple Baselines for Human Pose Estimation and Tracking. In European Conference on Computer Vision. Springer, 466--481.
[43]
Ke Yan, Yonghong Tian, Yaowei Wang, Wei Zeng, and Tiejun Huang. 2017. Exploiting Multi-Grain Ranking Constraints for Precisely Searching Visually-Similar Vehicles. In International Conference on Computer Vision. IEEE, 562--570.
[44]
Zheng-Jun Zha, Jiawei Liu, Di Chen, and Feng Wu. 2020. Adversarial Attribute-Text Embedding for Person Search With Natural Language Query. IEEE Transactions on Multimedia, Vol. 22, 7 (2020), 1836--1846.
[45]
Junping Zhang, Fei-Yue Wang, Kunfeng Wang, Wei-Hua Lin, Xin Xu, and Cheng Chen. 2011. Data-Driven Intelligent Transportation Systems: A Survey. IEEE Transactions on Intelligent Transportation Systems, Vol. 12, 4 (2011), 1624--1639.
[46]
Li Zhang, Tao Xiang, and Shaogang Gong. 2016. Learning a Discriminative Null Space for Person Re-Identification. In Computer Vision and Pattern Recognition. IEEE, 1239--1248.
[47]
Muhan Zhang, Zhicheng Cui, Marion Neumann, and Yixin Chen. 2018. An End-to-End Deep Learning Architecture for Graph Classification. In AAAI Conference on Artificial Intelligence.
[48]
Yiheng Zhang, Dong Liu, and Zheng-Jun Zha. 2017. Improving Triplet-Wise Training of Convolutional Neural Network for Vehicle Re-identification. In International Conference on Multimedia and Expo. IEEE, 1386--1391.
[49]
Yu Zheng, Licia Capra, Ouri Wolfson, and Hai Yang. 2014. Urban Computing: Concepts, Methodologies, and Applications. ACM Transactions on Intelligent Systems and Technology, Vol. 5, 3 (2014), 38.
[50]
Yi Zhou and Ling Shao. 2017. Cross-View GAN Based Vehicle Generation for Re-Identification. In The British Machine Vision Conference, Vol. 1. 1--12.
[51]
Yi Zhou and Ling Shao. 2018a. Vehicle Re-Identification by Adversarial Bi-Directional LSTM Network. In Winter Conference on Applications of Computer Vision. IEEE, 653--662.
[52]
Yi Zhou and Ling Shao. 2018b. Viewpoint-Aware Attentive Multi-View Inference for Vehicle Re-Identification. In Computer Vision and Pattern Recognition. IEEE.
[53]
Jianqing Zhu, Huanqiang Zeng, Jingchang Huang, Shengcai Liao, Zhen Lei, Canhui Cai, and Lixin Zheng. 2019. Vehicle Re-Identification Using Quadruple Directional Deep Learning Features. IEEE Transactions on Intelligent Transportation Systems (2019).

Cited By

View all
  • (2025)Frequency transformer with local feature enhancement for improved vehicle re-identificationThe Journal of Supercomputing10.1007/s11227-025-07012-481:4Online publication date: 3-Mar-2025
  • (2024)Weighted Graph-Structured Semantics Constraint Network for Cross-Modal RetrievalIEEE Transactions on Multimedia10.1109/TMM.2023.328289426(1551-1564)Online publication date: 1-Jan-2024
  • (2024)DSA-SCGC: A Dual Self-Attention Mechanism based on Space-Channel Grouped Compression for Vehicle Re-Identification2024 International Joint Conference on Neural Networks (IJCNN)10.1109/IJCNN60899.2024.10650480(1-8)Online publication date: 30-Jun-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
MM '20: Proceedings of the 28th ACM International Conference on Multimedia
October 2020
4889 pages
ISBN:9781450379885
DOI:10.1145/3394171
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 12 October 2020

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. graph attention network
  2. landmark
  3. structured relationship
  4. vehicle re-identification

Qualifiers

  • Research-article

Funding Sources

  • National Natural Science Foundation of China
  • National Key R&D Program of China

Conference

MM '20
Sponsor:

Acceptance Rates

Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)40
  • Downloads (Last 6 weeks)6
Reflects downloads up to 05 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2025)Frequency transformer with local feature enhancement for improved vehicle re-identificationThe Journal of Supercomputing10.1007/s11227-025-07012-481:4Online publication date: 3-Mar-2025
  • (2024)Weighted Graph-Structured Semantics Constraint Network for Cross-Modal RetrievalIEEE Transactions on Multimedia10.1109/TMM.2023.328289426(1551-1564)Online publication date: 1-Jan-2024
  • (2024)DSA-SCGC: A Dual Self-Attention Mechanism based on Space-Channel Grouped Compression for Vehicle Re-Identification2024 International Joint Conference on Neural Networks (IJCNN)10.1109/IJCNN60899.2024.10650480(1-8)Online publication date: 30-Jun-2024
  • (2023)Image-Pair Correlation Learning for Vehicle Re-Identification2023 42nd Chinese Control Conference (CCC)10.23919/CCC58697.2023.10240969(7376-7381)Online publication date: 24-Jul-2023
  • (2023)SSR-Net: A Spatial Structural Relation Network for Vehicle Re-identificationACM Transactions on Multimedia Computing, Communications, and Applications10.1145/357857819:6(1-22)Online publication date: 12-Jul-2023
  • (2023)Viewpoint Alignment and Discriminative Parts Enhancement in 3D Space for Vehicle ReIDIEEE Transactions on Multimedia10.1109/TMM.2022.315410225(2954-2965)Online publication date: 1-Jan-2023
  • (2023)Progressive Context-Aware Graph Feature Learning for Target Re-IdentificationIEEE Transactions on Multimedia10.1109/TMM.2022.314064725(1230-1242)Online publication date: 1-Jan-2023
  • (2023)GiT: Graph Interactive Transformer for Vehicle Re-IdentificationIEEE Transactions on Image Processing10.1109/TIP.2023.323864232(1039-1051)Online publication date: 2023
  • (2023)A Spatial–Temporal Structural Estimation Model Based on GATE-PCGRU for Multirate Industrial ProcessIEEE Transactions on Instrumentation and Measurement10.1109/TIM.2023.329179672(1-11)Online publication date: 2023
  • (2023)A Dual-Branch Network for Few-Shot Vehicle Re-Identification With Enhanced Global and Local FeaturesIEEE Transactions on Instrumentation and Measurement10.1109/TIM.2023.328597872(1-12)Online publication date: 2023
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media