short-paper

Collaborative Representation for Deep Meta Metric Learning

Authors:

Baodi Liu

ICMR '21: Proceedings of the 2021 International Conference on Multimedia Retrieval

Pages 506 - 510

https://doi.org/10.1145/3460426.3463583

Published: 01 September 2021

Abstract

Most metric learning methods use all of the training data to construct a single metric, which tends to overfit to the "salient" features. To overcome this issue, we propose a deep meta metric learning method based on collaborative representation. We construct multiple episodes from the original training data to train a general metric, where each episode consists of a query set and a support set. We then introduce a collaborative representation method that fits each query sample with the support samples of each class, and we predict the query sample's label from the class whose support samples give the best fit. In addition, we adopt a hard mining strategy that increases the difficulty of the training tasks in order to learn a more discriminative metric. Experiments verify that our method achieves state-of-the-art results on three re-ID benchmark datasets.
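
The per-class fitting step described above can be illustrated with a minimal sketch. It assumes the generic ridge-regularized form of collaborative representation (fit the query as a linear combination of one class's support samples, then compare residuals); the abstract does not give the paper's exact objective, so this is not the authors' implementation. The names `class_residual` and `predict_label`, the regularization weight `lam`, and the use of raw vectors in place of learned deep embeddings are all assumptions made for illustration.

```python
import numpy as np


def class_residual(query, support, lam=0.1):
    """Fit `query` (shape (d,)) with the rows of `support` (shape (k, d)).

    Solves the ridge problem min_a ||query - support.T @ a||^2 + lam * ||a||^2
    in closed form and returns the norm of the fitting residual.
    `lam` is an assumed hyperparameter, not a value taken from the paper.
    """
    k = support.shape[0]
    gram = support @ support.T + lam * np.eye(k)      # (k, k), positive definite
    coef = np.linalg.solve(gram, support @ query)     # (k,) fitting coefficients
    return float(np.linalg.norm(query - support.T @ coef))


def predict_label(query, support_by_class, lam=0.1):
    """Return the class whose support samples fit the query best, plus all residuals."""
    residuals = {
        label: class_residual(query, samples, lam)
        for label, samples in support_by_class.items()
    }
    best = min(residuals, key=residuals.get)          # smallest residual = best fit
    return best, residuals


# Toy episode: two classes with two support samples each, and one query.
support_by_class = {
    0: np.array([[1.0, 0.0, 0.0], [0.9, 0.1, 0.0]]),
    1: np.array([[0.0, 1.0, 0.0], [0.0, 0.9, 0.1]]),
}
query = np.array([1.0, 0.05, 0.0])
label, residuals = predict_label(query, support_by_class)
print(label, residuals)
```

In a training episode, the per-class residuals would typically be turned into a differentiable loss over the embedding network; how the paper does this is not stated in the abstract, so that step is left out here.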

Cited By

Qin X, Li L, Pang G, Hao F. (2024). Heterogeneous Graph Fusion Network for cross-modal image-text retrieval. Expert Systems with Applications: An International Journal, 249:PC. DOI: 10.1016/j.eswa.2024.123842. Online publication date: 17-Jul-2024.
https://dl.acm.org/doi/10.1016/j.eswa.2024.123842

Index Terms

Collaborative Representation for Deep Meta Metric Learning
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision problems
        Object identification
      2. Computer vision representations
        Image representations
  2. Machine learning
    1. Learning paradigms
      1. Supervised learning
        Supervised learning by classification

Published In

ICMR '21: Proceedings of the 2021 International Conference on Multimedia Retrieval

August 2021

715 pages

ISBN:9781450384636

DOI:10.1145/3460426

General Chairs:
Wen-Huang Cheng, National Yang Ming Chiao Tung University, Taiwan
Mohan Kankanhalli, National University of Singapore, Singapore
Meng Wang, Hefei University of Technology, China

Program Chairs:
Wei-Ta Chu, National Cheng Kung University, Taiwan
Jiaying Liu, Peking University, China
Marcel Worring, University of Amsterdam, Netherlands

Copyright © 2021 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGMM: ACM Special Interest Group on Multimedia

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

Short-paper

Funding Sources

Fundamental Research Funds for the Central Universities China University of Petroleum (East China)
Natural Science Foundation of Shandong Province China
Major Scientific and Technological Projects of CNPC
Creative Research Team of Young Scholars at Universities in Shandong Province

Conference

ICMR '21

Sponsor:

SIGMM

ICMR '21: International Conference on Multimedia Retrieval

August 21 - 24, 2021

Taipei, Taiwan

Acceptance Rates

Overall Acceptance Rate 254 of 830 submissions, 31%

Bibliometrics

Total Citations: 1
Total Downloads: 105
Downloads (Last 12 months): 6
Downloads (Last 6 weeks): 1

Reflects downloads up to 16 Feb 2025
