Learning relations in human-like style for few-shot fine-grained image classification

Li, Shenming; Feng, Lin; Xue, Linsong; Wang, Yifan; Wang, Dong

doi:10.1007/s13042-021-01473-8

Learning relations in human-like style for few-shot fine-grained image classification

Original Article
Published: 01 December 2021

Volume 14, pages 377–385, (2023)
Cite this article

International Journal of Machine Learning and Cybernetics Aims and scope Submit manuscript

Shenming Li^1,2,3,
Lin Feng¹,
Linsong Xue²,
Yifan Wang^1,3 &
…
Dong Wang³

537 Accesses
1 Citation
1 Altmetric
Explore all metrics

Abstract

Fine-grained classification is a challenging problem with small inter-class variance and large intra-class variance. It becomes more difficult when only a few labeled training samples are available. Inspired by the procedure of human recognition that two similar objects are usually distinguished by comparing their key parts, we develop a novel few-shot fine-grained classification method, which learns to model the inter-class boundaries in human-like style, i.e., extracting key-part structure information of objects and performing part-by-part comparison. To this end, we first extract the key parts of objects by using the designed key-part detector, which are then encoded by our structure encoder for the final comparison. To tackle with the scarce labeled samples, we train the proposed network under the metric-based few-shot learning methodology. Experiments on benchmark datasets demonstrate the effectiveness of the proposed method compared with the state-of-the-art counterparts. Besides, extensive investigations are conducted to verify the contributions of the key components of our method.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

A few-shot fine-grained image classification method leveraging global and local structures

Article 05 March 2022

Siyu Cao, Wen Wang, … Qingyong Li

Attentive fine-grained recognition for cross-domain few-shot classification

Article 31 January 2022

Liangbing Sa, Chongchong Yu, … Tao Xie

Learning to focus: cascaded feature matching network for few-shot image recognition

Article 30 July 2021

Mengting Chen, Xinggang Wang, … Wenyu Liu

References

Alfassy A, Karlinsky L, Aides A, Shtok J, Harary S, Feris R, Giryes R, Bronstein A.M (2019) Laso: Label-set operations networks for multi-label few-shot learning. In: CVPR
Chen Y, Bai Y, Zhang W, Mei T (2019) Destruction and construction learning for fine-grained image recognition. In: CVPR. pp 5157–5166
Cui Y, Zhou F, Wang J, Liu X, Lin Y, Belongie S (2017) Kernel pooling for convolutional neural networks. In: CVPR
Fu J, Zheng H, Mei T. Look closer to see better: recurrent attention convolutional neural network for fine-grained image recognition. In: CVPR. pp 4476–4484
Fu J, Zheng H, Tao M (2017) Look closer to see better: recurrent attention convolutional neural network for fine-grained image recognition. In: CVPR
Goring C, Rodner E, Freytag A, Denzler J (2014) Nonparametric part transfer for fine-grained recognition. In: CVPR
He K, Zhang X, Ren S, Sun J. Deep residual learning for image recognition. In: CVPR. pp 770–778
Horn GV, Branson S, Farrell R, Haber S, Belongie S (2015) Building a bird recognition app and large scale dataset with citizen scientists: the fine print in fine-grained dataset collection. In: CVPR
Horn G.V, Perona P (2017)The devil is in the tails: fine-grained classification in the wild. arXiv arXiv:1709.01450
Kim J, Kim T, Kim S, Yoo CD (2019) Edge-labeling graph neural network for few-shot learning. In: CVPR
Kong S, Fowlkes C (2017) Low-rank bilinear pooling for fine-grained classification. CVPR. pp 7025–7034
Kong S, Fowlkes C (2017) Low-rank bilinear pooling for fine-grained classification. In: CVPR
Li P, Xie J, Wang Q, Gao Z (2018) Towards faster training of global covariance pooling networks by iterative matrix square root normalization. In: CVPR
Li P, Xie J, Wang Q, Zuo W(2017) Is second-order information helpful for large-scale visual recognition? In: ICCV
Li W, Wang L, Xu J, Huo J, Gao Y, Luo J (2019)Revisiting local descriptor based image-to-class measure for few-shot learning. In: CVPR
Li W, Xu J, Huo J, Wang L, Luo J(2019) Distribution consistency based covariance metric networks for few-shot learning. In: AAAI
Lin TY, RoyChowdhury A, Maji S (2015) Bilinear CNN models for fine-grained visual recognition. In: ICCV
Liu J, Belhumeur PN (2013) Bird part localization using exemplar-based models with enforced pose and subcategory consistency. In: CVPR
Pfister T, Charles J, Zisserman A (2014) Domain-adaptive discriminative one-shot learning of gestures. In: ECCV
Redmon J, Farhadi A (2018) Yolov3: an incremental improvement. arXiv arXiv:1804.02767
Snell J, Swersky K, Zemel RS (2017) Prototypical networks for few-shot learning. In: NeurIPS
Sung F, Yang Y, Zhang L, Xiang T, Torr PHS, Hospedales TM (2017) Learning to compare: Relation network for few-shot learning. In: CVPR
Vinyals O, Blundell C, Lillicrap T, Kavukcuoglu K, Wierstra D (2016) Matching networks for one shot learning. In: NeurIPS
Wah C, Branson S, Welinder P, Perona P, Belongie S (2011) The Caltech-UCSD birds 200-2011 dataset. California Institute of Technology
Wang K, Wang X, Zhang T, Cheng Y(2021) Few-shot learning with deep balanced network and acceleration strategy. Int J Mach Learn Cybern 1–12
Wang Y, Morariu VI, Davis LS. Learning a discriminative filter bank within a CNN for fine-grained recognition. In: CVPR. pp 4148–4157
Wang YX, Girshick R, Hebert M, Hariharan B (2018) Low-shot learning from imaginary data. In: CVPR
Wu Z, Li Y, Guo L, Jia K (2019) Parn: position-aware relation networks for few-shot learning. In: ICCV
Yuxiong W, Martial H (2016) Learning from small sample sets by combining unsupervised meta-training with CNNS. In: NeurIPS
Zhang L, Huang S, Liu W, Tao D (2019) Learning a mixture of granularity-specific experts for fine-grained categorization. In: CVPR
Zheng H, Fu J, Zha Z, Luo J. Looking for the devil in the details: learning trilinear attention sampling network for fine-grained image recognition. In: CVPR. 5012–5021
Zheng H, Fu J, Zha ZJ, Luo J (2019) Learning deep bilinear transformation for fine-grained image representation. In: NeurIPS
Zhong SH, Huang X, Xiao Z (2020) Fine-art painting classification via two-channel dual path networks. Int J Mach Learn Cybern 11(1):137–152
Article Google Scholar

Download references

Author information

Authors and Affiliations

School of Innovation and Entrepreneurship, Dalian University of Technology, Dalian, China
Shenming Li, Lin Feng & Yifan Wang
School of Computer Science and Technology, Dalian University of Technology, Dalian, China
Shenming Li & Linsong Xue
Ningbo Institute of Dalian University of Technology, Ningbo, China
Shenming Li, Yifan Wang & Dong Wang

Authors

Shenming Li
View author publications
You can also search for this author in PubMed Google Scholar
Lin Feng
View author publications
You can also search for this author in PubMed Google Scholar
Linsong Xue
View author publications
You can also search for this author in PubMed Google Scholar
Yifan Wang
View author publications
You can also search for this author in PubMed Google Scholar
Dong Wang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yifan Wang.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Li, S., Feng, L., Xue, L. et al. Learning relations in human-like style for few-shot fine-grained image classification. Int. J. Mach. Learn. & Cyber. 14, 377–385 (2023). https://doi.org/10.1007/s13042-021-01473-8

Download citation

Received: 11 April 2021
Accepted: 01 November 2021
Published: 01 December 2021
Issue Date: February 2023
DOI: https://doi.org/10.1007/s13042-021-01473-8

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Learning relations in human-like style for few-shot fine-grained image classification

Abstract

Access this article

Similar content being viewed by others

A few-shot fine-grained image classification method leveraging global and local structures

Attentive fine-grained recognition for cross-domain few-shot classification

Learning to focus: cascaded feature matching network for few-shot image recognition

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Learning relations in human-like style for few-shot fine-grained image classification

Abstract

Access this article

Similar content being viewed by others

A few-shot fine-grained image classification method leveraging global and local structures

Attentive fine-grained recognition for cross-domain few-shot classification

Learning to focus: cascaded feature matching network for few-shot image recognition

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation