Disentangled Feature Network for Fine-Grained Recognition

Miao, Shuyu; Li, Shuaicheng; Zheng, Lin; Yu, Wei; Liu, Jingjing; Gong, Mingming; Feng, Rui

doi:10.1007/978-3-030-92270-2_38

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 13109)

Included in the following conference series: International Conference on Neural Information Processing

Abstract

Most fine-grained recognition research builds on generic classification models as the backbone. However, this is a sub-optimal choice: the differences between similar categories in this task are so small that a model must capture subtle, discriminative fine-grained variations. In this paper, we design a dedicated backbone network for fine-grained recognition. To this end, we propose a novel Disentangled Feature Network (DFN) that gradually disentangles and incorporates coarse- and fine-grained features to explicitly capture multi-grained features. Replacing the original, ill-suited backbone with DFN is straightforward and encourages models to learn more representative features that potentially determine the classification results. Moreover, we present an optional error correction loss that adaptively penalizes misclassification between extremely similar categories and guides the model to capture fine-grained feature diversity. Extensive experiments demonstrate that adopting DFN as the backbone improves the baseline models' performance by about 2%, essentially for free, with negligible extra parameters on the widely used CUB, AirCraft, and Stanford Car datasets.
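
The abstract does not give the formula for the error correction loss, so the following is only an illustrative sketch of the general idea it describes: a penalty that grows when a sample is misclassified in favor of a closely competing category. The function name `error_correction_loss_sketch`, the `weight` parameter, and the margin-based penalty are all assumptions made for illustration and should not be read as the paper's definition.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def error_correction_loss_sketch(logits, target, weight=1.0):
    """Cross-entropy plus a penalty on the strongest wrong class.

    Hypothetical formulation: NOT the loss defined in the paper.
    """
    probs = softmax(logits)
    cross_entropy = -math.log(probs[target])
    # Probability of the most confusable (highest-scoring) wrong class.
    rival = max(p for i, p in enumerate(probs) if i != target)
    # Zero when the true class wins; grows with the margin of the error,
    # so the penalty adapts to how badly the sample is misclassified.
    penalty = max(0.0, rival - probs[target])
    return cross_entropy + weight * penalty


# A sample whose true class (index 1) loses to a similar class (index 0).
print(error_correction_loss_sketch([2.0, 1.0, 0.1], target=1))
```

In this sketch a correctly classified sample incurs plain cross-entropy, while a misclassified one is charged extra in proportion to the gap between the winning wrong class and the true class.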

Author information

Authors and Affiliations

Ant Group, Hangzhou, China
Shuyu Miao, Lin Zheng, Jingjing Liu & Mingming Gong
School of Computer Science, Shanghai Key Laboratory of Intelligent Information Processing, Fudan University, Shanghai, China
Shuaicheng Li & Rui Feng
College of Computer, National University of Defense Technology, Changsha, China
Wei Yu

Corresponding author

Correspondence to Rui Feng.

Editor information

Editors and Affiliations

Sampoerna University, Jakarta, Indonesia
Teddy Mantoro
Kyungpook National University, Daegu, Korea (Republic of)
Minho Lee
Sampoerna University, Jakarta, Indonesia
Media Anugerah Ayu
Murdoch University, Murdoch, WA, Australia
Kok Wai Wong
Universitas Indonesia, Depok, Indonesia
Achmad Nizar Hidayanto

About this paper

Cite this paper

Miao, S. et al. (2021). Disentangled Feature Network for Fine-Grained Recognition. In: Mantoro, T., Lee, M., Ayu, M.A., Wong, K.W., Hidayanto, A.N. (eds) Neural Information Processing. ICONIP 2021. Lecture Notes in Computer Science, vol 13109. Springer, Cham. https://doi.org/10.1007/978-3-030-92270-2_38

DOI: https://doi.org/10.1007/978-3-030-92270-2_38
Published: 07 December 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-92269-6
Online ISBN: 978-3-030-92270-2
eBook Packages: Computer Science, Computer Science (R0)
