Fine-Grained Image Classification Based on Target Acquisition and Feature Fusion

  • Conference paper
Knowledge Science, Engineering and Management (KSEM 2021)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 12817)

Abstract

Fine-grained image classification aims to analyze visual objects from subordinate categories, e.g., models of cars or species of dogs, which is challenging because inter-class variations are small and intra-class variations are large. Recent research has focused on extracting distinguishable local features via part-based models or attention networks that locate discriminative regions or parts. These approaches rely heavily on part annotations, incur high background noise, and ignore semantic information during learning. In this paper, we present a fine-grained image classification model, RPN-SCA-BCNN (RSCAB), based on target acquisition and feature fusion. In RSCAB, we first adopt a new Soft-NMS-enhanced Region Proposal Network (RPN) for weakly supervised object detection, which separates the distinguishable object from its background and thereby removes both the background noise and the need for costly annotation. Then, we propose a new attention mechanism, Spatial-Channel Attention (SCA), which focuses on the spatially discriminative parts of the image to reduce feature redundancy. Based on SCA, we further construct a Bilinear Convolutional Neural Network (BCNN) that fuses high- and low-dimensional features from two networks, obtaining richer multi-dimensional attention features for classification. The experimental results show that the proposed RSCAB model achieves higher classification accuracy than state-of-the-art algorithms.
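The abstract names Soft-NMS as the step that filters region proposals, but this preview does not say which variant or settings the authors use. As a rough illustration only, the sketch below shows the commonly described Gaussian form of Soft-NMS in plain Python: instead of discarding boxes that overlap the current best box, it decays their scores in proportion to the overlap. The function names `iou` and `soft_nms`, the `(x1, y1, x2, y2)` box layout, and the defaults `sigma=0.5` and `score_threshold=0.001` are assumptions made for this example, not details taken from the paper.

```python
import math


def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def soft_nms(boxes, scores, sigma=0.5, score_threshold=0.001):
    """Gaussian Soft-NMS (illustrative; parameters are assumed defaults).

    Returns a list of (box, rescored_score) pairs in selection order.
    """
    candidates = [(tuple(b), float(s)) for b, s in zip(boxes, scores)]
    kept = []
    while candidates:
        # Select the highest-scoring remaining box.
        best_idx = max(range(len(candidates)), key=lambda i: candidates[i][1])
        best_box, best_score = candidates.pop(best_idx)
        kept.append((best_box, best_score))

        # Decay the scores of the remaining boxes by their overlap with it.
        rescored = []
        for box, score in candidates:
            overlap = iou(best_box, box)
            score *= math.exp(-(overlap ** 2) / sigma)
            # Drop boxes whose score has decayed below the threshold.
            if score >= score_threshold:
                rescored.append((box, score))
        candidates = rescored
    return kept


if __name__ == "__main__":
    demo_boxes = [(0, 0, 10, 10), (1, 1, 11, 11), (20, 20, 30, 30)]
    demo_scores = [0.9, 0.8, 0.7]
    for box, score in soft_nms(demo_boxes, demo_scores):
        print(box, round(score, 4))
```

In the demo, the second box overlaps the first heavily, so its score is reduced and it is ranked below the non-overlapping third box rather than being removed outright. Hard NMS would typically delete it, which is the behaviour Soft-NMS is meant to soften.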

Acknowledgement

This research is supported by the National Natural Science Foundation of China under Grant No. 61771155 and by the Singapore MOE TIF grant MOE2017-TIF-1-G018.

Author information

Corresponding authors

Correspondence to Yan Chu, Zhengkui Wang or Qingchao Zhao.

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Cite this paper

Chu, Y., Wang, Z., Wang, L., Zhao, Q., Shan, W. (2021). Fine-Grained Image Classification Based on Target Acquisition and Feature Fusion. In: Qiu, H., Zhang, C., Fei, Z., Qiu, M., Kung, SY. (eds) Knowledge Science, Engineering and Management. KSEM 2021. Lecture Notes in Computer Science, vol 12817. Springer, Cham. https://doi.org/10.1007/978-3-030-82153-1_18

  • DOI: https://doi.org/10.1007/978-3-030-82153-1_18

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-82152-4

  • Online ISBN: 978-3-030-82153-1

  • eBook Packages: Computer Science, Computer Science (R0)
