skip to main content
research-article

Hierarchical Convolutional Neural Network with Knowledge Complementation for Long-Tailed Classification

Published:26 April 2024Publication History
Skip Abstract Section

Abstract

Existing methods based on transfer learning leverage auxiliary information to help tail generalization and improve the performance of the tail classes. However, they cannot fully exploit the relationships between auxiliary information and tail classes and bring irrelevant knowledge to the tail classes. To solve this problem, we propose a hierarchical CNN with knowledge complementation, which regards hierarchical relationships as auxiliary information and transfers relevant knowledge to tail classes. First, we integrate semantics and clustering relationships as hierarchical knowledge into the CNN to guide feature learning. Then, we design a complementary strategy to jointly exploit the two types of knowledge, where semantic knowledge acts as a prior dependence and clustering knowledge reduces the negative information caused by excessive semantic dependence (i.e., semantic gaps). In this way, the CNN facilitates the utilization of the two complementary hierarchical relationships and transfers useful knowledge to tail data to improve long-tailed classification accuracy. Experimental results on public benchmarks show that the proposed model outperforms existing methods. In particular, our model improves accuracy by 3.46% compared with the second-best method on the long-tailed tieredImageNet dataset.

REFERENCES

  1. [1] Abdi Lida and Hashemi Sattar. 2015. To combat multi-class imbalanced problems by means of over-sampling techniques. Transactions on Knowledge and Data Engineering 28, 1 (2015), 238251.Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. [2] Cao Kaidi, Wei Colin, Gaidon Adrien, Arechiga Nikos, and Ma Tengyu. 2019. Learning imbalanced datasets with label-distribution-aware margin loss. In International Conference on Neural Information Processing Systems. 15671578.Google ScholarGoogle Scholar
  3. [3] Chang Jianlong, Meng Gaofeng, Wang Lingfeng, Xiang Shiming, and Pan Chunhong. 2018. Deep self-evolution clustering. Transactions on Pattern Analysis and Machine Intelligence 42, 4 (2018), 809823.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. [4] Chawla Nitesh V., Bowyer Kevin W., Hall Lawrence O., and Kegelmeyer W. Philip. 2002. SMOTE: Synthetic minority over-sampling technique. Artificial Intelligence Research 16, 1 (2002), 321357.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. [5] Chen Haibin, Ma Qianli, Lin Zhenxi, and Yan Jiangyue. 2021. Hierarchy-aware label semantics matching network for hierarchical text classification. In Annual Meeting of the Association for Computational Linguistics. 43704379.Google ScholarGoogle ScholarCross RefCross Ref
  6. [6] Cui Yin, Jia Menglin, Lin Tsung Yi, Song Yang, and Belongie Serge. 2019. Class-balanced loss based on effective number of samples. In Conference on Computer Vision and Pattern Recognition. 92689277.Google ScholarGoogle ScholarCross RefCross Ref
  7. [7] Deng Jia, Dong Wei, Socher Richard, Li Li Jia, Li Kai, and Fei Li Fei. 2009. ImageNet: A large-scale hierarchical image database. In Conference on Computer Vision and Pattern Recognition. 248255.Google ScholarGoogle ScholarCross RefCross Ref
  8. [8] Deng Jia, Krause Jonathan, Berg Alexander C., and Fei Li Fei. 2012. Hedging your bets: Optimizing accuracy-specificity trade-offs in large scale visual recognition. In Conference on Computer Vision and Pattern Recognition. 34503457.Google ScholarGoogle Scholar
  9. [9] Fan Saite, Zhang Xinmin, and Song Zhihuan. 2022. Imbalanced sample selection with deep reinforcement learning for fault diagnosis. Transactions on Industrial Informatics 18, 4 (2022), 25182527.Google ScholarGoogle ScholarCross RefCross Ref
  10. [10] Ge Yubin, Li Site, Li Xuyang, Fan Fangfang, Xie Wanqing, You Jane, and Liu Xiaofeng. 2021. Embedding semantic hierarchy in discrete optimal transport for risk minimization. In International Conference on Acoustics, Speech and Signal Processing. 28352839.Google ScholarGoogle Scholar
  11. [11] Guo Hao and Wang Song. 2021. Long-tailed multi-label visual recognition by collaborative training on uniform and re-balanced samplings. In Conference on Computer Vision and Pattern Recognition. 1508915098.Google ScholarGoogle ScholarCross RefCross Ref
  12. [12] He Kaiming, Zhang Xiangyu, Ren Shaoqing, and Sun Jian. 2016. Deep residual learning for image recognition. In Conference on Computer Vision and Pattern Recognition. 770778.Google ScholarGoogle ScholarCross RefCross Ref
  13. [13] Huang Chen, Li Yining, Loy Chen Change, and Tang Xiaoou. 2016. Learning deep representation for imbalanced classification. In Conference on Computer Vision and Pattern Recognition. 53755384.Google ScholarGoogle ScholarCross RefCross Ref
  14. [14] Hung Ling Chien, Hu Ya Han, Tsai Chih Fong, and Huang Min Wei. 2022. A dynamic time warping approach for handling class imbalanced medical datasets with missing values: A case study of protein localization site prediction. Expert Systems with Applications 192 (2022), 116437.Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. [15] Inoue Matheus, Forster Carlos Henrique, and Santos Antonio Carlos dos. 2020. Semantic hierarchy-based convolutional neural networks for image classification. In International Joint Conference on Neural Networks. 18.Google ScholarGoogle Scholar
  16. [16] Japkowicz Nathalie. 2000. The class imbalance problem: Significance and strategies. In International Conference on Artificial Intelligence. 111117.Google ScholarGoogle Scholar
  17. [17] Kim Jaehyung, Jeong Jongheon, and Shin Jinwoo. 2020. M2m: Imbalanced classification via major-to-minor translation. In Conference on Computer Vision and Pattern Recognition. 1389613905.Google ScholarGoogle ScholarCross RefCross Ref
  18. [18] Kosmopoulos Aris, Partalas Ioannis, Gaussier Eric, Paliouras Georgios, and Androutsopoulos Ion. 2015. Evaluation measures for hierarchical classification: A unified view and novel approaches. Data Mining and Knowledge Discovery 29, 3 (2015), 820865.Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. [19] Krizhevsky Alex, Nair Vinod, and Hinton Geoffrey. 2009. Learning Multiple Layers of Features from Tiny Images. Master’s thesis, University of Toronto. 158.Google ScholarGoogle Scholar
  20. [20] Lin Tsung Yi, Goyal Priya, Girshick Ross, He Kaiming, and Dollr Piotr. 2017. Focal loss for dense object detection. In International Conference on Computer Vision. 29802988.Google ScholarGoogle ScholarCross RefCross Ref
  21. [21] Lin Wei Chao, Tsai Chih Fong, Hu Ya Han, and Jhang Jing Shang. 2017. Clustering-based undersampling in class-imbalanced data. Information Sciences 409 (2017), 1726.Google ScholarGoogle ScholarCross RefCross Ref
  22. [22] Liu Huafeng, Wang Jiaqi, and Jing Liping. 2021. Cluster-wise hierarchical generative model for deep amortized clustering. In Conference on Computer Vision and Pattern Recognition. 1510915118.Google ScholarGoogle ScholarCross RefCross Ref
  23. [23] Liu Ziwei, Miao Zhongqi, Zhan Xiaohang, Wang Jiayun, Gong Boqing, and Yu Stella X.. 2019. Large-scale long-tailed recognition in an open world. In Conference on Computer Vision and Pattern Recognition. 25372546.Google ScholarGoogle ScholarCross RefCross Ref
  24. [24] Ma Jianghong, Chow Tommy W. S., and Zhang Haijun. 2022. Semantic-gap-oriented feature selection and classifier construction in multilabel learning. Transactions on Cybernetics 52, 1 (2022), 101115.Google ScholarGoogle ScholarCross RefCross Ref
  25. [25] Maldonado Sebastin, Vairetti Carla, Fernandez Alberto, and Herrera Francisco. 2022. FW-SMOTE: A feature-weighted oversampling approach for imbalanced classification. Pattern Recognition 124 (2022), 108511.Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. [26] Naumov Stanislav, Yaroslavtsev Grigory, and Avdiukhin Dmitrii. 2021. Objective-based hierarchical clustering of deep embedding vectors. In Conference on Artificial Intelligence. 90559063.Google ScholarGoogle ScholarCross RefCross Ref
  27. [27] Obeso Abraham Montoya, Benois-Pineau Jenny, Vzquez Mireya Sara Garca, and Acosta Alejandro lvaro Ramrez. 2022. Visual vs internal attention mechanisms in deep neural networks for image classification and object detection. Pattern Recognition 123 (2022), 108411.Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. [28] Oram Peter. 2001. WordNet: An electronic lexical database. Applied Psycholinguistics 22, 1 (2001), 131134.Google ScholarGoogle ScholarCross RefCross Ref
  29. [29] Ren Mengye, Triantafillou Eleni, Ravi Sachin, Snell Jake, Swersky Kevin, Tenenbaum Joshua B., Larochelle Hugo, and Zemel Richard S.. 2018. Meta-learning for semi-supervised few-shot classification. In International Conference on Learning Representations.Google ScholarGoogle Scholar
  30. [30] Rui Yong, Huang Thomas S., and Chang Shih Fu. 1999. Image retrieval: Current techniques, promising directions, and open issues. Journal of Visual Communication and Image Representation 10, 1 (1999), 3962.Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. [31] Russakovsky Olga, Deng Jia, Su Hao, Krause Jonathan, Satheesh Sanjeev, Ma Sean, Huang Zhiheng, Karpathy Andrej, Khosla Aditya, Bernstein Michael, Alexander C. Berg, and Li Fei-Fei. 2015. ImageNet large scale visual recognition challenge. International Journal of Computer Vision 115, 3 (2015), 211252.Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. [32] Suh Sungho, Lukowicz Paul, and Lee Yong Oh. 2022. Discriminative feature generation for classification of imbalanced data. Pattern Recognition 122 (2022), 108302.Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. [33] Tahir Muhammad Atif, Kittler Josef, and Yan Fei. 2012. Inverse random under sampling for class imbalance problem and its application to multi-label classification. Pattern Recognition 45, 10 (2012), 37383750.Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. [34] Tan Jingru, Lu Xin, Zhang Gang, Yin Changqing, and Li Quanquan. 2021. Equalization loss v2: A new gradient balance approach for long-tailed object detection. In Conference on Computer Vision and Pattern Recognition. 16851694.Google ScholarGoogle ScholarCross RefCross Ref
  35. [35] Tan Jingru, Wang Changbao, Li Buyu, Li Quanquan, Ouyang Wanli, Yin Changqing, and Yan Junjie. 2020. Equalization loss for long-tailed object recognition. In Conference on Computer Vision and Pattern Recognition. 1166211671.Google ScholarGoogle ScholarCross RefCross Ref
  36. [36] Horn Grant Van, Aodha Oisin Mac, Song Yang, Cui Yin, Sun Chen, Shepard Alex, Adam Hartwig, Perona Pietro, and Belongie Serge. 2018. The iNaturalist species classification and detection dataset. In Conference on Computer Vision and Pattern Recognition. 87698778.Google ScholarGoogle ScholarCross RefCross Ref
  37. [37] Wang Guoyin, Yang Jie, and Xu Ji. 2017. Granular computing: From granularity optimization to multi-granularity joint problem solving. Granular Computing 2, 3 (2017), 105120.Google ScholarGoogle ScholarCross RefCross Ref
  38. [38] Wang Jiaqi, Zhang Wenwei, Zang Yuhang, Cao Yuhang, Pang Jiangmiao, Gong Tao, Chen Kai, Liu Ziwei, Loy Chen Change, and Lin Dahua. 2021. Seesaw loss for long-tailed instance segmentation. In Conference on Computer Vision and Pattern Recognition. 96959704.Google ScholarGoogle ScholarCross RefCross Ref
  39. [39] Wang Yu, Liu Ruonan, Lin Di, Chen Dongyue, Li Ping, Hu Qinghua, and Chen C. L. Philip. 2023. Coarse-to-fine: Progressive knowledge transfer based multi-task convolutional neural network for intelligent large-scale fault diagnosis. Transactions on Neural Networks and Learning Systems 34, 2 (2023), 761–774.Google ScholarGoogle Scholar
  40. [40] Wang Yu Xiong, Ramanan Deva, and Hebert Martial. 2017. Learning to model the tail. In Conference on Neural Information Processing Systems. 70327042.Google ScholarGoogle Scholar
  41. [41] Xiao Jianxiong, Ehinger Krista A., Hays James, Torralba Antonio, and Oliva Aude. 2016. SUN database: Exploring a large collection of scene categories. International Journal of Computer Vision 119, 1 (2016), 322.Google ScholarGoogle ScholarDigital LibraryDigital Library
  42. [42] Xiao Jianxiong, Hays James, Ehinger Krista A., Oliva Aude, and Torralba Antonio. 2010. SUN database: Large-scale scene recognition from abbey to zoo. In Conference on Computer Vision and Pattern Recognition. 34853492.Google ScholarGoogle ScholarCross RefCross Ref
  43. [43] Xu Chaoyang, Lin Renjie, Cai Jinyu, and Wang Shiping. 2022. Deep image clustering by fusing contrastive learning and neighbor relation mining. Knowledge-Based Systems 238 (2022), 107967.Google ScholarGoogle ScholarDigital LibraryDigital Library
  44. [44] Yi Huaikuan, Jiang Qingchao, Yan Xuefeng, and Wang Bei. 2021. Imbalanced classification based on minority clustering synthetic minority oversampling technique with wind turbine fault detection application. Transactions on Industrial Informatics 17, 9 (2021), 58675875.Google ScholarGoogle ScholarCross RefCross Ref
  45. [45] Zhang Renhui, Lin Tiancheng, Zhang Rui, and Xu Yi. 2022. Solving the long-tailed problem via intra-and inter-category balance. In International Conference on Acoustics, Speech and Signal Processing. 23552359.Google ScholarGoogle Scholar
  46. [46] Zhao Hong, Hu Qinghua, Zhu Pengfei, Wang Yu, and Wang Ping. 2021. A recursive regularization based feature selection framework for hierarchical classification. Transactions on Knowledge and Data Engineering 33, 7 (2021), 28332846.Google ScholarGoogle ScholarCross RefCross Ref
  47. [47] Zhong Wei and Gu Feng. 2022. Predicting local protein 3D structures using clustering deep recurrent neural network. Transactions on Computational Biology and Bioinformatics 19, 1 (2022), 593–604.Google ScholarGoogle ScholarDigital LibraryDigital Library
  48. [48] Zhou Boyan, Cui Quan, Wei Xiu Shen, and Chen Zhao Min. 2020. BBN: Bilateral-branch network with cumulative learning for long-tailed visual recognition. In Conference on Computer Vision and Pattern Recognition. 97199728.Google ScholarGoogle ScholarCross RefCross Ref
  49. [49] Zhou Ning and Fan Jianping. 2013. Jointly learning visually correlated dictionaries for large-scale visual recognition applications. Transactions on Pattern Analysis and Machine Intelligence 36, 4 (2013), 715730.Google ScholarGoogle ScholarDigital LibraryDigital Library
  50. [50] Zhou Yu, Li Xiaoni, Zhou Yucan, Wang Yu, Hu Qinghua, and Wang Weiping. 2022. Deep collaborative multi-task network: A human decision process inspired model for hierarchical image classification. Pattern Recognition 124 (2022), 108449.Google ScholarGoogle ScholarDigital LibraryDigital Library
  51. [51] Zhu Linchao and Yang Yi. 2022. Label independent memory for semi-supervised few-shot video classification. Transactions on Pattern Analysis and Machine Intelligence 44, 1 (2022), 273285.Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Hierarchical Convolutional Neural Network with Knowledge Complementation for Long-Tailed Classification

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    • Published in

      cover image ACM Transactions on Knowledge Discovery from Data
      ACM Transactions on Knowledge Discovery from Data  Volume 18, Issue 6
      July 2024
      760 pages
      ISSN:1556-4681
      EISSN:1556-472X
      DOI:10.1145/3613684
      Issue’s Table of Contents

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 26 April 2024
      • Online AM: 22 March 2024
      • Accepted: 16 March 2024
      • Revised: 12 December 2023
      • Received: 9 October 2022
      Published in tkdd Volume 18, Issue 6

      Check for updates

      Qualifiers

      • research-article
    • Article Metrics

      • Downloads (Last 12 months)117
      • Downloads (Last 6 weeks)43

      Other Metrics

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Full Text

    View this article in Full Text.

    View Full Text