research-article

Hierarchical Convolutional Neural Network with Knowledge Complementation for Long-Tailed Classification

Authors:
Hong Zhao

Minnan Normal University, Zhangzhou, China

Minnan Normal University, Zhangzhou, China

0000-0001-9339-1829
Search about this author

,
Zhengyu Li

Minnan Normal University, Zhangzhou, China

Minnan Normal University, Zhangzhou, China

0000-0003-0548-254X
Search about this author

,
Wenwei He

Minnan Normal University, Zhangzhou, China

Minnan Normal University, Zhangzhou, China

0009-0000-7207-1594
Search about this author

,
Yan Zhao

Minnan Normal University, Zhangzhou China

Minnan Normal University, Zhangzhou China

0009-0001-2295-7570
Search about this author

ACM Transactions on Knowledge Discovery from Data Volume 18 Issue 6Article No.: 154pp 1–22https://doi.org/10.1145/3653717

Published:26 April 2024Publication History

ACM Transactions on Knowledge Discovery from Data

Abstract

Existing methods based on transfer learning leverage auxiliary information to help tail generalization and improve the performance of the tail classes. However, they cannot fully exploit the relationships between auxiliary information and tail classes and bring irrelevant knowledge to the tail classes. To solve this problem, we propose a hierarchical CNN with knowledge complementation, which regards hierarchical relationships as auxiliary information and transfers relevant knowledge to tail classes. First, we integrate semantics and clustering relationships as hierarchical knowledge into the CNN to guide feature learning. Then, we design a complementary strategy to jointly exploit the two types of knowledge, where semantic knowledge acts as a prior dependence and clustering knowledge reduces the negative information caused by excessive semantic dependence (i.e., semantic gaps). In this way, the CNN facilitates the utilization of the two complementary hierarchical relationships and transfers useful knowledge to tail data to improve long-tailed classification accuracy. Experimental results on public benchmarks show that the proposed model outperforms existing methods. In particular, our model improves accuracy by 3.46% compared with the second-best method on the long-tailed tieredImageNet dataset.

REFERENCES

[1] Abdi Lida and Hashemi Sattar. 2015. To combat multi-class imbalanced problems by means of over-sampling techniques. Transactions on Knowledge and Data Engineering 28, 1 (2015), 238–251.Google ScholarDigital Library
[2] Cao Kaidi, Wei Colin, Gaidon Adrien, Arechiga Nikos, and Ma Tengyu. 2019. Learning imbalanced datasets with label-distribution-aware margin loss. In International Conference on Neural Information Processing Systems. 1567–1578.Google Scholar
[3] Chang Jianlong, Meng Gaofeng, Wang Lingfeng, Xiang Shiming, and Pan Chunhong. 2018. Deep self-evolution clustering. Transactions on Pattern Analysis and Machine Intelligence 42, 4 (2018), 809–823.Google ScholarDigital Library
[4] Chawla Nitesh V., Bowyer Kevin W., Hall Lawrence O., and Kegelmeyer W. Philip. 2002. SMOTE: Synthetic minority over-sampling technique. Artificial Intelligence Research 16, 1 (2002), 321–357.Google ScholarDigital Library
[5] Chen Haibin, Ma Qianli, Lin Zhenxi, and Yan Jiangyue. 2021. Hierarchy-aware label semantics matching network for hierarchical text classification. In Annual Meeting of the Association for Computational Linguistics. 4370–4379.Google ScholarCross Ref
[6] Cui Yin, Jia Menglin, Lin Tsung Yi, Song Yang, and Belongie Serge. 2019. Class-balanced loss based on effective number of samples. In Conference on Computer Vision and Pattern Recognition. 9268–9277.Google ScholarCross Ref
[7] Deng Jia, Dong Wei, Socher Richard, Li Li Jia, Li Kai, and Fei Li Fei. 2009. ImageNet: A large-scale hierarchical image database. In Conference on Computer Vision and Pattern Recognition. 248–255.Google ScholarCross Ref
[8] Deng Jia, Krause Jonathan, Berg Alexander C., and Fei Li Fei. 2012. Hedging your bets: Optimizing accuracy-specificity trade-offs in large scale visual recognition. In Conference on Computer Vision and Pattern Recognition. 3450–3457.Google Scholar
[9] Fan Saite, Zhang Xinmin, and Song Zhihuan. 2022. Imbalanced sample selection with deep reinforcement learning for fault diagnosis. Transactions on Industrial Informatics 18, 4 (2022), 2518–2527.Google ScholarCross Ref
[10] Ge Yubin, Li Site, Li Xuyang, Fan Fangfang, Xie Wanqing, You Jane, and Liu Xiaofeng. 2021. Embedding semantic hierarchy in discrete optimal transport for risk minimization. In International Conference on Acoustics, Speech and Signal Processing. 2835–2839.Google Scholar
[11] Guo Hao and Wang Song. 2021. Long-tailed multi-label visual recognition by collaborative training on uniform and re-balanced samplings. In Conference on Computer Vision and Pattern Recognition. 15089–15098.Google ScholarCross Ref
[12] He Kaiming, Zhang Xiangyu, Ren Shaoqing, and Sun Jian. 2016. Deep residual learning for image recognition. In Conference on Computer Vision and Pattern Recognition. 770–778.Google ScholarCross Ref
[13] Huang Chen, Li Yining, Loy Chen Change, and Tang Xiaoou. 2016. Learning deep representation for imbalanced classification. In Conference on Computer Vision and Pattern Recognition. 5375–5384.Google ScholarCross Ref
[14] Hung Ling Chien, Hu Ya Han, Tsai Chih Fong, and Huang Min Wei. 2022. A dynamic time warping approach for handling class imbalanced medical datasets with missing values: A case study of protein localization site prediction. Expert Systems with Applications 192 (2022), 116437.Google ScholarDigital Library
[15] Inoue Matheus, Forster Carlos Henrique, and Santos Antonio Carlos dos. 2020. Semantic hierarchy-based convolutional neural networks for image classification. In International Joint Conference on Neural Networks. 1–8.Google Scholar
[16] Japkowicz Nathalie. 2000. The class imbalance problem: Significance and strategies. In International Conference on Artificial Intelligence. 111–117.Google Scholar
[17] Kim Jaehyung, Jeong Jongheon, and Shin Jinwoo. 2020. M2m: Imbalanced classification via major-to-minor translation. In Conference on Computer Vision and Pattern Recognition. 13896–13905.Google ScholarCross Ref
[18] Kosmopoulos Aris, Partalas Ioannis, Gaussier Eric, Paliouras Georgios, and Androutsopoulos Ion. 2015. Evaluation measures for hierarchical classification: A unified view and novel approaches. Data Mining and Knowledge Discovery 29, 3 (2015), 820–865.Google ScholarDigital Library
[19] Krizhevsky Alex, Nair Vinod, and Hinton Geoffrey. 2009. Learning Multiple Layers of Features from Tiny Images. Master’s thesis, University of Toronto. 1–58.Google Scholar
[20] Lin Tsung Yi, Goyal Priya, Girshick Ross, He Kaiming, and Dollr Piotr. 2017. Focal loss for dense object detection. In International Conference on Computer Vision. 2980–2988.Google ScholarCross Ref
[21] Lin Wei Chao, Tsai Chih Fong, Hu Ya Han, and Jhang Jing Shang. 2017. Clustering-based undersampling in class-imbalanced data. Information Sciences 409 (2017), 17–26.Google ScholarCross Ref
[22] Liu Huafeng, Wang Jiaqi, and Jing Liping. 2021. Cluster-wise hierarchical generative model for deep amortized clustering. In Conference on Computer Vision and Pattern Recognition. 15109–15118.Google ScholarCross Ref
[23] Liu Ziwei, Miao Zhongqi, Zhan Xiaohang, Wang Jiayun, Gong Boqing, and Yu Stella X.. 2019. Large-scale long-tailed recognition in an open world. In Conference on Computer Vision and Pattern Recognition. 2537–2546.Google ScholarCross Ref
[24] Ma Jianghong, Chow Tommy W. S., and Zhang Haijun. 2022. Semantic-gap-oriented feature selection and classifier construction in multilabel learning. Transactions on Cybernetics 52, 1 (2022), 101–115.Google ScholarCross Ref
[25] Maldonado Sebastin, Vairetti Carla, Fernandez Alberto, and Herrera Francisco. 2022. FW-SMOTE: A feature-weighted oversampling approach for imbalanced classification. Pattern Recognition 124 (2022), 108511.Google ScholarDigital Library
[26] Naumov Stanislav, Yaroslavtsev Grigory, and Avdiukhin Dmitrii. 2021. Objective-based hierarchical clustering of deep embedding vectors. In Conference on Artificial Intelligence. 9055–9063.Google ScholarCross Ref
[27] Obeso Abraham Montoya, Benois-Pineau Jenny, Vzquez Mireya Sara Garca, and Acosta Alejandro lvaro Ramrez. 2022. Visual vs internal attention mechanisms in deep neural networks for image classification and object detection. Pattern Recognition 123 (2022), 108411.Google ScholarDigital Library
[28] Oram Peter. 2001. WordNet: An electronic lexical database. Applied Psycholinguistics 22, 1 (2001), 131–134.Google ScholarCross Ref
[29] Ren Mengye, Triantafillou Eleni, Ravi Sachin, Snell Jake, Swersky Kevin, Tenenbaum Joshua B., Larochelle Hugo, and Zemel Richard S.. 2018. Meta-learning for semi-supervised few-shot classification. In International Conference on Learning Representations.Google Scholar
[30] Rui Yong, Huang Thomas S., and Chang Shih Fu. 1999. Image retrieval: Current techniques, promising directions, and open issues. Journal of Visual Communication and Image Representation 10, 1 (1999), 39–62.Google ScholarDigital Library
[31] Russakovsky Olga, Deng Jia, Su Hao, Krause Jonathan, Satheesh Sanjeev, Ma Sean, Huang Zhiheng, Karpathy Andrej, Khosla Aditya, Bernstein Michael, Alexander C. Berg, and Li Fei-Fei. 2015. ImageNet large scale visual recognition challenge. International Journal of Computer Vision 115, 3 (2015), 211–252.Google ScholarDigital Library
[32] Suh Sungho, Lukowicz Paul, and Lee Yong Oh. 2022. Discriminative feature generation for classification of imbalanced data. Pattern Recognition 122 (2022), 108302.Google ScholarDigital Library
[33] Tahir Muhammad Atif, Kittler Josef, and Yan Fei. 2012. Inverse random under sampling for class imbalance problem and its application to multi-label classification. Pattern Recognition 45, 10 (2012), 3738–3750.Google ScholarDigital Library
[34] Tan Jingru, Lu Xin, Zhang Gang, Yin Changqing, and Li Quanquan. 2021. Equalization loss v2: A new gradient balance approach for long-tailed object detection. In Conference on Computer Vision and Pattern Recognition. 1685–1694.Google ScholarCross Ref
[35] Tan Jingru, Wang Changbao, Li Buyu, Li Quanquan, Ouyang Wanli, Yin Changqing, and Yan Junjie. 2020. Equalization loss for long-tailed object recognition. In Conference on Computer Vision and Pattern Recognition. 11662–11671.Google ScholarCross Ref
[36] Horn Grant Van, Aodha Oisin Mac, Song Yang, Cui Yin, Sun Chen, Shepard Alex, Adam Hartwig, Perona Pietro, and Belongie Serge. 2018. The iNaturalist species classification and detection dataset. In Conference on Computer Vision and Pattern Recognition. 8769–8778.Google ScholarCross Ref
[37] Wang Guoyin, Yang Jie, and Xu Ji. 2017. Granular computing: From granularity optimization to multi-granularity joint problem solving. Granular Computing 2, 3 (2017), 105–120.Google ScholarCross Ref
[38] Wang Jiaqi, Zhang Wenwei, Zang Yuhang, Cao Yuhang, Pang Jiangmiao, Gong Tao, Chen Kai, Liu Ziwei, Loy Chen Change, and Lin Dahua. 2021. Seesaw loss for long-tailed instance segmentation. In Conference on Computer Vision and Pattern Recognition. 9695–9704.Google ScholarCross Ref
[39] Wang Yu, Liu Ruonan, Lin Di, Chen Dongyue, Li Ping, Hu Qinghua, and Chen C. L. Philip. 2023. Coarse-to-fine: Progressive knowledge transfer based multi-task convolutional neural network for intelligent large-scale fault diagnosis. Transactions on Neural Networks and Learning Systems 34, 2 (2023), 761–774.Google Scholar
[40] Wang Yu Xiong, Ramanan Deva, and Hebert Martial. 2017. Learning to model the tail. In Conference on Neural Information Processing Systems. 7032–7042.Google Scholar
[41] Xiao Jianxiong, Ehinger Krista A., Hays James, Torralba Antonio, and Oliva Aude. 2016. SUN database: Exploring a large collection of scene categories. International Journal of Computer Vision 119, 1 (2016), 3–22.Google ScholarDigital Library
[42] Xiao Jianxiong, Hays James, Ehinger Krista A., Oliva Aude, and Torralba Antonio. 2010. SUN database: Large-scale scene recognition from abbey to zoo. In Conference on Computer Vision and Pattern Recognition. 3485–3492.Google ScholarCross Ref
[43] Xu Chaoyang, Lin Renjie, Cai Jinyu, and Wang Shiping. 2022. Deep image clustering by fusing contrastive learning and neighbor relation mining. Knowledge-Based Systems 238 (2022), 107967.Google ScholarDigital Library
[44] Yi Huaikuan, Jiang Qingchao, Yan Xuefeng, and Wang Bei. 2021. Imbalanced classification based on minority clustering synthetic minority oversampling technique with wind turbine fault detection application. Transactions on Industrial Informatics 17, 9 (2021), 5867–5875.Google ScholarCross Ref
[45] Zhang Renhui, Lin Tiancheng, Zhang Rui, and Xu Yi. 2022. Solving the long-tailed problem via intra-and inter-category balance. In International Conference on Acoustics, Speech and Signal Processing. 2355–2359.Google Scholar
[46] Zhao Hong, Hu Qinghua, Zhu Pengfei, Wang Yu, and Wang Ping. 2021. A recursive regularization based feature selection framework for hierarchical classification. Transactions on Knowledge and Data Engineering 33, 7 (2021), 2833–2846.Google ScholarCross Ref
[47] Zhong Wei and Gu Feng. 2022. Predicting local protein 3D structures using clustering deep recurrent neural network. Transactions on Computational Biology and Bioinformatics 19, 1 (2022), 593–604.Google ScholarDigital Library
[48] Zhou Boyan, Cui Quan, Wei Xiu Shen, and Chen Zhao Min. 2020. BBN: Bilateral-branch network with cumulative learning for long-tailed visual recognition. In Conference on Computer Vision and Pattern Recognition. 9719–9728.Google ScholarCross Ref
[49] Zhou Ning and Fan Jianping. 2013. Jointly learning visually correlated dictionaries for large-scale visual recognition applications. Transactions on Pattern Analysis and Machine Intelligence 36, 4 (2013), 715–730.Google ScholarDigital Library
[50] Zhou Yu, Li Xiaoni, Zhou Yucan, Wang Yu, Hu Qinghua, and Wang Weiping. 2022. Deep collaborative multi-task network: A human decision process inspired model for hierarchical image classification. Pattern Recognition 124 (2022), 108449.Google ScholarDigital Library
[51] Zhu Linchao and Yang Yi. 2022. Label independent memory for semi-supervised few-shot video classification. Transactions on Pattern Analysis and Machine Intelligence 44, 1 (2022), 273–285.Google ScholarDigital Library

Index Terms

Hierarchical Convolutional Neural Network with Knowledge Complementation for Long-Tailed Classification
1. Computing methodologies
  1. Machine learning
    1. Learning paradigms
      1. Multi-task learning
        Transfer learning

Recommendations

Multi-task convolutional neural network with coarse-to-fine knowledge transfer for long-tailed classification
Abstract
Long-tailed classifications make it very challenging to deal with class-imbalanced problems using deep convolutional neural networks (CNNs). Existing solutions based on re-balancing methods perform well and use single-task CNNs to train each fine-...
Read More
Hierarchical long-tailed classification based on multi-granularity knowledge transfer driven by multi-scale feature fusion
Abstract
Long-tailed learning is attracting increasing attention due to the unbalanced distributions of real-world data. The aim is to train well-performing depth models. Traditional knowledge transfer methods for long-tailed learning are classified into ...
Highlights
- We propose a multi-scale feature fusion network about channel and spatial features.
- We investigate a multi-granularity relationship of class space.
- We explore a vertical transfer of coarse- to fine-grained knowledge.
Read More
Tacit Knowledge Transfer within Enterprises during Industry Conversion
ICIII '08: Proceedings of the 2008 International Conference on Information Management, Innovation Management and Industrial Engineering - Volume 03

Enterprises are confronted with the challenge of reengineering technological capability during industry conversion. Tacit knowledge accumulated in the former industry is vital for reengineering new technological capability. The research aims at the ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

Published in
ACM Transactions on Knowledge Discovery from Data Volume 18, Issue 6
July 2024
760 pages
ISSN:1556-4681
EISSN:1556-472X
DOI:10.1145/3613684
Editor:
Jian Pei
Duke University, USA
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 26 April 2024
- Online AM: 22 March 2024
- Accepted: 16 March 2024
- Revised: 12 December 2023
- Received: 9 October 2022
Published in tkdd Volume 18, Issue 6

Check for updates
Author Tags
Long-tailed classification
deep learning
knowledge transfer
hierarchical relationship
Qualifiers
- research-article
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 0
  Total Citations
  View Citations
- 117
  Total Downloads
- Downloads (Last 12 months)117
- Downloads (Last 6 weeks)43
Other Metrics
View Author Metrics
Cited By
This publication has not been cited yet

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Full Text

View this article in Full Text.

View Full Text

Hierarchical Convolutional Neural Network with Knowledge Complementation for Long-Tailed Classification

ACM Transactions on Knowledge Discovery from Data

Abstract

REFERENCES

Cited By

Index Terms

Recommendations

Multi-task convolutional neural network with coarse-to-fine knowledge transfer for long-tailed classification

Hierarchical long-tailed classification based on multi-granularity knowledge transfer driven by multi-scale feature fusion

Tacit Knowledge Transfer within Enterprises during Industry Conversion