Abstract
Label distribution learning, where deal with label ambiguity by describing the degree of relevance of each label to a specific instance. As a novel machine learning paradigm, the curse of dimensionality is one of the prominent problems. Feature selection is a vital preprocessing step to reduce the high dimensionality of data. However, most existing label distribution feature selection methods focus on selecting a feature subset that has relevant capabilities for all labels, ignoring label-specific features with the maximum discriminatory power for each label. To tackle this issue, a label distribution feature selection algorithm based on label-specific features is proposed in this paper. Initially, we introduce a feature selection optimization model for label distribution data that simultaneously considers common and label-specific features, leveraging sparse learning to further investigate the intricate relationships between features and labels. Subsequently, the label correlation coefficient is employed to enhance the collaborative learning effect of labels. Finally, the relevance between features and labels is taken into account to guide the feature selection process, which can effectively eliminate the redundant features. Comprehensive experiments demonstrate the advantage of our proposed method over other well-established feature selection algorithms for selecting label-specific features to label distribution data.







Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Data Availability
Data sharing not applicable.
Code Availability
Code availability not applicable.
References
Akbari A, Awais M, Fatemifar S, Khalid SS, Kittler J (2021) A novel ground metric for optimal transport-based chronological age estimation. IEEE Trans Cybern 52(10):9986–9999
Al-Fahdawi S, Al-Waisy AS, Zeebaree DQ, Qahwaji R, Natiq H, Mohammed MA, Nedoma J, Martinek R, Deveci M (2024) Fundus-deepnet: multi-label deep learning classification system for enhanced detection of multiple ocular diseases through data fusion of fundus images. Inf Fusion 102:102059
Berger A, Della Pietra SA, Della Pietra VJ (1996) A maximum entropy approach to natural language processing. Comput linguist 22(1):39–71
Fan Y, Chen B, Huang W, Liu J, Weng W, Lan W (2022) Multi-label feature selection based on label correlations and feature redundancy. Knowl-Based Syst 241:108256
Fan Y, Liu J, Tang J, Liu P, Lin Y, Du Y (2024) Learning correlation information for multi-label feature selection. Pattern Recognit 145:109899
Geng X (2016) Label distribution learning. IEEE Trans Knowl Data Eng 28(7):1734–1748
Geng X, Xia Y (2014) Head pose estimation based on multivariate label distribution. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1837–1842
Gupta A, Narayan S, Khan S, Khan FS, Shao L, van de Weijer J (2023) Generative multi-label zero-shot learning. IEEE Transactions on Pattern Analysis and Machine Intelligence
Han Q, Hu L, Gao W (2024) Feature relevance and redundancy coefficients for multi-view multi-label feature selection. Inf Sci 652:119747
Hang JY, Zhang ML (2021) Collaborative learning of label semantics and deep label-specific features for multi-label classification. IEEE Trans Pattern Anal Mach Intell 44(12):9860–9871
Hao P, Hu L, Gao W (2023) Partial multi-label feature selection via subspace optimization. Inf Sci 648:119556
He Z, Lin Y, Wang C, Guo L, Ding W (2023) Multi-label feature selection based on correlation label enhancement. Inf Sci 647:119526
Hu L, Gao L, Li Y, Zhang P, Gao W (2022) Feature-specific mutual information variation for multi-label feature selection. Inf Sci 593:449–471
Huang J, Qin F, Zheng X, Cheng Z, Yuan Z, Zhang W, Huang Q (2019) Improving multi-label classification with missing labels by learning label-specific features. Inf Sci 492:124–146
Kashef S, Nezamabadi-pour H, Nikpour B (2018) Multilabel feature selection: a comprehensive review and guiding experiments. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery 8(2):e1240
Li GL, Zhang HR, Min F, Lu YN (2023) Two-stage label distribution learning with label-independent prediction based on label-specific features. Knowl-Based Syst 267:110426
Li J, Zhang C, Zhou JT, Fu H, Xia S, Hu Q (2021) Deep-lift: deep label-specific feature learning for image annotation. IEEE Trans Cybern 52(8):7732–7741
Li J, Li P, Hu X, Yu K (2022a) Learning common and label-specific features for multi-label classification with correlation information. Pattern Recognit 121:108259
Li S, Deng W (2019) Blended emotion in-the-wild: multi-label facial expression recognition using crowdsourced annotations and deep locality feature learning. Int J Comput Vis 127(6–7):884–906
Li W, Chen J, Lu Y, Huang Z (2022b) Filling missing labels in label distribution learning by exploiting label-specific feature selection. In: 2022 International joint conference on neural networks (IJCNN), IEEE, pp 1–8
Lin Y, Liu H, Zhao H, Hu Q, Zhu X, Wu X (2022) Hierarchical feature selection based on label distribution learning. IEEE Transactions on Knowledge and Data Engineering
Liu H, Lin Y, Wang C, Guo L, Chen J (2023a) Semantic-gap-oriented feature selection in hierarchical classification learning. Inf Sci 642:119241
Liu K, Li T, Yang X, Chen H, Wang J, Deng Z (2023b) Semifree: semi-supervised feature selection with fuzzy relevance and redundancy. IEEE Transactions on Fuzzy Systems
Lu Y, Li W, Li H, Jia X (2023) Predicting label distribution from tie-allowed multi-label ranking. IEEE Transactions on Pattern Analysis and Machine Intelligence
Ma J, Chow TW, Zhang H (2020) Semantic-gap-oriented feature selection and classifier construction in multilabel learning. IEEE Trans Cybern 52(1):101–115
Paul D, Bardhan S, Saha S, Mathew J (2023) Ml-knockoffgan: deep online feature selection for multi-label learning. Knowl-Based Syst 271:110548
Peng Y, Liu H, Li J, Huang J, Lu BL, Kong W (2022) Cross-session emotion recognition by joint label-common and label-specific eeg features exploration. IEEE Trans Neural Syst Rehabil Eng 31:759–768
Qian W, Xiong C, Qian Y, Wang Y (2022a) Label enhancement-based feature selection via fuzzy neighborhood discrimination index. Knowl-Based Syst 250:109119
Qian W, Xiong Y, Yang J, Shu W (2022b) Feature selection for label distribution learning via feature similarity and label correlation. Inf Sci 582:38–59
Qian W, Ye Q, Li Y, Dai S (2022c) Label distribution feature selection with feature weights fusion and local label correlations. Knowl-Based Syst 256:109778
Qian W, Ye Q, Li Y, Huang J, Dai S (2022d) Relevance-based label distribution feature selection via convex optimization. Inf Sci 607:322–345
Qian W, Xu F, Huang J, Qian J (2023) A novel granular ball computing-based fuzzy rough set for feature selection in label distribution learning. Knowl-Based Syst 278:110898
Qian W, Xiong Y, Ding W, Huang J, Vong CM (2024) Label correlations-based multi-label feature selection with label enhancement. Eng Appl Artif Intell 127:107310
Ren T, Jia X, Li W, Chen L, Li Z (2019) Label distribution learning with label-specific features. In: IJCAI, pp 3318–3324
Sharifi-Noghabi H, Harjandi PA, Zolotareva O, Collins CC, Ester M (2021) Out-of-distribution generalization from labelled and unlabelled gene expression data for drug response prediction. Nat Mach Intell 3(11):962–972
Su Y, Zhao W, Jing P, Nie L (2022) Exploiting low-rank latent gaussian graphical model estimation for visual sentiment distributions. IEEE Trans Multimed 25:1243–1255
Wang J, Geng X (2019) Classification with label distribution learning. In: Proceedings of the twenty-eighth international joint conference on artificial intelligence, pp 3712–3718
Xing C, Geng X, Xue H (2016) Logistic boosting regression for label distribution learning. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 4489–4497
Xu P, Xiao L, Liu B, Lu S, Jing L, Yu J (2023a) Label-specific feature augmentation for long-tailed multi-label text classification. In: Proceedings of the AAAI conference on artificial intelligence, vol 37, pp 10602–10610
Xu T, Xu Y, Yang S, Li B, Zhang W (2023b) Learning accurate label-specific features from partially multilabeled data. IEEE Transactions on Neural Networks and Learning Systems
Yang L, Li M, Shen C, Hu Q, Wen J, Xu S (2020) Discriminative transfer learning for driving pattern recognition in unlabeled scenes. IEEE Trans Cybern 52(3):1429–1442
Yang Y, Chen H, Mi Y, Luo C, Horng SJ, Li T (2023) Multi-label feature selection based on stable label relevance and label-specific features. Inf Sci 648:119525
Yu ZB, Zhang ML (2021) Multi-label classification with label-specific feature generation: a wrapped approach. IEEE Trans Pattern Anal Mach Intell 44(9):5199–5210
Zeng Q, Geng J, Jiang W, Huang K, Wang Z (2021) Idln: iterative distribution learning network for few-shot remote sensing image scene classification. IEEE Geosci Remote Sens Lett 19:1–5
Zhang J, Liu K, Yang X, Ju H, Xu S (2023a) Multi-label learning with relief-based label-specific feature selection. Appl Intell 53(15):18517–18530
Zhang J, Wu H, Jiang M, Liu J, Li S, Tang Y, Long J (2023b) Group-preserving label-specific feature selection for multi-label learning. Expert Syst ApplSystems with Applications 213:118861
Zhang ML, Wu L (2014) Lift: multi-label learning with label-specific features. IEEE Trans Pattern Anal Mach Intell 37(1):107–120
Zhang ML, Fang JP, Wang YB (2021) Bilabel-specific features for multi-label classification. ACM Transactions on Knowledge Discovery from Data 16(1):1–23
Zhang Q, Tsang EC, He Q, Guo Y (2023c) Ensemble of kernel extreme learning machine based elimination optimization for multi-label classification. Knowl-Based Syst 278:110817
Zou Y, Hu X, Li P (2024) Gradient-based multi-label feature selection considering three-way variable interaction. Pattern Recognition 145:109900
Acknowledgements
This work is supported by National Natural Science Foundation of China (62266018 and 62366019), and Natural Science Foundation of Jiangxi Province (20202BABL202037 and 20232
BAB202052).
Author information
Authors and Affiliations
Contributions
Wenhao Shu: Conceptualization, Methodology, Visualization, Writing-original draft. Qiang Xia: Data curation, Software, Validation, Formal analysis, Writing-original draft. Wenbin Qian: Investigation, Supervision, Writing-review and editing.
Corresponding author
Ethics declarations
Conflicts of Interest
All the authors do not have any possible conflicts of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Shu, W., Xia, Q. & Qian, W. Label distribution feature selection based on label-specific features. Appl Intell 54, 9195–9212 (2024). https://doi.org/10.1007/s10489-024-05668-8
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-024-05668-8