Skip to main content
Log in

Label distribution feature selection based on label-specific features

  • Published:
Applied Intelligence Aims and scope Submit manuscript

Abstract

Label distribution learning, where deal with label ambiguity by describing the degree of relevance of each label to a specific instance. As a novel machine learning paradigm, the curse of dimensionality is one of the prominent problems. Feature selection is a vital preprocessing step to reduce the high dimensionality of data. However, most existing label distribution feature selection methods focus on selecting a feature subset that has relevant capabilities for all labels, ignoring label-specific features with the maximum discriminatory power for each label. To tackle this issue, a label distribution feature selection algorithm based on label-specific features is proposed in this paper. Initially, we introduce a feature selection optimization model for label distribution data that simultaneously considers common and label-specific features, leveraging sparse learning to further investigate the intricate relationships between features and labels. Subsequently, the label correlation coefficient is employed to enhance the collaborative learning effect of labels. Finally, the relevance between features and labels is taken into account to guide the feature selection process, which can effectively eliminate the redundant features. Comprehensive experiments demonstrate the advantage of our proposed method over other well-established feature selection algorithms for selecting label-specific features to label distribution data.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Algorithm 1
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

Explore related subjects

Discover the latest articles, news and stories from top researchers in related subjects.

Data Availability

Data sharing not applicable.

Code Availability

Code availability not applicable.

Notes

  1. http://palm.seu.edu.cn/xgeng/LDL/index.htm

References

  1. Akbari A, Awais M, Fatemifar S, Khalid SS, Kittler J (2021) A novel ground metric for optimal transport-based chronological age estimation. IEEE Trans Cybern 52(10):9986–9999

    Article  Google Scholar 

  2. Al-Fahdawi S, Al-Waisy AS, Zeebaree DQ, Qahwaji R, Natiq H, Mohammed MA, Nedoma J, Martinek R, Deveci M (2024) Fundus-deepnet: multi-label deep learning classification system for enhanced detection of multiple ocular diseases through data fusion of fundus images. Inf Fusion 102:102059

    Article  Google Scholar 

  3. Berger A, Della Pietra SA, Della Pietra VJ (1996) A maximum entropy approach to natural language processing. Comput linguist 22(1):39–71

    Google Scholar 

  4. Fan Y, Chen B, Huang W, Liu J, Weng W, Lan W (2022) Multi-label feature selection based on label correlations and feature redundancy. Knowl-Based Syst 241:108256

    Article  Google Scholar 

  5. Fan Y, Liu J, Tang J, Liu P, Lin Y, Du Y (2024) Learning correlation information for multi-label feature selection. Pattern Recognit 145:109899

    Article  Google Scholar 

  6. Geng X (2016) Label distribution learning. IEEE Trans Knowl Data Eng 28(7):1734–1748

    Article  Google Scholar 

  7. Geng X, Xia Y (2014) Head pose estimation based on multivariate label distribution. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1837–1842

  8. Gupta A, Narayan S, Khan S, Khan FS, Shao L, van de Weijer J (2023) Generative multi-label zero-shot learning. IEEE Transactions on Pattern Analysis and Machine Intelligence

  9. Han Q, Hu L, Gao W (2024) Feature relevance and redundancy coefficients for multi-view multi-label feature selection. Inf Sci 652:119747

    Article  Google Scholar 

  10. Hang JY, Zhang ML (2021) Collaborative learning of label semantics and deep label-specific features for multi-label classification. IEEE Trans Pattern Anal Mach Intell 44(12):9860–9871

    Article  Google Scholar 

  11. Hao P, Hu L, Gao W (2023) Partial multi-label feature selection via subspace optimization. Inf Sci 648:119556

    Article  Google Scholar 

  12. He Z, Lin Y, Wang C, Guo L, Ding W (2023) Multi-label feature selection based on correlation label enhancement. Inf Sci 647:119526

    Article  Google Scholar 

  13. Hu L, Gao L, Li Y, Zhang P, Gao W (2022) Feature-specific mutual information variation for multi-label feature selection. Inf Sci 593:449–471

    Article  Google Scholar 

  14. Huang J, Qin F, Zheng X, Cheng Z, Yuan Z, Zhang W, Huang Q (2019) Improving multi-label classification with missing labels by learning label-specific features. Inf Sci 492:124–146

    Article  MathSciNet  Google Scholar 

  15. Kashef S, Nezamabadi-pour H, Nikpour B (2018) Multilabel feature selection: a comprehensive review and guiding experiments. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery 8(2):e1240

    Google Scholar 

  16. Li GL, Zhang HR, Min F, Lu YN (2023) Two-stage label distribution learning with label-independent prediction based on label-specific features. Knowl-Based Syst 267:110426

    Article  Google Scholar 

  17. Li J, Zhang C, Zhou JT, Fu H, Xia S, Hu Q (2021) Deep-lift: deep label-specific feature learning for image annotation. IEEE Trans Cybern 52(8):7732–7741

    Article  Google Scholar 

  18. Li J, Li P, Hu X, Yu K (2022a) Learning common and label-specific features for multi-label classification with correlation information. Pattern Recognit 121:108259

    Article  Google Scholar 

  19. Li S, Deng W (2019) Blended emotion in-the-wild: multi-label facial expression recognition using crowdsourced annotations and deep locality feature learning. Int J Comput Vis 127(6–7):884–906

  20. Li W, Chen J, Lu Y, Huang Z (2022b) Filling missing labels in label distribution learning by exploiting label-specific feature selection. In: 2022 International joint conference on neural networks (IJCNN), IEEE, pp 1–8

  21. Lin Y, Liu H, Zhao H, Hu Q, Zhu X, Wu X (2022) Hierarchical feature selection based on label distribution learning. IEEE Transactions on Knowledge and Data Engineering

  22. Liu H, Lin Y, Wang C, Guo L, Chen J (2023a) Semantic-gap-oriented feature selection in hierarchical classification learning. Inf Sci 642:119241

    Article  Google Scholar 

  23. Liu K, Li T, Yang X, Chen H, Wang J, Deng Z (2023b) Semifree: semi-supervised feature selection with fuzzy relevance and redundancy. IEEE Transactions on Fuzzy Systems

  24. Lu Y, Li W, Li H, Jia X (2023) Predicting label distribution from tie-allowed multi-label ranking. IEEE Transactions on Pattern Analysis and Machine Intelligence

  25. Ma J, Chow TW, Zhang H (2020) Semantic-gap-oriented feature selection and classifier construction in multilabel learning. IEEE Trans Cybern 52(1):101–115

    Article  Google Scholar 

  26. Paul D, Bardhan S, Saha S, Mathew J (2023) Ml-knockoffgan: deep online feature selection for multi-label learning. Knowl-Based Syst 271:110548

    Article  Google Scholar 

  27. Peng Y, Liu H, Li J, Huang J, Lu BL, Kong W (2022) Cross-session emotion recognition by joint label-common and label-specific eeg features exploration. IEEE Trans Neural Syst Rehabil Eng 31:759–768

    Article  Google Scholar 

  28. Qian W, Xiong C, Qian Y, Wang Y (2022a) Label enhancement-based feature selection via fuzzy neighborhood discrimination index. Knowl-Based Syst 250:109119

    Article  Google Scholar 

  29. Qian W, Xiong Y, Yang J, Shu W (2022b) Feature selection for label distribution learning via feature similarity and label correlation. Inf Sci 582:38–59

    Article  MathSciNet  Google Scholar 

  30. Qian W, Ye Q, Li Y, Dai S (2022c) Label distribution feature selection with feature weights fusion and local label correlations. Knowl-Based Syst 256:109778

    Article  Google Scholar 

  31. Qian W, Ye Q, Li Y, Huang J, Dai S (2022d) Relevance-based label distribution feature selection via convex optimization. Inf Sci 607:322–345

    Article  Google Scholar 

  32. Qian W, Xu F, Huang J, Qian J (2023) A novel granular ball computing-based fuzzy rough set for feature selection in label distribution learning. Knowl-Based Syst 278:110898

    Article  Google Scholar 

  33. Qian W, Xiong Y, Ding W, Huang J, Vong CM (2024) Label correlations-based multi-label feature selection with label enhancement. Eng Appl Artif Intell 127:107310

    Article  Google Scholar 

  34. Ren T, Jia X, Li W, Chen L, Li Z (2019) Label distribution learning with label-specific features. In: IJCAI, pp 3318–3324

  35. Sharifi-Noghabi H, Harjandi PA, Zolotareva O, Collins CC, Ester M (2021) Out-of-distribution generalization from labelled and unlabelled gene expression data for drug response prediction. Nat Mach Intell 3(11):962–972

    Article  Google Scholar 

  36. Su Y, Zhao W, Jing P, Nie L (2022) Exploiting low-rank latent gaussian graphical model estimation for visual sentiment distributions. IEEE Trans Multimed 25:1243–1255

    Article  Google Scholar 

  37. Wang J, Geng X (2019) Classification with label distribution learning. In: Proceedings of the twenty-eighth international joint conference on artificial intelligence, pp 3712–3718

  38. Xing C, Geng X, Xue H (2016) Logistic boosting regression for label distribution learning. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 4489–4497

  39. Xu P, Xiao L, Liu B, Lu S, Jing L, Yu J (2023a) Label-specific feature augmentation for long-tailed multi-label text classification. In: Proceedings of the AAAI conference on artificial intelligence, vol 37, pp 10602–10610

  40. Xu T, Xu Y, Yang S, Li B, Zhang W (2023b) Learning accurate label-specific features from partially multilabeled data. IEEE Transactions on Neural Networks and Learning Systems

  41. Yang L, Li M, Shen C, Hu Q, Wen J, Xu S (2020) Discriminative transfer learning for driving pattern recognition in unlabeled scenes. IEEE Trans Cybern 52(3):1429–1442

  42. Yang Y, Chen H, Mi Y, Luo C, Horng SJ, Li T (2023) Multi-label feature selection based on stable label relevance and label-specific features. Inf Sci 648:119525

    Article  Google Scholar 

  43. Yu ZB, Zhang ML (2021) Multi-label classification with label-specific feature generation: a wrapped approach. IEEE Trans Pattern Anal Mach Intell 44(9):5199–5210

    Google Scholar 

  44. Zeng Q, Geng J, Jiang W, Huang K, Wang Z (2021) Idln: iterative distribution learning network for few-shot remote sensing image scene classification. IEEE Geosci Remote Sens Lett 19:1–5

    Article  Google Scholar 

  45. Zhang J, Liu K, Yang X, Ju H, Xu S (2023a) Multi-label learning with relief-based label-specific feature selection. Appl Intell 53(15):18517–18530

    Article  Google Scholar 

  46. Zhang J, Wu H, Jiang M, Liu J, Li S, Tang Y, Long J (2023b) Group-preserving label-specific feature selection for multi-label learning. Expert Syst ApplSystems with Applications 213:118861

    Article  Google Scholar 

  47. Zhang ML, Wu L (2014) Lift: multi-label learning with label-specific features. IEEE Trans Pattern Anal Mach Intell 37(1):107–120

    Article  Google Scholar 

  48. Zhang ML, Fang JP, Wang YB (2021) Bilabel-specific features for multi-label classification. ACM Transactions on Knowledge Discovery from Data 16(1):1–23

    Article  Google Scholar 

  49. Zhang Q, Tsang EC, He Q, Guo Y (2023c) Ensemble of kernel extreme learning machine based elimination optimization for multi-label classification. Knowl-Based Syst 278:110817

    Article  Google Scholar 

  50. Zou Y, Hu X, Li P (2024) Gradient-based multi-label feature selection considering three-way variable interaction. Pattern Recognition 145:109900

    Article  Google Scholar 

Download references

Acknowledgements

This work is supported by National Natural Science Foundation of China (62266018 and 62366019), and Natural Science Foundation of Jiangxi Province (20202BABL202037 and 20232

BAB202052).

Author information

Authors and Affiliations

Authors

Contributions

Wenhao Shu: Conceptualization, Methodology, Visualization, Writing-original draft. Qiang Xia: Data curation, Software, Validation, Formal analysis, Writing-original draft. Wenbin Qian: Investigation, Supervision, Writing-review and editing.

Corresponding author

Correspondence to Wenbin Qian.

Ethics declarations

Conflicts of Interest

All the authors do not have any possible conflicts of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Shu, W., Xia, Q. & Qian, W. Label distribution feature selection based on label-specific features. Appl Intell 54, 9195–9212 (2024). https://doi.org/10.1007/s10489-024-05668-8

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10489-024-05668-8

Keywords