Robust multi-label feature selection with shared coupled and dynamic graph regularization

Applied Intelligence

Abstract

Graph-based multi-label feature selection (MFS) plays a pivotal role in the big data era, as the volume of multi-label data grows exponentially. The growing activity in the field has also exposed several problems. Traditional graph-based methods construct Laplacian graphs in the original spaces, which introduces redundancy and noise. In addition, when exploring the subtle connections between the features and labels of multi-label samples and extracting spatial correlations, existing methods give little consideration to the connection information shared between the feature space and the label space. To tackle these problems, this study uses matrix factorization to decompose the original matrix space and extract a low-dimensional matrix. On the one hand, dynamic graph regularization preserves the spatial geometry and reduces the interference of redundant features while correlations are extracted. On the other hand, the decomposed low-dimensional matrix captures the connection information shared by the dual space of features and labels, which benefits mining information from the data. Furthermore, the l2,1/2-norm is applied to the feature weight matrix to enforce row sparsity and robustness. On this basis, the study proposes a robust MFS method with Shared Coupled and Dynamic graph Regularization (SCDRMFS). An iterative method for solving the objective function is proposed, and its convergence is established both theoretically and experimentally. Experiments on nine real benchmark datasets verify the effectiveness of the proposed method, comparing SCDRMFS with six recent algorithms. The experimental results indicate that SCDRMFS can improve classification performance on multi-label datasets.
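As a rough illustration of two ingredients named in the abstract, the sketch below computes a mixed l2,p quasi-norm of a feature weight matrix (with p = 1/2) and builds a graph Laplacian from a k-nearest-neighbour heat-kernel graph. It is not the SCDRMFS algorithm: the abstract gives neither the objective function nor the update rules, so the definition (sum_i ||w_i||_2^p)^(1/p), the heat-kernel weighting, and all names (`l2p_norm`, `rank_features`, `graph_laplacian`, `k`, `sigma`) are assumptions chosen for illustration.

```python
import numpy as np

def l2p_norm(W, p=0.5):
    """Mixed l2,p quasi-norm (assumed definition): (sum_i ||w_i||_2^p)^(1/p),
    where w_i is the i-th row of W. p = 0.5 gives the l2,1/2 case."""
    row_norms = np.linalg.norm(W, axis=1)
    return float(np.sum(row_norms ** p) ** (1.0 / p))

def rank_features(W):
    """Return feature indices sorted by descending row l2-norm of W.
    Rows driven to zero by a row-sparse penalty end up last."""
    return np.argsort(-np.linalg.norm(W, axis=1), kind="stable")

def graph_laplacian(Z, k=3, sigma=1.0):
    """Unnormalized Laplacian L = D - S of a symmetric k-NN graph over the
    rows of Z, with heat-kernel edge weights (an illustrative choice)."""
    n = Z.shape[0]
    sq = np.sum(Z ** 2, axis=1)
    # Pairwise squared Euclidean distances, clipped at zero for stability.
    d2 = np.maximum(sq[:, None] + sq[None, :] - 2.0 * (Z @ Z.T), 0.0)
    S = np.zeros((n, n))
    for i in range(n):
        order = np.argsort(d2[i])
        nbrs = [j for j in order if j != i][:k]  # k nearest, excluding i
        S[i, nbrs] = np.exp(-d2[i, nbrs] / (2.0 * sigma ** 2))
    S = np.maximum(S, S.T)  # symmetrize the affinity matrix
    return np.diag(S.sum(axis=1)) - S

# Toy weight matrix: 3 features (rows) by 2 labels (columns).
W = np.array([[3.0, 4.0], [0.0, 0.0], [0.0, 1.0]])
penalty = l2p_norm(W)        # (sqrt(5) + 0 + 1)^2
ranking = rank_features(W)   # feature 0 first, the all-zero feature 1 last

# Toy low-dimensional representation: 6 samples, 2 latent dimensions.
Z = np.random.default_rng(0).normal(size=(6, 2))
L = graph_laplacian(Z, k=2)
```

In a scheme where the graph is "dynamic", one plausible reading is that the Laplacian is rebuilt from the current low-dimensional matrix at each iteration rather than fixed from the original data; that reading is an assumption here, since the abstract does not describe the construction.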

Notes

  1. http://mulan.sourceforge.net/datasets-mlc.html.

  2. http://www.uco.es/kdis/mllresources/.

  3. https://github.com/radishHead-hub/SCDRMFS.

Acknowledgements

This work is supported by the National Natural Science Foundation of China (Nos. 61976182, 62076171, 61876157, 61976245), the Sichuan Key R&D project (2020YFG0035), and the Natural Science Foundation of Sichuan Province (2022NSFSC0898).

Author information

Corresponding author

Correspondence to Hongmei Chen.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Hongmei Chen, Bo Peng, Tianrui Li and Tengyu Yin contributed equally to this work.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

About this article

Cite this article

Wang, L., Chen, H., Peng, B. et al. Robust multi-label feature selection with shared coupled and dynamic graph regularization. Appl Intell 53, 16973–16997 (2023). https://doi.org/10.1007/s10489-022-04343-0

  • DOI: https://doi.org/10.1007/s10489-022-04343-0
