Abstract
The graph-based multi-label feature selection (MFS) method plays a pivotal role in big data era with the exponential growth of multi-label data. As the activity of the multi-label field increases, it also exposes some multi-label problems. Traditional graph-based methods adopt the original spaces to construct Laplacian graphs, which increases redundancy and noise. In addition, when exploring the deeper subtle connections between features and labels of multi-label samples, and extracting spatial correlations, there is a lack of consideration for shared connection information between feature space and label space. To track these tricky problems, this study adopts matrix factorization method to decompose the original matrix space and extract the low dimensional matrix. On the one hand, dynamic graph regularization preserves the spatial geometry and the interference of redundant features are reduced while extracting correlations. On the other hand, the decomposed low dimensional matrix obtains the shared connection information from the dual space of feature and label, which is beneficial for mining information from data. Furthermore, l2,1/2-norm is applied to the feature weight matrix to enforce row-sparsity and robustness. So this study proposes a robust MFS with Shared Coupled and Dynamic graph Regularization (SCDRMFS). An iterative method for solving the objective function is proposed, and its convergence is proved from two aspects, theoretically and experimentally. Moreover, experiments on nine real benchmark datasets are performed to verify the effectiveness of the proposed method. SCDRMFS is contrasted with six latest algorithms. It is concluded from the experimental results that the proposed algorithm SCDRMFS can improve the classification performance for multi-label datasets.












Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Zhang J, Li C, Sun Z, Luo Z, Zhou C, Li S (2019) Towards a unified multi-source-based optimization framework for multi-label learning. Appl Soft Comput 76:425–435
Bo T, Kay S, He H (2016) Toward optimal feature selection in naive bayes for text categorization. IEEE Trans Knowl Data Eng 28(9):2508–2521
Hong R, Wang M, Gao Y, Tao D, Li X, Wu X (2013) Image annotation by multiple-instance learning with discriminative feature mapping and selection. IEEE Trans Cybern 44(5):669–680
Nie F, Huang H, Cai X, Ding C (2010) Efficient and robust feature selection via joint ℓ2, 1-norms minimization. Adv Neural Inf Process Syst 23:1813–1821
Wang F-Y, Wang X, Li L, Li L i (2016) Steps toward parallel intelligence. IEEE/CAA J Autom Sin 3(4):345–348
Cai Z, Zhu W (2018) Multi-label feature selection via feature manifold learning and sparsity regularization. Int J Mach Learn Cybern 9(8):1321–1334
Lu Q, Li X, Dong Y (2018) Structure preserving unsupervised feature selection. Neurocomputing 301:36–45
Hu J, Li Y, Gao W, Zhang P (2020) Robust multi-label feature selection with dual-graph regularization. Knowl-Based Syst 203:106126
Bolón-Canedo V, Sánchez-Maroño N, Alonso-Betanzos A (2016) Feature selection for high-dimensional data. Prog Artif Intel 5(2):65–75
Qian W, Long X, Wang Y, Xie Y (2020) Multi-label feature selection based on label distribution and feature complementarity. Appl Soft Comput 90:106167
Hashemi A, Dowlatshahi MB, Nezamabadi-Pour H (2020) Mgfs: a multi-label graph-based feature selection algorithm via pagerank centrality. Expert Syst Appl 142:113024
Cai X, Nie F, Huang H (2013) Exact top-k feature selection via ℓ2, 0-norm constraint. In: Twenty-third international joint conference on artificial intelligence. Citeseer
Zhang P, Gao W (2020) Feature selection considering uncertainty change ratio of the class label. Appl Soft Comput 95:106537
Zhu Y, Kwok JT, Zhou Zhi-Hua (2017) Multi-label learning with global and local label correlation. IEEE Trans Knowl Data Eng 30(6):1081–1094
Zhang J, Luo Z, Li C, Zhou C, Li S (2019) Manifold regularized discriminative feature selection for multi-label learning. Pattern Recogn 95:136–150
Meng Y, Shang R, Jiao L, Zhang W, Yuan Y, Yang S (2018) Feature selection based dual-graph sparse non-negative matrix factorization for local discriminative clustering. Neurocomputing 290:87–99
Bandela SR, Kishore Kumar T (2021) Unsupervised feature selection and nmf de-noising for robust speech emotion recognition. Appl Acoust 172:107645
Jian L, Li J, Shu K, Liu H (2016) Multi-label informed feature selection. In: IJCAI, vol 16, pp 1627–33
Wang H, Yang Y, Liu B, Fujita H (2019) A study of graph-based system for multi-view clustering. Knowl-Based Syst 163:1009–1019
Zhang Y, Yang Y, Li T, Fujita H (2019) A multitask multiview clustering algorithm in heterogeneous situations based on lle and le. Knowl-Based Syst 163:776–786
Hu L, Li Y, Gao W, Zhang P, Hu J (2020) Multi-label feature selection with shared common mode. Pattern Recogn 104:107344
Zhang P, Sheng J, Gao W, Hu J, Li Y (2022) Multi-label feature selection method based on dynamic weight. Soft computing: A fusion of foundations, methodologies and applications, (6):26
Huang R, Jiang W, Sun G (2018) Manifold-based constraint laplacian score for multi-label feature selection. Pattern Recogn Lett 112:346–352
Xu Y, Wang J, An S, Wei J, Ruan J (2018) Semi-supervised multi-label feature selection by preserving feature-label space consistency. In: Proceedings of the 27th ACM international conference on information and knowledge management, pp 783–792
Hu J, Li Y, Xu G, Gao W (2022) Dynamic subspace dual-graph regularized multi-label feature selection. Neurocomputing 467:184–196
Wang J, Xu Y, Xu H, Sun Z, Yang Z, Wei J (2020) An effective multi-label feature selection model towards eliminating noisy features. Appl Sci 10(22):8093
Lv S, Shi S, Wang H, Li F (2021) Semi-supervised multi-label feature selection with adaptive structure learning and manifold learning. Knowl Based Syst 214(12):106757
Hashemi A, Dowlatshahi MB, Nezamabadi-Pour H (2020) Mfs-mcdm: Multi-label feature selection using multi-criteria decision making. Knowl-Based Syst 206:106365
Shang R, Xu K, Jiao L (2020) Subspace learning for unsupervised feature selection via adaptive structure learning and rank approximation. Neurocomputing 413:72–84
Nie F, Dong X u, Tsang Ivor Wai-Hung, Zhang C (2010) Flexible manifold embedding: a framework for semi-supervised and unsupervised dimension reduction. IEEE Trans Image Process 19(7):1921–1932
Zhang Y, Ma Y, Yang X (2022) Multi-label feature selection based on logistic regression and manifold learning. Appl Intell 52(8):9256–9273
Dempster AP, Laird NM, Rubin DB (1977) Maximum likelihood from incomplete data via the em algorithm. J R Stat Soc Ser B Methodol 39(1):1–22
Cai D, He X, Han J, Huang TS (2010) Graph regularized nonnegative matrix factorization for data representation. IEEE Trans Pattern Anal Mach Intell 33(8):1548–1560
Sun L, Ge H, Kang W (2019) Non-negative matrix factorization based modeling and training algorithm for multi-label learning. Front Comput Sci 13(6):1243–1254
Bolón-Canedo V, Sánchez-Maroño N, Alonso-Betanzos A (2015) Feature selection for high-dimensional data. Springer
Klimt B, Yang Y (2004) The enron corpus: a new dataset for email classification research. In: European conference on machine learning, Springer, pp 217–226
Gonçalves EC, Plastino A, Freitas AA (2013) A genetic algorithm for optimizing the label ordering in multi-label classifier chains. In: 2013 IEEE 25Th international conference on tools with artificial intelligence, IEEE, pp 469–476
Zhang M-L, Zhou Z-H (2007) Ml-knn: a lazy learning approach to multi-label learning. Pattern Recognit 40(7):2038–2048
Boutell MR, Luo J, Shen X, Brown CM (2004) Learning multi-label scene classification. Pattern Recognit 37(9):1757–1771
Rivolli A, Parker LC, Carvalho Andre CPLF de (2017) Food truck recommendation using multi-label classification. In: EPIA Conference on artificial intelligence, Springer, pp 585–596
Elisseeff AE, Weston J (2001) A kernel method for multi-labelled classification. In: Neural information processing systems
Tsoumakas G, Katakis I, Vlahavas I (2008) Effective and efficient multilabel classification in domains with large number of labels. In: Proc ECML/PKDD 2008 workshop on mining multidimensional data (MMD’08), vol 21, pp 53–59
Turnbull D, Barrington L, Torres D, Lanckriet G (2008) Semantic annotation and retrieval of music and sound effects. IEEE Trans Audio Speech Lang Process 16(2):467–476
Read J, Pfahringer B, Holmes G (2008) Multi-label classification using ensembles of pruned sets. In: 2008 Eighth IEEE international conference on data mining, IEEE, pp 995– 1000
Katakis I, Tsoumakas G, Vlahavas I (2008) Multilabel text classification for automated tag suggestion. In: Proceedings of the ECML/PKDD, vol 18, pp 5. Citeseer
Zhang M-L, Zhou Z-H (2013) A review on multi-label learning algorithms. IEEE Trans Knowl Data Eng 26(8):1819–1837
Xiao Q, Dai J, Luo J, Fujita H (2019) Multi-view manifold regularized learning-based method for prioritizing candidate disease mirnas. Knowl-Based Syst 175:118–129
Acknowledgements
This work is supported by the National Natural Science Foundation of China (Nos. 61976182, 62076171, 61876157, 61976245), Sichuan Key R&D project (2020YFG0035), the Natural Science Foundation of Sichuan Province (2022NSFSC0898).
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Hongmei Chen, Bo Peng, Tianrui Li and Tengyu Yin are contributed equally to this work.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Wang, L., Chen, H., Peng, B. et al. Robust multi-label feature selection with shared coupled and dynamic graph regularization. Appl Intell 53, 16973–16997 (2023). https://doi.org/10.1007/s10489-022-04343-0
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-022-04343-0