Abstract
As in the traditional single-label classification, the feature selection plays an important role in the multi-label classification. This paper presents a multi-label feature selection algorithm MLFS which consists of two steps. The first step employs the mutual information to complete the local feature selection. Based on the result of local selection, GA algorithm is adopted to select the global optimal feature subset and the correlations among the labels are considered. Compared with other multi-label feature selection algorithms, MLFS exploits the label correlation to improve the performance. The experiments on two multi-label datasets demonstrate that the proposed method has been proved to be a promising multi-label feature selection method.
Keywords
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Wang, J., Zhao, Y., Wu, X., et al.: A transductive multi-label learning approach for video concept detection. Pattern Recognition 44(10), 2274–2286 (2011)
Trohidis, K., Tsoumakas, G., Kalliris, G., et al.: Multi-Label Classification of Music into Emotions. ISMIR 8, 325–330 (2008)
Zhang, M.L., Zhou, Z.H.: Multilabel neural networks with applications to functional genomics and text categorization. IEEE Transactions on Knowledge and Data Engineering 18(10), 1338–1351 (2006)
Bellman, R.E.: Adaptive Control Process: A Guided Tour. Princeton University Press, Princeton (1961)
Yang, Y., Pedersen, J.O.: A comparative study on feature selection in text categorization. ICML 97, 412–420 (1997)
Chen, W., Yan, J., Zhang, B., et al.: Document transformation for multi-label feature selection in text categorization. In: The Seventh IEEE International Conference on Data Mining, pp. 451–456. IEEE (2007)
Gao, S., Wu, W., Lee, C.H., et al.: A MFoM learning approach to robust multiclass multi-label text categorization. In: The Twenty-First International Conference on Machine Learning, p. 42. ACM (2004)
Zhang, M.L., Pea, J.M., Robles, V.: Feature selection for multi-label naive Bayes classification. Information Sciences 179(19), 3218–3229 (2009)
Wang, H., Ding, C.H.Q., Huang, H.: Multi-Label Classification: Inconsistency and Class Balanced K-Nearest Neighbor. In: The Twenty-Fouth AAAI Conference on Artifical Intelligence, pp. 1264–1266 (2010)
Yu, K., Yu, S., Tresp, V.: Multi-label informed latent semantic indexing. In: The 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 258–265. ACM (2005)
Zhang, Y., Zhou, Z.H.: Multilabel dimensionality reduction via dependence maximization. ACM Transactions on Knowledge Discovery from Data (TKDD) 4(3), 14 (2010)
Langley, P., Simon, H.A.: Applications of machine learning and rule induction. Communications of the ACM 38(11), 54–64 (1995)
Peng, H., Long, F., Ding, C.: Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy. IEEE Transactions on Pattern Analysis and Machine Intelligence 27(8), 1226–1238 (2005)
Goldberg, D.E.: Genetic algorithms in search, optimization, and machine learning. Addison-wesley, Reading Menlo Park (1989)
Tsoumakas, G., Katakis, I.: Multi-label classification: An overview. International Journal of Data Warehousing and Mining 3, 1–13 (2007)
Zhang, M.L., Zhou, Z.H.: ML-KNN: A lazy learning approach to multi-label learning. Pattern Recognition 40(7), 2038–2048 (2007)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Yu, Y., Wang, Y. (2014). Feature Selection for Multi-label Learning Using Mutual Information and GA. In: Miao, D., Pedrycz, W., Ślȩzak, D., Peters, G., Hu, Q., Wang, R. (eds) Rough Sets and Knowledge Technology. RSKT 2014. Lecture Notes in Computer Science(), vol 8818. Springer, Cham. https://doi.org/10.1007/978-3-319-11740-9_42
Download citation
DOI: https://doi.org/10.1007/978-3-319-11740-9_42
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-11739-3
Online ISBN: 978-3-319-11740-9
eBook Packages: Computer ScienceComputer Science (R0)