Abstract
The task of learning from multi-label example is rather challenging because of the tremendous number of possible label sets. It has been well recognized that exploiting label relationships in a proper way can facilitate the learning process and boost the learning performance. In this paper, we propose a novel framework called Label-Topic Pairs Multi-Label (LTPML) for multi-label classification. LTPML regards the label set associated with each instance as a document and each class label in the label set as a word and then obtains the topics from the label space by topic models. With the information about label correlations contained by topics, multi-label classification problem is decomposed into a series of single-label classification problems. Based on label-topic pairs which are constructed from relationships among the current label and topics, several multi-class classifiers are built for each class label. Two algorithms named LTPML-\(\alpha \) and LTPML-\(\beta \) are derived according to different way of selecting the topics. Experiments on benchmark data sets clearly validate the effectiveness of the proposed approaches.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Barutcuoglu, Z., Schapire, R.E., Troyanskaya, O.G.: Hierarchical multi-label prediction of gene function. Bioinformatics 22(7), 830 (2006)
Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent Dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)
Boutell, M.R., Luo, J., Shen, X., Brown, C.M.: Learning multi-label scene classification. Pattern Recogn. 37(9), 1757–1771 (2004)
Brinker, K.: Multilabel classification via calibrated label ranking. Mach. Learn. 73(2), 133–153 (2008)
Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The weka data mining software: an update. ACM SIGKDD Explor. Newslett. 11(1), 10–18 (2009)
Kim, D., Kim, S., Oh, A.: Dirichlet process with mixed random measures: a nonparametric topic model for labeled data. Computer Science pp. 727–734 (2012)
Li, X., Ouyang, J., Zhou, X.: Supervised topic models for multi-label classification. Neurocomputing 149(PB), 811–819 (2015)
Li, X., Ouyang, J., Zhou, X.: Labelset topic model for multi-label document classification. J. Intell. Inf. Syst. 46(1), 83–97 (2016)
Pillai, I., Fumera, G., Roli, F.: Designing multi-label classifiers that maximize F measures: state of the art. Pattern Recogn. 61, 394–404 (2017)
Ramage, D., Hall, D., Nallapati, R., Manning, C.D.: Labeled LDA: a supervised topic model for credit attribution in multi-labeled corpora. In: Conference on Empirical Methods in Natural Language Processing, vol.1, pp. 248–256 (2009)
Read, J., Pfahringer, B., Holmes, G., Frank, E.: Classifier chains for multi-label classification. Mach. Learn. 85(3), 333 (2011)
Rubin, T.N., Chambers, A., Smyth, P., Steyvers, M.: Statistical topic models for multi-label document classification. Mach. Learn. 88(1–2), 157–208 (2012)
Schapire, R.E., Singer, Y.: A boosting-based systemfor text categorization. Mach. Learn. 39(2–3), 135–168 (2000)
Sun, F., Tang, J., Li, H., Qi, G.J., Huang, T.S.: Multi-label image categorization with sparse factor representation. IEEE Trans. Image Process. 23(3), 1028–1037 (2014)
Tsoumakas, G., Katakis, I., Vlahavas, I.: Mining multi-label data. In: Maimon, O., Rokach, L. (eds.) Data Mining and Knowledge Discovery Handbook, pp. 667–685. Springer, Boston (2009). https://doi.org/10.1007/978-0-387-09823-4_34
Tsoumakas, G., Katakis, I., Vlahavas, I.: Random k-labelsets for multilabel classification. IEEE Trans. Knowl. Data Eng. 23(7), 1079–1089 (2011)
Tsoumakas, G., Spyromitros-Xioufis, E., Vilcek, J., Vlahavas, I.: Mulan: a java library for multi-label learning. J. Mach. Learn. Res. 12(7), 2411–2414 (2011)
Wang, X., Sukthankar, G.: Multi-label relational neighbor classification using social context features. In: ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 464–472 (2013)
Wang, X., Li, G.Z.: Multilabel learning via random label selection for protein subcellular multilocations prediction. IEEE/ACM Trans. Comput. Biol. Bioinf. 10(2), 436–446 (2013)
Wieczorkowska, A., Synak, P., Ra, Z.W.: Multi-label classification of emotions in music. In: Kłopotek, M.A., Wierzchoń, S.T., Trojanowski, K. (eds.) Intelligent Information Processing and Web Mining. Advances in Soft Computing, pp. 307–315. Springer, Berlin (2006). https://doi.org/10.1007/3-540-33521-8_30
Wu, B., Zhong, E., Horner, A., Yang, Q.: Music emotion recognition by multi-label multi-layer multi-instance multi-view learning. In: ACM International Conference on Multimedia, pp. 117–126 (2014)
Wu, X.Z., Zhou, Z.H.: A unified view of multi-label performance measures (2016)
Zhang, M.L., Zhang, K.: Multi-label learning by exploiting label dependency. In: ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 999–1008 (2010)
Zhang, M.L., Zhou, Z.H.: ML-KNN: a lazy learning approach to multi-label learning. Pattern Recogn. 40(7), 2038–2048 (2007)
Zhang, M.L., Zhou, Z.H.: A review on multi-label learning algorithms. IEEE Trans. Knowl. Data Eng. 26(8), 1819–1837 (2014)
Acknowledgements
This paper is supported by the National Key Research and Development Program of China (Grant No. 2016YFB1001102), the National Natural Science Foundation of China (Grant Nos. 61502227, 61375069), the Collaborative Innovation Center of Novel Software Technology and Industrialization at Nanjing University and the Fundamental Research Funds for the Central Universities (Grant Nos. 020214380036, 020214380038).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG, part of Springer Nature
About this paper
Cite this paper
Chen, G., Peng, Y., Wang, C. (2018). Multi-label Classification via Label-Topic Pairs. In: Cai, Y., Ishikawa, Y., Xu, J. (eds) Web and Big Data. APWeb-WAIM 2018. Lecture Notes in Computer Science(), vol 10987. Springer, Cham. https://doi.org/10.1007/978-3-319-96890-2_3
Download citation
DOI: https://doi.org/10.1007/978-3-319-96890-2_3
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-96889-6
Online ISBN: 978-3-319-96890-2
eBook Packages: Computer ScienceComputer Science (R0)