Skip to main content

Multi-label Classification via Label-Topic Pairs

  • Conference paper
  • First Online:
Book cover Web and Big Data (APWeb-WAIM 2018)

Abstract

The task of learning from multi-label example is rather challenging because of the tremendous number of possible label sets. It has been well recognized that exploiting label relationships in a proper way can facilitate the learning process and boost the learning performance. In this paper, we propose a novel framework called Label-Topic Pairs Multi-Label (LTPML) for multi-label classification. LTPML regards the label set associated with each instance as a document and each class label in the label set as a word and then obtains the topics from the label space by topic models. With the information about label correlations contained by topics, multi-label classification problem is decomposed into a series of single-label classification problems. Based on label-topic pairs which are constructed from relationships among the current label and topics, several multi-class classifiers are built for each class label. Two algorithms named LTPML-\(\alpha \) and LTPML-\(\beta \) are derived according to different way of selecting the topics. Experiments on benchmark data sets clearly validate the effectiveness of the proposed approaches.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Barutcuoglu, Z., Schapire, R.E., Troyanskaya, O.G.: Hierarchical multi-label prediction of gene function. Bioinformatics 22(7), 830 (2006)

    Article  Google Scholar 

  2. Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent Dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)

    MATH  Google Scholar 

  3. Boutell, M.R., Luo, J., Shen, X., Brown, C.M.: Learning multi-label scene classification. Pattern Recogn. 37(9), 1757–1771 (2004)

    Article  Google Scholar 

  4. Brinker, K.: Multilabel classification via calibrated label ranking. Mach. Learn. 73(2), 133–153 (2008)

    Article  Google Scholar 

  5. Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The weka data mining software: an update. ACM SIGKDD Explor. Newslett. 11(1), 10–18 (2009)

    Article  Google Scholar 

  6. Kim, D., Kim, S., Oh, A.: Dirichlet process with mixed random measures: a nonparametric topic model for labeled data. Computer Science pp. 727–734 (2012)

    Google Scholar 

  7. Li, X., Ouyang, J., Zhou, X.: Supervised topic models for multi-label classification. Neurocomputing 149(PB), 811–819 (2015)

    Article  Google Scholar 

  8. Li, X., Ouyang, J., Zhou, X.: Labelset topic model for multi-label document classification. J. Intell. Inf. Syst. 46(1), 83–97 (2016)

    Article  Google Scholar 

  9. Pillai, I., Fumera, G., Roli, F.: Designing multi-label classifiers that maximize F measures: state of the art. Pattern Recogn. 61, 394–404 (2017)

    Article  Google Scholar 

  10. Ramage, D., Hall, D., Nallapati, R., Manning, C.D.: Labeled LDA: a supervised topic model for credit attribution in multi-labeled corpora. In: Conference on Empirical Methods in Natural Language Processing, vol.1, pp. 248–256 (2009)

    Google Scholar 

  11. Read, J., Pfahringer, B., Holmes, G., Frank, E.: Classifier chains for multi-label classification. Mach. Learn. 85(3), 333 (2011)

    Article  MathSciNet  Google Scholar 

  12. Rubin, T.N., Chambers, A., Smyth, P., Steyvers, M.: Statistical topic models for multi-label document classification. Mach. Learn. 88(1–2), 157–208 (2012)

    Article  MathSciNet  Google Scholar 

  13. Schapire, R.E., Singer, Y.: A boosting-based systemfor text categorization. Mach. Learn. 39(2–3), 135–168 (2000)

    Article  Google Scholar 

  14. Sun, F., Tang, J., Li, H., Qi, G.J., Huang, T.S.: Multi-label image categorization with sparse factor representation. IEEE Trans. Image Process. 23(3), 1028–1037 (2014)

    Article  MathSciNet  Google Scholar 

  15. Tsoumakas, G., Katakis, I., Vlahavas, I.: Mining multi-label data. In: Maimon, O., Rokach, L. (eds.) Data Mining and Knowledge Discovery Handbook, pp. 667–685. Springer, Boston (2009). https://doi.org/10.1007/978-0-387-09823-4_34

    Chapter  Google Scholar 

  16. Tsoumakas, G., Katakis, I., Vlahavas, I.: Random k-labelsets for multilabel classification. IEEE Trans. Knowl. Data Eng. 23(7), 1079–1089 (2011)

    Article  Google Scholar 

  17. Tsoumakas, G., Spyromitros-Xioufis, E., Vilcek, J., Vlahavas, I.: Mulan: a java library for multi-label learning. J. Mach. Learn. Res. 12(7), 2411–2414 (2011)

    MathSciNet  MATH  Google Scholar 

  18. Wang, X., Sukthankar, G.: Multi-label relational neighbor classification using social context features. In: ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 464–472 (2013)

    Google Scholar 

  19. Wang, X., Li, G.Z.: Multilabel learning via random label selection for protein subcellular multilocations prediction. IEEE/ACM Trans. Comput. Biol. Bioinf. 10(2), 436–446 (2013)

    Article  Google Scholar 

  20. Wieczorkowska, A., Synak, P., Ra, Z.W.: Multi-label classification of emotions in music. In: Kłopotek, M.A., Wierzchoń, S.T., Trojanowski, K. (eds.) Intelligent Information Processing and Web Mining. Advances in Soft Computing, pp. 307–315. Springer, Berlin (2006). https://doi.org/10.1007/3-540-33521-8_30

    Chapter  Google Scholar 

  21. Wu, B., Zhong, E., Horner, A., Yang, Q.: Music emotion recognition by multi-label multi-layer multi-instance multi-view learning. In: ACM International Conference on Multimedia, pp. 117–126 (2014)

    Google Scholar 

  22. Wu, X.Z., Zhou, Z.H.: A unified view of multi-label performance measures (2016)

    Google Scholar 

  23. Zhang, M.L., Zhang, K.: Multi-label learning by exploiting label dependency. In: ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 999–1008 (2010)

    Google Scholar 

  24. Zhang, M.L., Zhou, Z.H.: ML-KNN: a lazy learning approach to multi-label learning. Pattern Recogn. 40(7), 2038–2048 (2007)

    Article  Google Scholar 

  25. Zhang, M.L., Zhou, Z.H.: A review on multi-label learning algorithms. IEEE Trans. Knowl. Data Eng. 26(8), 1819–1837 (2014)

    Article  Google Scholar 

Download references

Acknowledgements

This paper is supported by the National Key Research and Development Program of China (Grant No. 2016YFB1001102), the National Natural Science Foundation of China (Grant Nos. 61502227, 61375069), the Collaborative Innovation Center of Novel Software Technology and Industrialization at Nanjing University and the Fundamental Research Funds for the Central Universities (Grant Nos. 020214380036, 020214380038).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Chongjun Wang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG, part of Springer Nature

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Chen, G., Peng, Y., Wang, C. (2018). Multi-label Classification via Label-Topic Pairs. In: Cai, Y., Ishikawa, Y., Xu, J. (eds) Web and Big Data. APWeb-WAIM 2018. Lecture Notes in Computer Science(), vol 10987. Springer, Cham. https://doi.org/10.1007/978-3-319-96890-2_3

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-96890-2_3

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-96889-6

  • Online ISBN: 978-3-319-96890-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics