A Two-Stage Dual Space Reduction Framework for Multi-label Classification

  • Conference paper
Trends and Applications in Knowledge Discovery and Data Mining (PAKDD 2013)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 7867)

Abstract

Multi-label classification has received increasing attention because it can assign an object to multiple classes simultaneously. However, its effectiveness may suffer from high dimensionality in the feature space and sparseness in the label space. To address these issues, this paper proposes a Two-Stage Dual Space Reduction (2SDSR) framework that transforms both the feature space and the label space into lower-dimensional spaces. In this framework, the label space is first transformed into a reduced label space, and a supervised dimensionality reduction method is then applied to find a small number of features that maximize the dependency between the features and the reduced labels. A set of classification models is then built from these reduced features and labels. We employ two well-known feature reduction methods, MDDM and CCA, and two widely used label reduction methods, PLST and BMD; other dimensionality reduction methods can also be plugged into the framework. Experiments on five real-world datasets indicate that the proposed framework can improve classification performance compared with traditional dimensionality reduction approaches that reduce only the feature space or only the label space.
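
The two stages described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation. It assumes a PLST-style label reduction (SVD of the centered label matrix), a simplified MDDM-like feature projection with linear kernels (top eigenvectors of the feature/reduced-label cross-covariance), and ridge regression standing in for the classification models. All function names and the `ridge` and 0.5-threshold choices are hypothetical and chosen only for illustration.

```python
import numpy as np

def plst_fit(Y, k):
    """Stage 1 (assumed PLST-style): SVD of the centered label matrix."""
    y_mean = Y.mean(axis=0)
    _, _, Vt = np.linalg.svd(Y - y_mean, full_matrices=False)
    V = Vt[:k].T                      # (n_labels, k) projection onto top-k directions
    return y_mean, V

def dependence_projection(X, Z, m):
    """Stage 2 (simplified, MDDM-like with linear kernels): pick feature
    directions whose projections covary most with the reduced labels Z."""
    Xc = X - X.mean(axis=0)
    Zc = Z - Z.mean(axis=0)
    C = Xc.T @ Zc                     # (n_features, k) cross-covariance (unnormalized)
    w, U = np.linalg.eigh(C @ C.T)    # symmetric PSD matrix, rank at most k
    order = np.argsort(w)[::-1][:m]   # indices of the m largest eigenvalues
    return U[:, order]                # (n_features, m)

def fit_2sdsr(X, Y, k, m, ridge=1e-3):
    """Reduce labels first, then features, then learn reduced-feature -> reduced-label map."""
    y_mean, V = plst_fit(Y, k)
    Z = (Y - y_mean) @ V              # reduced labels, zero-mean by construction
    x_mean = X.mean(axis=0)
    P = dependence_projection(X, Z, m)
    F = (X - x_mean) @ P              # reduced features
    # Ridge regression is a stand-in for the classification models (assumption).
    W = np.linalg.solve(F.T @ F + ridge * np.eye(m), F.T @ Z)
    return {"x_mean": x_mean, "y_mean": y_mean, "P": P, "V": V, "W": W}

def predict_2sdsr(model, X):
    """Project features, predict reduced labels, decode back to label space."""
    F = (X - model["x_mean"]) @ model["P"]
    Z_hat = F @ model["W"]
    Y_hat = Z_hat @ model["V"].T + model["y_mean"]
    return (Y_hat >= 0.5).astype(int)  # 0.5 threshold is an illustrative choice
```

Calling `fit_2sdsr(X, Y, k, m)` on a feature matrix `X` of shape (samples, features) and a binary label matrix `Y` of shape (samples, labels) returns the learned projections; `predict_2sdsr` then yields a binary label matrix. Because the cross-covariance has rank at most `k`, choosing `m` larger than `k` adds no further dependence-carrying directions in this simplified variant.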

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Pacharawongsakda, E., Theeramunkong, T. (2013). A Two-Stage Dual Space Reduction Framework for Multi-label Classification. In: Li, J., et al. Trends and Applications in Knowledge Discovery and Data Mining. PAKDD 2013. Lecture Notes in Computer Science, vol 7867. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40319-4_29

  • DOI: https://doi.org/10.1007/978-3-642-40319-4_29

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-40318-7

  • Online ISBN: 978-3-642-40319-4

  • eBook Packages: Computer Science, Computer Science (R0)
