Abstract
Multi-label learning has been becoming an increasingly active area into the machine learning community due to a wide variety of real world problems. However, only over the past few years class balancing for these kind of problems became a topic of interest. In this paper, we present a novel method named hyperparameter calibration to treat class imbalance in a multi-label problem, to this aim we develop an extensive analysis over four real-world databases and two own synthetic databases exhibiting different ratios of imbalance. The empirical analysis shows that the proposed method is able to improve the classification performance when it is combined with three of the most widely used strategies for treating multi-label classification problems.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
Train and test sets for emotions, scene, yeast and cal500 databases were obtained from http://simidat.ujaen.es/~research/MLSMOTE/index.html#datasets.
- 2.
References
Tsoumakas, G., Katakis, I.: Multi-label classification: an overview. In. J. Data Warehous. Min. 3(3), 1–3 (2006)
Fürnkranz, J., Hüllermeier, E., Mencía, E.L., Brinker, K.: Multilabel classification via calibrated label ranking. Mach. Learn. 73(2), 133–153 (2008)
Read, J., Pfahringer, B., Holmes, G., Frank, E.: Classifier chains for multi-label classification. Mach. Learn. 85(3), 333 (2011)
Giraldo-Forero, A., Jaramillo-Garzón, J., Castellanos-Dominguez, C.: A comparison of multi-label techniques based on problem transformation for protein functional prediction. In: Proceedings of the 35th Annual International Conference of the EMBS, pp. 2688–2691. IEEE (2013)
Koziarski, M., Wożniak, M.: CCR: A combined cleaning and resampling algorithm for imbalanced data classification. Int. J. Appl. Math. Comput. Sci. 27(4), 727–736 (2017)
Batuwita, R., Palade, V.: Class Imbalance Learning Methods for Support Vector Machines (2013)
Charte, F., Rivera, A.J., del Jesus, M.J., Herrera, F.: MLSMOTE: Approaching imbalanced multilabel learning through synthetic instance generation. Knowl-Based Syst. 89, 385–397 (2015)
Tahir, M.A., Kittler, J., Yan, F.: Inverse random under sampling for class imbalance problem and its application to multi-label classification. Pattern Recognit. 45(10), 3738–3750 (2012)
Chang, C.C., Lin, C.J.: LIBSVM: a library for support vector machines. ACM Trans. Intell. Syst. Technol. (TIST) 2(3), (2011). Article No. 27
Tomás, J.T., Spolaôr, N., Cherman, E.A., Monard, M.C.: A framework to generate synthetic multi-label datasets. Electron. Notes Theor. Comput. Sci. 302, 155–176 (2014)
Charte, F., Rivera, A.J., del Jesus, M.J., Herrera, F.: Addressing imbalance in multilabel classification: measures and random resampling algorithms. Neurocomputing 163, 3–16 (2015)
Wieczorkowska, A., Synak, P., Raś, Z.W.: Multi-label classification of emotions in music. In: Intelligent Information Processing and Web Mining, pp. 307–315. Springer, Heidelberg (2006)
Boutell, M.R., Luo, J., Shen, X., Brown, C.M.: Learning multi-label scene classification. Pattern Recognit. 37(9), 1757–1771 (2004)
Elisseeff, A., Weston, J.: A kernel method for multi-labelled classification. In: Advances in Neural Information Processing Systems, pp. 681–687 (2002)
Turnbull, D., Barrington, L., Torres, D., Lanckriet, G.: Semantic annotation and retrieval of music and sound effects. IEEE Trans. Audio, Speech, Lang. Process. 16(2), 467–476 (2008)
Meyer, D., Wien, F.T.: Support vector machines. R News 1(3), 23–26 (2001)
Hsu, C.W., Chang, C.C., Lin, C.J., et al.: A Practical Guide to Support Vector Classification (2010)
Giraldo-Forero, A.F., Jaramillo-Garzón, J.A., Castellanos-Domínguez, C.G.: Evaluation of example-based measures for multi-label classification performance. In: Ortuño, F., Rojas, I. (eds.) IWBBIO 2015. LNCS, vol. 9043, pp. 557–564. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-16483-0_54
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG, part of Springer Nature
About this paper
Cite this paper
Giraldo-Forero, A.F., Cardona-Escobar, A.F., Castro-Ospina, A.E. (2018). Multi-label Learning by Hyperparameters Calibration for Treating Class Imbalance. In: de Cos Juez, F., et al. Hybrid Artificial Intelligent Systems. HAIS 2018. Lecture Notes in Computer Science(), vol 10870. Springer, Cham. https://doi.org/10.1007/978-3-319-92639-1_27
Download citation
DOI: https://doi.org/10.1007/978-3-319-92639-1_27
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-92638-4
Online ISBN: 978-3-319-92639-1
eBook Packages: Computer ScienceComputer Science (R0)