Abstract
Multi-label classification refers to the problem of associating an object with multiple labels. This problem has been successfully addressed from the perspective of problem transformation and adaptation of algorithms. Multi-Label k-Nearest Neighbour (MLkNN) is a lazy learner that has reported excellent results, still there is room for improvements. In this paper we propose a modification to the MLkNN algorithm for the solution to problems of multi-label classification based on the Extended Rough Set Theory. More explicitly, the key modifications are focused in obtaining the relevance of the attributes when computing the distance between two instances, which are obtained using a heuristic search method and a target function based on the quality of the similarity. Experimental results using synthetic datasets have shown promising prediction rates. It is worth mentioning the ability of our proposal to deal with inconsistent scenarios, a main shortcoming present in most state-of-the-art multi-label classification algorithms.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Bi, W., Kwok, J.T.: Multi-label classification on tree-and dag-structured hierarchies. In: Proceedings of the 28th International Conference on Machine Learning (ICML-11), pp. 17–24 (2011)
Briggs, F., et al.: Acoustic classification of multiple simultaneous bird species: a multi-instance multi-label approach. J. Acoust. Soc. Am. 131(6), 4640–4650 (2012)
Cakir, E., Heittola, T., Huttunen, H., Virtanen, T.: Polyphonic sound event detection using multi label deep neural networks. In: 2015 International Joint Conference on Neural Networks (IJCNN), pp. 1–7. IEEE (2015)
Charte, F., Rivera, A.J., del Jesus, M.J., Herrera, F.: Quinta: a question tagging assistant to improve the answering ratio in electronic forums. In: EUROCON 2015-International Conference on Computer as a Tool (EUROCON), IEEE, pp. 1–6. IEEE (2015)
Chin, K.S., Liang, J., Dang, C.: Rough set data analysis algorithms for incomplete information systems. In: Wang, G., Liu, Q., Yao, Y., Skowron, A. (eds.) RSFDGrC 2003. LNCS (LNAI), vol. 2639, pp. 264–268. Springer, Heidelberg (2003). https://doi.org/10.1007/3-540-39205-X_35
Chou, K.C., Wu, Z.C., Xiao, X.: iLoc-Euk: a multi-label classifier for predicting the subcellular localization of singleplex and multiplex eukaryotic proteins. PLoS ONE 6(3), e18258 (2011)
Clerc, M., Kennedy, J.: The particle swarm-explosion, stability, and convergence in a multidimensional complex space. IEEE Trans. Evol. Comput. 6(1), 58–73 (2002)
Coello, L., Frías, M., Fernández, Y., Filiberto, Y., Bello, R., Caballero, Y.: Construcción de relaciones de similaridad borrosa basada en la medida calidad de la similaridad. Investig. Oper. 38(2), 132–140 (2018)
Deza, M.M., Deza, E.: Encyclopedia of Distances, pp. 1–583. Springer, Berlin (2009). https://doi.org/10.1007/978-3-642-00234-2
Filiberto, Y.: Métodos de aprendiza je para dominios con datos mezclados basados en la teoría de los conjuntos aproximados extendida. Universidad Central de Las Villas (2012)
Gibaja, E., Ventura, S.: Multi-label learning: a review of the state of the art and ongoing research. Wiley Interdiscip. Rev. Data Min. Knowl. Discov. 4(6), 411–444 (2014)
Kennedy, J.: Particle swarm optimization. In: Sammut, C., Webb, G.I. (eds.) Encyclopedia of Machine Learning, pp. 760–766. Springer, Heidelberg (2011). https://doi.org/10.1007/978-0-387-30164-8
Madjarov, G., Kocev, D., Gjorgjevikj, D., Džeroski, S.: An extensive experimental comparison of methods for multi-label learning. Pattern Recognit. 45(9), 3084–3104 (2012)
Calders, T., Esposito, F., Hüllermeier, E., Meo, R. (eds.): ECML PKDD 2014. LNCS (LNAI), vol. 8725. Springer, Heidelberg (2014). https://doi.org/10.1007/978-3-662-44851-9
Pawlak, Z., Skowron, A.: Rough sets: some extensions. Inf. Sci. 177(1), 28–40 (2007)
Ruiz, R., Aguilar–Ruiz, Jesús S., Riquelme, José C., Díaz–Díaz, N.: Analysis of Feature Rankings for Classification. In: Famili, A.Fazel, Kok, Joost N., Peña, José M., Siebes, A., Feelders, A. (eds.) IDA 2005. LNCS, vol. 3646, pp. 362–372. Springer, Heidelberg (2005). https://doi.org/10.1007/11552253_33
Shao, H., Li, G., Liu, G., Wang, Y.: Symptom selection for multi-label data of inquiry diagnosis in traditional chinese medicine. Sci. China Inf. Sci. 56(5), 1–13 (2013)
Slowinski, R., Vanderpooten, D.: A generalized definition of rough approximations based on similarity. IEEE Trans. Knowl. Data Eng. 12(2), 331–336 (2000)
Spyromitros, E., Tsoumakas, G., Vlahavas, I.: An Empirical Study of Lazy Multilabel Classification Algorithms. In: Darzentas, J., Vouros, George A., Vosinakis, S., Arnellos, A. (eds.) SETN 2008. LNCS (LNAI), vol. 5138, pp. 401–406. Springer, Heidelberg (2008). https://doi.org/10.1007/978-3-540-87881-0_40
Tsoumakas, G., Xioufis, E., Vilcek, J., Vlahavas, I.: Mulan multi-label dataset repository (2014). http://mulan.sourceforge.net/datasets.html
Wang, J., Yang, Y., Mao, J., Huang, Z., Huang, C., Xu, W.: CNN-RNN: a unified framework for multi-label image classification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 2285–2294 (2016)
Wilcoxon, F.: Individual comparisons by ranking methods. In: Kotz, S., Johnson, N.L. (eds.) Breakthroughs in Statistics, pp. 196–202. Springer, New York (1992). https://doi.org/10.1007/978-1-4612-4380-9_16
Wilson, D.R., Martinez, T.R.: Improved heterogeneous distance functions. J. Artif. Intell. Res. 6, 1–34 (1997)
Yao, Y.Y.: On generalizing rough set theory. In: Wang, G., Liu, Q., Yao, Y., Skowron, A. (eds.) RSFDGrC 2003. LNCS (LNAI), vol. 2639, pp. 44–51. Springer, Heidelberg (2003). https://doi.org/10.1007/3-540-39205-X_6
Yu, L., Liu, H.: Efficient feature selection via analysis of relevance and redundancy. J. Mach. Learn. Res. 5, 1205–1224 (2004)
Zhang, M.L., Zhou, Z.H.: ML-KNN: a lazy learning approach to multi-label learning. Pattern Recognit. 40(7), 2038–2048 (2007)
Zhang, M.L., Zhou, Z.H.: A review on multi-label learning algorithms. IEEE Trans. Knowl. Data Eng. 26(8), 1819–1837 (2014)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Switzerland AG
About this paper
Cite this paper
Pérez, G., Bello, M., Nápoles, G., García, M.M., Bello, R., Vanhoof, K. (2018). Expanding MLkNN Using Extended Rough Set Theory. In: Hernández Heredia, Y., Milián Núñez, V., Ruiz Shulcloper, J. (eds) Progress in Artificial Intelligence and Pattern Recognition. IWAIPR 2018. Lecture Notes in Computer Science(), vol 11047. Springer, Cham. https://doi.org/10.1007/978-3-030-01132-1_28
Download citation
DOI: https://doi.org/10.1007/978-3-030-01132-1_28
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-01131-4
Online ISBN: 978-3-030-01132-1
eBook Packages: Computer ScienceComputer Science (R0)