Expanding MLkNN Using Extended Rough Set Theory

Pérez, Gabriela; Bello, Marilyn; Nápoles, Gonzalo; García, María Matilde; Bello, Rafael; Vanhoof, Koen

doi:10.1007/978-3-030-01132-1_28

Gabriela Pérez¹⁶,
Marilyn Bello^16,17,
Gonzalo Nápoles¹⁷,
María Matilde García¹⁶,
Rafael Bello¹⁶ &
…
Koen Vanhoof¹⁷

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 11047))

Included in the following conference series:

International Workshop on Artificial Intelligence and Pattern Recognition

1186 Accesses
1 Citations

Abstract

Multi-label classification refers to the problem of associating an object with multiple labels. This problem has been successfully addressed from the perspective of problem transformation and adaptation of algorithms. Multi-Label k-Nearest Neighbour (MLkNN) is a lazy learner that has reported excellent results, still there is room for improvements. In this paper we propose a modification to the MLkNN algorithm for the solution to problems of multi-label classification based on the Extended Rough Set Theory. More explicitly, the key modifications are focused in obtaining the relevance of the attributes when computing the distance between two instances, which are obtained using a heuristic search method and a target function based on the quality of the similarity. Experimental results using synthetic datasets have shown promising prediction rates. It is worth mentioning the ability of our proposal to deal with inconsistent scenarios, a main shortcoming present in most state-of-the-art multi-label classification algorithms.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Bi, W., Kwok, J.T.: Multi-label classification on tree-and dag-structured hierarchies. In: Proceedings of the 28th International Conference on Machine Learning (ICML-11), pp. 17–24 (2011)
Google Scholar
Briggs, F., et al.: Acoustic classification of multiple simultaneous bird species: a multi-instance multi-label approach. J. Acoust. Soc. Am. 131(6), 4640–4650 (2012)
Article Google Scholar
Cakir, E., Heittola, T., Huttunen, H., Virtanen, T.: Polyphonic sound event detection using multi label deep neural networks. In: 2015 International Joint Conference on Neural Networks (IJCNN), pp. 1–7. IEEE (2015)
Google Scholar
Charte, F., Rivera, A.J., del Jesus, M.J., Herrera, F.: Quinta: a question tagging assistant to improve the answering ratio in electronic forums. In: EUROCON 2015-International Conference on Computer as a Tool (EUROCON), IEEE, pp. 1–6. IEEE (2015)
Google Scholar
Chin, K.S., Liang, J., Dang, C.: Rough set data analysis algorithms for incomplete information systems. In: Wang, G., Liu, Q., Yao, Y., Skowron, A. (eds.) RSFDGrC 2003. LNCS (LNAI), vol. 2639, pp. 264–268. Springer, Heidelberg (2003). https://doi.org/10.1007/3-540-39205-X_35
Chapter Google Scholar
Chou, K.C., Wu, Z.C., Xiao, X.: iLoc-Euk: a multi-label classifier for predicting the subcellular localization of singleplex and multiplex eukaryotic proteins. PLoS ONE 6(3), e18258 (2011)
Article Google Scholar
Clerc, M., Kennedy, J.: The particle swarm-explosion, stability, and convergence in a multidimensional complex space. IEEE Trans. Evol. Comput. 6(1), 58–73 (2002)
Article Google Scholar
Coello, L., Frías, M., Fernández, Y., Filiberto, Y., Bello, R., Caballero, Y.: Construcción de relaciones de similaridad borrosa basada en la medida calidad de la similaridad. Investig. Oper. 38(2), 132–140 (2018)
Google Scholar
Deza, M.M., Deza, E.: Encyclopedia of Distances, pp. 1–583. Springer, Berlin (2009). https://doi.org/10.1007/978-3-642-00234-2
Book MATH Google Scholar
Filiberto, Y.: Métodos de aprendiza je para dominios con datos mezclados basados en la teoría de los conjuntos aproximados extendida. Universidad Central de Las Villas (2012)
Google Scholar
Gibaja, E., Ventura, S.: Multi-label learning: a review of the state of the art and ongoing research. Wiley Interdiscip. Rev. Data Min. Knowl. Discov. 4(6), 411–444 (2014)
Article Google Scholar
Kennedy, J.: Particle swarm optimization. In: Sammut, C., Webb, G.I. (eds.) Encyclopedia of Machine Learning, pp. 760–766. Springer, Heidelberg (2011). https://doi.org/10.1007/978-0-387-30164-8
Chapter Google Scholar
Madjarov, G., Kocev, D., Gjorgjevikj, D., Džeroski, S.: An extensive experimental comparison of methods for multi-label learning. Pattern Recognit. 45(9), 3084–3104 (2012)
Article Google Scholar
Calders, T., Esposito, F., Hüllermeier, E., Meo, R. (eds.): ECML PKDD 2014. LNCS (LNAI), vol. 8725. Springer, Heidelberg (2014). https://doi.org/10.1007/978-3-662-44851-9
Book Google Scholar
Pawlak, Z., Skowron, A.: Rough sets: some extensions. Inf. Sci. 177(1), 28–40 (2007)
Article MathSciNet Google Scholar
Ruiz, R., Aguilar–Ruiz, Jesús S., Riquelme, José C., Díaz–Díaz, N.: Analysis of Feature Rankings for Classification. In: Famili, A.Fazel, Kok, Joost N., Peña, José M., Siebes, A., Feelders, A. (eds.) IDA 2005. LNCS, vol. 3646, pp. 362–372. Springer, Heidelberg (2005). https://doi.org/10.1007/11552253_33
Chapter MATH Google Scholar
Shao, H., Li, G., Liu, G., Wang, Y.: Symptom selection for multi-label data of inquiry diagnosis in traditional chinese medicine. Sci. China Inf. Sci. 56(5), 1–13 (2013)
Article MathSciNet Google Scholar
Slowinski, R., Vanderpooten, D.: A generalized definition of rough approximations based on similarity. IEEE Trans. Knowl. Data Eng. 12(2), 331–336 (2000)
Article Google Scholar
Spyromitros, E., Tsoumakas, G., Vlahavas, I.: An Empirical Study of Lazy Multilabel Classification Algorithms. In: Darzentas, J., Vouros, George A., Vosinakis, S., Arnellos, A. (eds.) SETN 2008. LNCS (LNAI), vol. 5138, pp. 401–406. Springer, Heidelberg (2008). https://doi.org/10.1007/978-3-540-87881-0_40
Chapter Google Scholar
Tsoumakas, G., Xioufis, E., Vilcek, J., Vlahavas, I.: Mulan multi-label dataset repository (2014). http://mulan.sourceforge.net/datasets.html
Wang, J., Yang, Y., Mao, J., Huang, Z., Huang, C., Xu, W.: CNN-RNN: a unified framework for multi-label image classification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 2285–2294 (2016)
Google Scholar
Wilcoxon, F.: Individual comparisons by ranking methods. In: Kotz, S., Johnson, N.L. (eds.) Breakthroughs in Statistics, pp. 196–202. Springer, New York (1992). https://doi.org/10.1007/978-1-4612-4380-9_16
Chapter Google Scholar
Wilson, D.R., Martinez, T.R.: Improved heterogeneous distance functions. J. Artif. Intell. Res. 6, 1–34 (1997)
Article MathSciNet Google Scholar
Yao, Y.Y.: On generalizing rough set theory. In: Wang, G., Liu, Q., Yao, Y., Skowron, A. (eds.) RSFDGrC 2003. LNCS (LNAI), vol. 2639, pp. 44–51. Springer, Heidelberg (2003). https://doi.org/10.1007/3-540-39205-X_6
Chapter Google Scholar
Yu, L., Liu, H.: Efficient feature selection via analysis of relevance and redundancy. J. Mach. Learn. Res. 5, 1205–1224 (2004)
MathSciNet MATH Google Scholar
Zhang, M.L., Zhou, Z.H.: ML-KNN: a lazy learning approach to multi-label learning. Pattern Recognit. 40(7), 2038–2048 (2007)
Article Google Scholar
Zhang, M.L., Zhou, Z.H.: A review on multi-label learning algorithms. IEEE Trans. Knowl. Data Eng. 26(8), 1819–1837 (2014)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, Central University “Marta Abreu” of Las Villas, Santa Clara, Cuba
Gabriela Pérez, Marilyn Bello, María Matilde García & Rafael Bello
Faculty of Business Economics, Hasselt University, Hasselt, Belgium
Marilyn Bello, Gonzalo Nápoles & Koen Vanhoof

Authors

Gabriela Pérez
View author publications
You can also search for this author in PubMed Google Scholar
Marilyn Bello
View author publications
You can also search for this author in PubMed Google Scholar
Gonzalo Nápoles
View author publications
You can also search for this author in PubMed Google Scholar
María Matilde García
View author publications
You can also search for this author in PubMed Google Scholar
Rafael Bello
View author publications
You can also search for this author in PubMed Google Scholar
Koen Vanhoof
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Marilyn Bello .

Editor information

Editors and Affiliations

Universidad de las Ciencias Informáticas, Havana, Cuba
Yanio Hernández Heredia
Universidad de las Ciencias Informáticas, Havana, Cuba
Vladimir Milián Núñez
Universidad de las Ciencias Informáticas, Havana, Cuba
José Ruiz Shulcloper

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Pérez, G., Bello, M., Nápoles, G., García, M.M., Bello, R., Vanhoof, K. (2018). Expanding MLkNN Using Extended Rough Set Theory. In: Hernández Heredia, Y., Milián Núñez, V., Ruiz Shulcloper, J. (eds) Progress in Artificial Intelligence and Pattern Recognition. IWAIPR 2018. Lecture Notes in Computer Science(), vol 11047. Springer, Cham. https://doi.org/10.1007/978-3-030-01132-1_28

Download citation

DOI: https://doi.org/10.1007/978-3-030-01132-1_28
Published: 22 September 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-01131-4
Online ISBN: 978-3-030-01132-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics