Skip to main content

Expanding MLkNN Using Extended Rough Set Theory

  • Conference paper
  • First Online:
Progress in Artificial Intelligence and Pattern Recognition (IWAIPR 2018)

Abstract

Multi-label classification refers to the problem of associating an object with multiple labels. This problem has been successfully addressed from the perspective of problem transformation and adaptation of algorithms. Multi-Label k-Nearest Neighbour (MLkNN) is a lazy learner that has reported excellent results, still there is room for improvements. In this paper we propose a modification to the MLkNN algorithm for the solution to problems of multi-label classification based on the Extended Rough Set Theory. More explicitly, the key modifications are focused in obtaining the relevance of the attributes when computing the distance between two instances, which are obtained using a heuristic search method and a target function based on the quality of the similarity. Experimental results using synthetic datasets have shown promising prediction rates. It is worth mentioning the ability of our proposal to deal with inconsistent scenarios, a main shortcoming present in most state-of-the-art multi-label classification algorithms.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Bi, W., Kwok, J.T.: Multi-label classification on tree-and dag-structured hierarchies. In: Proceedings of the 28th International Conference on Machine Learning (ICML-11), pp. 17–24 (2011)

    Google Scholar 

  2. Briggs, F., et al.: Acoustic classification of multiple simultaneous bird species: a multi-instance multi-label approach. J. Acoust. Soc. Am. 131(6), 4640–4650 (2012)

    Article  Google Scholar 

  3. Cakir, E., Heittola, T., Huttunen, H., Virtanen, T.: Polyphonic sound event detection using multi label deep neural networks. In: 2015 International Joint Conference on Neural Networks (IJCNN), pp. 1–7. IEEE (2015)

    Google Scholar 

  4. Charte, F., Rivera, A.J., del Jesus, M.J., Herrera, F.: Quinta: a question tagging assistant to improve the answering ratio in electronic forums. In: EUROCON 2015-International Conference on Computer as a Tool (EUROCON), IEEE, pp. 1–6. IEEE (2015)

    Google Scholar 

  5. Chin, K.S., Liang, J., Dang, C.: Rough set data analysis algorithms for incomplete information systems. In: Wang, G., Liu, Q., Yao, Y., Skowron, A. (eds.) RSFDGrC 2003. LNCS (LNAI), vol. 2639, pp. 264–268. Springer, Heidelberg (2003). https://doi.org/10.1007/3-540-39205-X_35

    Chapter  Google Scholar 

  6. Chou, K.C., Wu, Z.C., Xiao, X.: iLoc-Euk: a multi-label classifier for predicting the subcellular localization of singleplex and multiplex eukaryotic proteins. PLoS ONE 6(3), e18258 (2011)

    Article  Google Scholar 

  7. Clerc, M., Kennedy, J.: The particle swarm-explosion, stability, and convergence in a multidimensional complex space. IEEE Trans. Evol. Comput. 6(1), 58–73 (2002)

    Article  Google Scholar 

  8. Coello, L., Frías, M., Fernández, Y., Filiberto, Y., Bello, R., Caballero, Y.: Construcción de relaciones de similaridad borrosa basada en la medida calidad de la similaridad. Investig. Oper. 38(2), 132–140 (2018)

    Google Scholar 

  9. Deza, M.M., Deza, E.: Encyclopedia of Distances, pp. 1–583. Springer, Berlin (2009). https://doi.org/10.1007/978-3-642-00234-2

    Book  MATH  Google Scholar 

  10. Filiberto, Y.: Métodos de aprendiza je para dominios con datos mezclados basados en la teoría de los conjuntos aproximados extendida. Universidad Central de Las Villas (2012)

    Google Scholar 

  11. Gibaja, E., Ventura, S.: Multi-label learning: a review of the state of the art and ongoing research. Wiley Interdiscip. Rev. Data Min. Knowl. Discov. 4(6), 411–444 (2014)

    Article  Google Scholar 

  12. Kennedy, J.: Particle swarm optimization. In: Sammut, C., Webb, G.I. (eds.) Encyclopedia of Machine Learning, pp. 760–766. Springer, Heidelberg (2011). https://doi.org/10.1007/978-0-387-30164-8

    Chapter  Google Scholar 

  13. Madjarov, G., Kocev, D., Gjorgjevikj, D., Džeroski, S.: An extensive experimental comparison of methods for multi-label learning. Pattern Recognit. 45(9), 3084–3104 (2012)

    Article  Google Scholar 

  14. Calders, T., Esposito, F., Hüllermeier, E., Meo, R. (eds.): ECML PKDD 2014. LNCS (LNAI), vol. 8725. Springer, Heidelberg (2014). https://doi.org/10.1007/978-3-662-44851-9

    Book  Google Scholar 

  15. Pawlak, Z., Skowron, A.: Rough sets: some extensions. Inf. Sci. 177(1), 28–40 (2007)

    Article  MathSciNet  Google Scholar 

  16. Ruiz, R., Aguilar–Ruiz, Jesús S., Riquelme, José C., Díaz–Díaz, N.: Analysis of Feature Rankings for Classification. In: Famili, A.Fazel, Kok, Joost N., Peña, José M., Siebes, A., Feelders, A. (eds.) IDA 2005. LNCS, vol. 3646, pp. 362–372. Springer, Heidelberg (2005). https://doi.org/10.1007/11552253_33

    Chapter  MATH  Google Scholar 

  17. Shao, H., Li, G., Liu, G., Wang, Y.: Symptom selection for multi-label data of inquiry diagnosis in traditional chinese medicine. Sci. China Inf. Sci. 56(5), 1–13 (2013)

    Article  MathSciNet  Google Scholar 

  18. Slowinski, R., Vanderpooten, D.: A generalized definition of rough approximations based on similarity. IEEE Trans. Knowl. Data Eng. 12(2), 331–336 (2000)

    Article  Google Scholar 

  19. Spyromitros, E., Tsoumakas, G., Vlahavas, I.: An Empirical Study of Lazy Multilabel Classification Algorithms. In: Darzentas, J., Vouros, George A., Vosinakis, S., Arnellos, A. (eds.) SETN 2008. LNCS (LNAI), vol. 5138, pp. 401–406. Springer, Heidelberg (2008). https://doi.org/10.1007/978-3-540-87881-0_40

    Chapter  Google Scholar 

  20. Tsoumakas, G., Xioufis, E., Vilcek, J., Vlahavas, I.: Mulan multi-label dataset repository (2014). http://mulan.sourceforge.net/datasets.html

  21. Wang, J., Yang, Y., Mao, J., Huang, Z., Huang, C., Xu, W.: CNN-RNN: a unified framework for multi-label image classification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 2285–2294 (2016)

    Google Scholar 

  22. Wilcoxon, F.: Individual comparisons by ranking methods. In: Kotz, S., Johnson, N.L. (eds.) Breakthroughs in Statistics, pp. 196–202. Springer, New York (1992). https://doi.org/10.1007/978-1-4612-4380-9_16

    Chapter  Google Scholar 

  23. Wilson, D.R., Martinez, T.R.: Improved heterogeneous distance functions. J. Artif. Intell. Res. 6, 1–34 (1997)

    Article  MathSciNet  Google Scholar 

  24. Yao, Y.Y.: On generalizing rough set theory. In: Wang, G., Liu, Q., Yao, Y., Skowron, A. (eds.) RSFDGrC 2003. LNCS (LNAI), vol. 2639, pp. 44–51. Springer, Heidelberg (2003). https://doi.org/10.1007/3-540-39205-X_6

    Chapter  Google Scholar 

  25. Yu, L., Liu, H.: Efficient feature selection via analysis of relevance and redundancy. J. Mach. Learn. Res. 5, 1205–1224 (2004)

    MathSciNet  MATH  Google Scholar 

  26. Zhang, M.L., Zhou, Z.H.: ML-KNN: a lazy learning approach to multi-label learning. Pattern Recognit. 40(7), 2038–2048 (2007)

    Article  Google Scholar 

  27. Zhang, M.L., Zhou, Z.H.: A review on multi-label learning algorithms. IEEE Trans. Knowl. Data Eng. 26(8), 1819–1837 (2014)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Marilyn Bello .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Pérez, G., Bello, M., Nápoles, G., García, M.M., Bello, R., Vanhoof, K. (2018). Expanding MLkNN Using Extended Rough Set Theory. In: Hernández Heredia, Y., Milián Núñez, V., Ruiz Shulcloper, J. (eds) Progress in Artificial Intelligence and Pattern Recognition. IWAIPR 2018. Lecture Notes in Computer Science(), vol 11047. Springer, Cham. https://doi.org/10.1007/978-3-030-01132-1_28

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-01132-1_28

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-01131-4

  • Online ISBN: 978-3-030-01132-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics