Abstract
In previous work, we devised an approach for multilabel classification based on an ensemble of Bayesian networks. It was characterized by efficient structural learning and high accuracy. Its shortcoming was the high computational complexity of MAP inference, which is necessary to identify the most probable joint configuration of all classes. In this work, we switch from the ensemble approach to a single-model approach, which yields important computational savings: the reduction in inference time is exponential in the difference between the treewidth of the single model and the number of classes. Moreover, we adopt a more sophisticated approach for the structural learning of the class subgraph. The proposed single model outperforms alternative approaches for multilabel classification such as binary relevance and ensemble of classifier chains.
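To make the role of MAP inference concrete, the following toy sketch (not the paper's method) contrasts the most probable joint configuration of the classes with the configuration obtained by taking each class's most probable value separately, as a binary-relevance-style predictor would. The posterior table `joint` and the helper names `joint_map`, `marginal_modes`, and `brute_force_size` are hypothetical and chosen only for illustration. The sketch also shows why exhaustive search over joint configurations grows exponentially with the number of binary classes.

```python
# Hypothetical posterior P(y1, y2 | features) over two binary class labels.
# These numbers are illustrative assumptions, not results from the paper.
joint = {(0, 0): 0.4, (0, 1): 0.0, (1, 0): 0.3, (1, 1): 0.3}


def joint_map(joint):
    """Return the single most probable joint configuration of all classes."""
    return max(joint, key=joint.get)


def marginal_modes(joint):
    """Return the per-class most probable values, each chosen independently."""
    n_classes = len(next(iter(joint)))
    modes = []
    for i in range(n_classes):
        # Marginal probability that class i is present.
        p_present = sum(p for y, p in joint.items() if y[i] == 1)
        modes.append(1 if p_present > 0.5 else 0)
    return tuple(modes)


def brute_force_size(n_classes):
    """Number of joint configurations an exhaustive MAP search would score."""
    return 2 ** n_classes


if __name__ == "__main__":
    print("joint MAP:     ", joint_map(joint))
    print("marginal modes:", marginal_modes(joint))
    print("configurations for 10 binary classes:", brute_force_size(10))
```

In this toy table the two predictions disagree, which is why identifying the most probable joint configuration requires inference over all classes together rather than class by class.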
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Corani, G., Antonucci, A., Mauá, D.D., Gabaglio, S. (2014). Trading off Speed and Accuracy in Multilabel Classification. In: van der Gaag, L.C., Feelders, A.J. (eds) Probabilistic Graphical Models. PGM 2014. Lecture Notes in Computer Science, vol 8754. Springer, Cham. https://doi.org/10.1007/978-3-319-11433-0_10
DOI: https://doi.org/10.1007/978-3-319-11433-0_10
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-11432-3
Online ISBN: 978-3-319-11433-0
eBook Packages: Computer Science, Computer Science (R0)