Abstract
In this paper, we introduce a Gene Expression Programming algorithm for multi label classification. This algorithm encodes each individual into a discriminant function that shows whether a pattern belongs to a given class or not. The algorithm also applies a niching technique to guarantee that the population includes functions for each existing class. In order to evaluate the quality of our algorithm, its performance is compared to that of four recently published algorithms. The results show that our proposal is the best in terms of accuracy, precision and recall.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Boutell, M.R., Luo, J., Shen, X., Brown, C.M.: Learning multi-label scene classification. Pattern Recognition 37(9), 1757–1771 (2004)
Li, T., Zhang, C., Zhu, S.: Empirical studies on multi-label classification. In: ICTAI 2006: Proceedings of the 18th IEEE International Conference on Tools with Artificial Intelligence, Washington, DC, USA, pp. 86–92. IEEE Computer Society, Los Alamitos (2006)
Li, T., Ogihara, M.: Detecting emotion in music. In: Proceedings of the 14th intern. conference on music information retrieval (ISMIR 2003), Baltimore, USA (2003)
Elisseeff, A., Weston, J.: A kernel method for multi-labelled classification. Advances in Neural Information Processing Systems 14, 681–687 (2001)
Rak, R., Kurgan, L.A., Reformat, M.: Multilabel associative classification categorization of medline articles into mesh keywords. IEEE Engineering in Medicine and Biology Magazine 26(2), 47–55 (2007)
Tsoumakas, G., Katakis, I.: Multi label classification: An overview. International Journal of Data Warehousing and Mining 3(3), 1–13 (2007)
Tsoumakas, G., Vlahavas, I.: Random k-labelsets: An ensemble method for multilabel classification. In: Kok, J.N., Koronacki, J., Lopez de Mantaras, R., Matwin, S., Mladenič, D., Skowron, A. (eds.) ECML 2007. LNCS, vol. 4701, pp. 406–417. Springer, Heidelberg (2007)
Clare, A., King, R.D.: Knowledge discovery in multi-label phenotype data. In: Siebes, A., De Raedt, L. (eds.) PKDD 2001. LNCS, vol. 2168, p. 42. Springer, Heidelberg (2001)
Zhang, M.L., Zhou, Z.H.: A k-nearest neighbor based algorithm for multi-label classification., vol. 2, pp. 718–721. The IEEE Computational Intelligence Society, Los Alamitos (2005)
Noh, H.G., Song, M.S., Park, S.H.: An unbiased method for constructing multilabel classification trees. Computational Statistics & Data Analysis 47(1), 149–164 (2004)
Ghamrawi, N., Mccallum, A.: Collective multi-label classification. In: CIKM 2005: Proceedings of the 14th ACM international conference on Information and knowledge management, New York, USA, pp. 195–200. ACM Press, New York (2005)
Crammer, K., Singer, Y.: A family of additive online algorithms for category ranking. The Journal of Machine Learning Research 3, 1025–1058 (2003)
Zhang, M.L., Zhou, X.H.: Multilabel neural networks with applications to functional genomics and text categorization. IEEE Transactions on Knowledge and Data Engineering 18(10), 1338–1351 (2006)
Rak, R., Kurgan, L., Reformat, M.: A tree-projection-based algorithm for multi-label recurrent-item associative-classification rule generation. Data & Knowledge Engineering 64(1), 171–197 (2008)
Schapire, R.E., Singer, Y.: Boostexter: A boosting-based system for text categorization. Machine Learning 39(2/3), 135–168 (2000)
Johnson, M., Cipolla, R.: Improved image annotation and labelling through multi label boosting. In: Proceedings of the British Machine Vision Conference (16th BMVC), British Machine Vision Association (BMVA), Oxford, U.K (2005)
Ferreira, C.: Gene expression programming:a new adaptative algorithm for solving problems. Complex Systems 13(2), 87–129 (2001)
Zhou, C., Xiao, W., Tirpak, T.M., Nelson, P.C.: Evolving accurate and compact classification rules with gene expression programming. IEEE Transactions on Evolutionary Computation 7(6), 519–531 (2003)
Han, J., Kamber, M.: Data Mining: Methods and Techniques, 2nd edn. Morgan Kaufmann, San Francisco (2006)
Wong, M.L., Leung, K.S.: Data Mining Using Grammar-Based Genetic Programming and Applications. Genetic Programming Series. Kluwer Academic Publishers, Dordrecht (2002)
Ventura, S., Romero, C., Zafra, A., Delgado, J.A., Hervás, C.: JCLEC: A Java framework for evolutionary computation. Soft Computing 12(4), 381–392 (2008)
Witten, I.H., Frank, E.: Data Mining: Practical machine learning tools and techniques, 2nd edn. Morgan Kaufmann, San Francisco (2005)
Sebastiani, F.: Machine learning in automated text categorization. ACM Comput. Surv. 34(1), 1–47 (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ávila, J.L., Gibaja, E.L., Ventura, S. (2009). Multi-label Classification with Gene Expression Programming. In: Corchado, E., Wu, X., Oja, E., Herrero, Á., Baruque, B. (eds) Hybrid Artificial Intelligence Systems. HAIS 2009. Lecture Notes in Computer Science(), vol 5572. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-02319-4_76
Download citation
DOI: https://doi.org/10.1007/978-3-642-02319-4_76
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-02318-7
Online ISBN: 978-3-642-02319-4
eBook Packages: Computer ScienceComputer Science (R0)