Abstract
Inductive Logic Programming (ILP) systems have been successfully applied to solve binary classification problems. It remains an open question how an accurate solution to a multi-class problem can be obtained by using a logic based learning method. In this paper we present a novel logic based approach to solve challenging multi-class classification problems. Our technique is based on the use of large margin methods in conjunction with the kernels constructed from first order rules induced by an ILP system. The proposed approach learns a multi-class classifier by using a divide and conquer reduction strategy that splits multi-classes into binary groups and solves each individual problem recursively hence generating an underlying decision list structure. We also study the well known one-vs-all scheme in conjunction with logic-based kernel learning. In order to construct a highly informative logical and relational space we introduce a low dimensional embedding method. The technique is amenable to skewed/non-skewed class distribution where multi-class problems such as protein fold recognition are generally characterized by highly uneven class distribution. We performed a series of experiments to evaluate the proposed rule selection and multi-class schemes. The methods were applied to solve challenging problems in computation biology and bioinformatics, namely multi-class protein fold recognition and mutagenicity detection. Experimental comparisons of the performance of large margin first order decision list based multi-class scheme with the standard multi-class ILP algorithm and multi-class Support Vector Machine yielded statistically significant results. The results also demonstrated a favorable comparison between the performances of decision list based scheme and one-vs-all strategy.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Muggleton, S., Lodhi, H., Amini, A., Sternberg, M.J.E.: Support Vector Inductive Logic Programming. In: Hoffmann, A., Motoda, H., Scheffer, T. (eds.) DS 2005. LNCS (LNAI), vol. 3735, pp. 163–175. Springer, Heidelberg (2005)
Landwehr, N., Passerini, A., Raedt, L., Frasconi, P.: kFOIL: Learning simple relational kernels. In: Proceedings of the National Conference on Artificial Intelligence (AAAI), vol. 21, pp. 389–394 (2006)
Ruckert, U., Kramer, S.: Margin-base first-order rule learning. Machine Learning 70(2-3), 189–206 (2008)
Ding, C.H., Dubchak, I.: Multi-class protein fold recognition using support vector machines and neural networks. Bioinformatics 17, 349–358 (2001)
Allwein, E.L., Schapire, R.E., Singer, Y.: Reducing multiclass to binary: a unifying approach for margin classifiers. Journal of Machine Learning Research 1, 113–141 (2000)
Platt, J.C., Cristianini, N., Shawe-Taylor, J.: Large margin dags for multiclass classification. In: Advances in Neural Information Processing Systems, vol. 12, pp. 457–553. MIT Press, Cambridge (2000)
Rifkin, R., Klautau, A.: In defense of one-vs-all classification. Journal of Machine Learning Research 5, 101–141 (2004)
Krebel, U.: Pairwise classification and support vector machines. In: Advances in Kernel Methods: Support Vector Learning, pp. 255–268. MIT Press, Cambridge (1999)
Riverst, R.: Learning decision list. Machine Learning 2(3), 229–246 (1987)
Mooney, R.J., Califf, M.E.: Induction of first-order decision lits: results on learning the past tense of english verbs. Journal of Artificial Intelligence Research 3, 1–24 (1995)
Quinlan, J.: Learning logical definitions from relations. Machine Learning 5(3), 239–266 (1990)
Laer, W.V., de Raedt, L., Dzeroski, S.: On multi-class problems and discretization in Iductive Logic Programming. In: Proceedings of the 10th International Symposium on Foundations of Intelligent Systems, pp. 277–286 (1997)
Vapnik, V.: The Nature of Statistical Learning Theory. Springer, New York (1995)
Muggleton, S.: Inverse entailment and progol. New Generation Computing 13, 245–286 (1995)
Joachims, T.: Making large–scale SVM learning practical. In: Schölkopf, B., Burges, C.J.C., Smola, A.J. (eds.) Advances in Kernel Methods — Support Vector Learning, pp. 169–184. MIT Press, Cambridge (1999)
Debnath, A.K., de Compadre, R.L.L., Debnath, G., Schusterman, A.J., Hansch, C.: Structure-activity relationship of mutagenic aromatic and heteroaromatics nito compounds. correlation with molecular orbital energies and hydrophobicity. Journal of Medicinal Chemistry 34(2), 786–797 (1991)
Lodhi, H., Muggleton, S.: Is mutagenesis still challenging? In: International Conference on Inductive Logic Programming (ILP - Late-Breaking Papers), pp. 35–40 (2005)
Shamim, M., Anwaruddin, M., Nagarajaram, H.A.N.J.: Support Vector Machine based classification of protein folds using the structural properties of amino acid residue pairs. Bioinformatocs (2006)
Chen, K., Kurgan, L.: PFRES: Protein fold classification by using evolutionary information and predicted secondary structure. Bioinformatics 23, 2843–2850 (2007)
Shen, H.B., Chou, C.K.: Ensemble classifier for protein fold recognition. Bioinformatics 22, 1717–1722 (2006)
Murzin, A.G., Brenner, S.E., Hubbard, T., Chothia, C.: SCOP: a structural classification of proteins database for the investigation of sequences and structures. J. Mol. Biol. 247, 536–540 (1995)
Turcotte, M., Muggleton, S., Sternberg, J.E.: Automated discovery of structural signatures of protein fold and function. J. Mol. Biol. 306, 591–605 (2001)
Crammer, K., Singer, Y.: On the algorithmic implementation of multi-class svms. In: JMLR (2001)
Cootes, A.P., Muggleton, S., Sternberg, M.J.: The automatic discovery of structural principles describing protein fold space. Journal of Molecular Biology 330(4), 839–850 (2003)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lodhi, H., Muggleton, S., Sternberg, M.J.E. (2009). Learning Large Margin First Order Decision Lists for Multi-Class Classification. In: Gama, J., Costa, V.S., Jorge, A.M., Brazdil, P.B. (eds) Discovery Science. DS 2009. Lecture Notes in Computer Science(), vol 5808. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04747-3_15
Download citation
DOI: https://doi.org/10.1007/978-3-642-04747-3_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-04746-6
Online ISBN: 978-3-642-04747-3
eBook Packages: Computer ScienceComputer Science (R0)