Skip to main content

Learning Large Margin First Order Decision Lists for Multi-Class Classification

  • Conference paper
Discovery Science (DS 2009)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5808))

Included in the following conference series:

Abstract

Inductive Logic Programming (ILP) systems have been successfully applied to solve binary classification problems. It remains an open question how an accurate solution to a multi-class problem can be obtained by using a logic based learning method. In this paper we present a novel logic based approach to solve challenging multi-class classification problems. Our technique is based on the use of large margin methods in conjunction with the kernels constructed from first order rules induced by an ILP system. The proposed approach learns a multi-class classifier by using a divide and conquer reduction strategy that splits multi-classes into binary groups and solves each individual problem recursively hence generating an underlying decision list structure. We also study the well known one-vs-all scheme in conjunction with logic-based kernel learning. In order to construct a highly informative logical and relational space we introduce a low dimensional embedding method. The technique is amenable to skewed/non-skewed class distribution where multi-class problems such as protein fold recognition are generally characterized by highly uneven class distribution. We performed a series of experiments to evaluate the proposed rule selection and multi-class schemes. The methods were applied to solve challenging problems in computation biology and bioinformatics, namely multi-class protein fold recognition and mutagenicity detection. Experimental comparisons of the performance of large margin first order decision list based multi-class scheme with the standard multi-class ILP algorithm and multi-class Support Vector Machine yielded statistically significant results. The results also demonstrated a favorable comparison between the performances of decision list based scheme and one-vs-all strategy.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Muggleton, S., Lodhi, H., Amini, A., Sternberg, M.J.E.: Support Vector Inductive Logic Programming. In: Hoffmann, A., Motoda, H., Scheffer, T. (eds.) DS 2005. LNCS (LNAI), vol. 3735, pp. 163–175. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  2. Landwehr, N., Passerini, A., Raedt, L., Frasconi, P.: kFOIL: Learning simple relational kernels. In: Proceedings of the National Conference on Artificial Intelligence (AAAI), vol. 21, pp. 389–394 (2006)

    Google Scholar 

  3. Ruckert, U., Kramer, S.: Margin-base first-order rule learning. Machine Learning 70(2-3), 189–206 (2008)

    Article  Google Scholar 

  4. Ding, C.H., Dubchak, I.: Multi-class protein fold recognition using support vector machines and neural networks. Bioinformatics 17, 349–358 (2001)

    Article  Google Scholar 

  5. Allwein, E.L., Schapire, R.E., Singer, Y.: Reducing multiclass to binary: a unifying approach for margin classifiers. Journal of Machine Learning Research 1, 113–141 (2000)

    MathSciNet  MATH  Google Scholar 

  6. Platt, J.C., Cristianini, N., Shawe-Taylor, J.: Large margin dags for multiclass classification. In: Advances in Neural Information Processing Systems, vol. 12, pp. 457–553. MIT Press, Cambridge (2000)

    Google Scholar 

  7. Rifkin, R., Klautau, A.: In defense of one-vs-all classification. Journal of Machine Learning Research 5, 101–141 (2004)

    MathSciNet  MATH  Google Scholar 

  8. Krebel, U.: Pairwise classification and support vector machines. In: Advances in Kernel Methods: Support Vector Learning, pp. 255–268. MIT Press, Cambridge (1999)

    Google Scholar 

  9. Riverst, R.: Learning decision list. Machine Learning 2(3), 229–246 (1987)

    Google Scholar 

  10. Mooney, R.J., Califf, M.E.: Induction of first-order decision lits: results on learning the past tense of english verbs. Journal of Artificial Intelligence Research 3, 1–24 (1995)

    Google Scholar 

  11. Quinlan, J.: Learning logical definitions from relations. Machine Learning 5(3), 239–266 (1990)

    Google Scholar 

  12. Laer, W.V., de Raedt, L., Dzeroski, S.: On multi-class problems and discretization in Iductive Logic Programming. In: Proceedings of the 10th International Symposium on Foundations of Intelligent Systems, pp. 277–286 (1997)

    Google Scholar 

  13. Vapnik, V.: The Nature of Statistical Learning Theory. Springer, New York (1995)

    Book  MATH  Google Scholar 

  14. Muggleton, S.: Inverse entailment and progol. New Generation Computing 13, 245–286 (1995)

    Article  Google Scholar 

  15. Joachims, T.: Making large–scale SVM learning practical. In: Schölkopf, B., Burges, C.J.C., Smola, A.J. (eds.) Advances in Kernel Methods — Support Vector Learning, pp. 169–184. MIT Press, Cambridge (1999)

    Google Scholar 

  16. Debnath, A.K., de Compadre, R.L.L., Debnath, G., Schusterman, A.J., Hansch, C.: Structure-activity relationship of mutagenic aromatic and heteroaromatics nito compounds. correlation with molecular orbital energies and hydrophobicity. Journal of Medicinal Chemistry 34(2), 786–797 (1991)

    Article  Google Scholar 

  17. Lodhi, H., Muggleton, S.: Is mutagenesis still challenging? In: International Conference on Inductive Logic Programming (ILP - Late-Breaking Papers), pp. 35–40 (2005)

    Google Scholar 

  18. Shamim, M., Anwaruddin, M., Nagarajaram, H.A.N.J.: Support Vector Machine based classification of protein folds using the structural properties of amino acid residue pairs. Bioinformatocs (2006)

    Google Scholar 

  19. Chen, K., Kurgan, L.: PFRES: Protein fold classification by using evolutionary information and predicted secondary structure. Bioinformatics 23, 2843–2850 (2007)

    Article  Google Scholar 

  20. Shen, H.B., Chou, C.K.: Ensemble classifier for protein fold recognition. Bioinformatics 22, 1717–1722 (2006)

    Article  Google Scholar 

  21. Murzin, A.G., Brenner, S.E., Hubbard, T., Chothia, C.: SCOP: a structural classification of proteins database for the investigation of sequences and structures. J. Mol. Biol. 247, 536–540 (1995)

    Google Scholar 

  22. Turcotte, M., Muggleton, S., Sternberg, J.E.: Automated discovery of structural signatures of protein fold and function. J. Mol. Biol. 306, 591–605 (2001)

    Article  Google Scholar 

  23. Crammer, K., Singer, Y.: On the algorithmic implementation of multi-class svms. In: JMLR (2001)

    Google Scholar 

  24. Cootes, A.P., Muggleton, S., Sternberg, M.J.: The automatic discovery of structural principles describing protein fold space. Journal of Molecular Biology 330(4), 839–850 (2003)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Lodhi, H., Muggleton, S., Sternberg, M.J.E. (2009). Learning Large Margin First Order Decision Lists for Multi-Class Classification. In: Gama, J., Costa, V.S., Jorge, A.M., Brazdil, P.B. (eds) Discovery Science. DS 2009. Lecture Notes in Computer Science(), vol 5808. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04747-3_15

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-04747-3_15

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-04746-6

  • Online ISBN: 978-3-642-04747-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics