Abstract
In this paper we extend kernel functions defined on generative models to embed phylogenetic information into a discriminative learning approach. We describe three generative tree kernels, a Fisher kernel, a sufficient statistics kernel and a probability product kernel, whose key features are the adaptivity to the input domain and the ability to deal with structured data. In particular, kernel adaptivity is obtained through the estimation of a tree structured model of evolution starting from the phylogenetic profiles encoding the presence or absence of specific proteins in a set of fully sequenced genomes. We report preliminary results obtained by these kernels in the prediction of the functional class of the proteins of S. Cervisae, together with comparisons to a standard vector based kernel and to a non-adaptive tree kernel function.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Liberales, D.A., Thoren, A., von Heijne, G., Eloffson, A.: The use of phylogenetic profiles for gene function prediction. Current Genomics 3, 131–137 (2002)
Vert, J.P.: A tree kernel to analyze phylogenetic profiles. Bioinformatics 18, S276–S284 (2002)
Pavlidis, P., Weston, J., Cai, J., Grundy, N.W.: Gene functional classification from heterogeneous data. In: Proceedings of the Fifth International Conference on Computational Molecular Biology, 242–248 (2001)
Baldi, P., Brunak, S.: Bioinformatics: the Machine Learning Approach, 2nd edn. MIT Press, Cambridge, MA (2001)
Shawe-Taylor, J., Cristianini, N.: Kernel Methods for Pattern Analysis. Cambridge University Press, New York (2004)
Jaakkola, T., Haussler, D.: Exploiting generative models in discriminative classifiers. In: Kearns, M.S., Solla, S.A., Cohn, D.A. (eds.) Advances in neural information processing systems, vol. 11, pp. 487–493. MIT Press, Cambridge (1999)
Jebara, T., Kondor, R., Howard, A.: Probability product kernels. Journal of Machine Learning Research 5, 819–844 (2004)
Nicotra, L., Micheli, A., Starita, A.: Fisher kernel for Tree Structured Data. In: Proceedings of the IEEE International Joint Conference of Neural Networks, pp. 1917–1922. IEEE, New York (2004)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Nicotra, L., Micheli, A., Starita, A. (2007). Generative Kernels for Gene Function Prediction Through Probabilistic Tree Models of Evolution. In: Masulli, F., Mitra, S., Pasi, G. (eds) Applications of Fuzzy Sets Theory. WILF 2007. Lecture Notes in Computer Science(), vol 4578. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-73400-0_65
Download citation
DOI: https://doi.org/10.1007/978-3-540-73400-0_65
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-73399-7
Online ISBN: 978-3-540-73400-0
eBook Packages: Computer ScienceComputer Science (R0)