Abstract
During the last few years, Kernel methods have gained considerable attention for analyzing biological data for protein function prediction. Based on biological processes annotation of Yeast and GO(gene ontology), we constructed a kernel matrix to predict protein functions. We used measurement method about semantic similarity on GO and adaptive Hausdorff distance to successfully obtain protein similarity matrix, and furthermore, transformed protein similarity matrix to a undirected graph. Then, We developed a novel method that can learn optimal diffusion kernel from graph by maximizing kernel-target alignment. Experimental results illustrate that the kernel matrix generated by our formula has larger AUC value than ordinary diffusion kernel and those proposed before. Our method can even learn a common optimal kernel matrix for multiple predict tasks at one run. Furthermore, it can also be directly used to learn from various biolobical networks.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Tsuda, K., Noble, W.S.: Learning Kernels from Biological Networks by Maximizing Entropy. Bioinformatics 20, 326–333 (2004)
Lanckriet, G., Deng, M., Cristianini, M., Jordan, M., Noble, W.: Kernel-based Data Fusion and Its Application to Protein Function Prediction in Yeast. In: Pac. Symp. Biocomput., vol. 9, pp. 300–311 (2004)
Cristianini, N., Shawe-Taylor, J.: Kernel Methods for Pattern Analysis. China Machine Press, Beijing (2005)
Ashburner, M., Ball, C., Blake, J., Botstein, D.: Gene Ontology: Tool for the Unification of Biology. The Gene Ontology Consortium. Nat. Genet. 25, 25–29 (2000)
Resnik, P.: Semantic Similarity in A Taxonomy: An Information-based Measure and Its Application to Problems of Ambiguity in Natural Language. J. Artif. Intell. Res. 11, 95–130 (1999)
Lin, D.: An Information-theoretic Definition of Similarity. In: The 15th International Conference on Machine learning, pp. 296–304. Morgan Kaufmann Publishers, San Francisco (1998)
Kondor, R.I., Lafferty, J.: Diffusion Kernels on Graphs and other Discrete Input Spaces. In: Proc. Int. Conf. Machine Learning, pp. 315–322. Morgan Kaufmann Publishers, San Francisco (2002)
Nello, C., John, A.E.: On Kernel-target Alignment. In: Advances in Neural Information Processing Systems 14, pp. 367–373. MIT Press, Cambridge (2002)
Lanckriet, G.R., Cristianini, N., Bartlett, P., Laurent, E., Michael, I.G., Jordan, M.I.: Learning the Kernel Matrix with Semide Nite Programming. Journal of Machine Learning Research 5, 27–72 (2004)
Boyd, S., Vandenberghe, L.: Convex Optimization. Cambridge University Press, London (2004)
The Mathworks Incorporation: Matlab help, http://www.mathworks.cn/
Sonnenburg, S., Raetsch, G., Schaefer, C., Scholkopf, B.: Large scale multiple kernel learning. Journal of Machine Learning Research 7, 1531–1565 (2006)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Chen, Y., Li, Z., Liu, J. (2009). Learning Kernel Matrix from Gene Ontology and Annotation Data for Protein Function Prediction. In: Yu, W., He, H., Zhang, N. (eds) Advances in Neural Networks – ISNN 2009. ISNN 2009. Lecture Notes in Computer Science, vol 5553. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-01513-7_76
Download citation
DOI: https://doi.org/10.1007/978-3-642-01513-7_76
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-01512-0
Online ISBN: 978-3-642-01513-7
eBook Packages: Computer ScienceComputer Science (R0)