Abstract
We propose here a multi-label semi-supervised learning algorithm, PfunBG, to predict protein functions, employing a bi-relational graph (BG) of proteins and function annotations. Different from most, if not all, existing methods that only consider the partially labeled protein-protein interaction (PPI) network, the BG comprises three components, a PPI network, a function class graph induced from function annotations of such proteins, and a bipartite graph induced from function assignments. By referring to proteins and function classes equally as vertices, we exploit network propagation to measure how closely a specific function class is related to a protein of interest. The experiments on a yeast PPI network illustrate its effectiveness and efficiency.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Breitkreutz, B.J., Stark, C., Reguly, T., et al.: The BioGRID Interaction Database: 2008 update. Nucleic Acids Res. 36(Database issue), D637–D640 (2008)
Can, T., Camoğlu, O., Singh, A.K.: Analysis of proteinprotein interaction networks using random walks. In Proc. 5th International Workshop on Bioinformatics, pp. 61–68 (2005)
Chen, G., Song, Y., Wang, F., Zhang, C.: Semi-supervised Multi-label Learning by Solving a Sylvester Equation. In: SIAM International Conference on Data Mining (2008)
Erten, S., Bebek, G., Koyutürk, M.: Disease Gene Prioritization Based on Topological Similarity in Protein-Protein Interaction Networks. In: Bafna, V., Sahinalp, S.C. (eds.) RECOMB 2011. LNCS (LNBI), vol. 6577, pp. 54–68. Springer, Heidelberg (2011)
Fan, R.-E., Lin, C.-J.: A Study on Threshold Selection for Multi-label Classification. Technical Report, National Taiwan University (2007)
Haveliwala, T.H.: Topic-sensitive pagerank. In: 11th International World Wide Web Conference (WWW), pp. 517–526 (2002)
Hishigaki, H., Nakai, K., Ono, T., Tanigami, A., Takagi, T.: Assessment of prediction accuracy of protein function from proteinCprotein interaction data. Yeast 18, 523–531 (2001)
Jiang, J.Q.: Multi-label Correlated Semi-supervised Learning for Protein Function Prediction. In: Chen, J., Wang, J., Zelikovsky, A. (eds.) ISBRA 2011. LNCS (LNBI), vol. 6674, pp. 368–379. Springer, Heidelberg (2011)
Karaoz, U., Murali, T.M., Letovsky, S., Zheng, Y., Ding, C., Cantor, C.R., Kasif, S.: Whole-genome annotation by using evidence integration in functional-linkage networks. Proc. Natl. Acad. Sci. USA 101, 2888–2893 (2004)
Nabieva, E., Jim, K., Agarwal, A., Chazelle, B., Singh, M.: Whole-proteome prediction of protein function via graph-theoretic analysis of interaction maps. Bioinformatics 21(suppl. 1), 302–310 (2005)
Pan, J.-Y., Yang, H.-J., Faloutsos, C., Duygulu, F.: Automatic Multimedia Cross-modal Correlation Discovery. In: The 10th ACM SIGKDD Conference, Seattle, WA, August 22-25 (2004)
Pavlidis, P., Weston, J., Cai, J., Grundy, W.N.: Gene functional classification from heterogeneous data. In: Proceedings of the Fifth Annual International Conference on Computational Biology. ACM Press, Montreal (2001)
Schwikowski, B., Uetz, P., Fields, S.: A network of proteinCprotein interactions in yeast. Nat. Biotechnol. 18, 1257–1261 (2000)
Vazquez, A., Flammini, A., Maritan, A., Vespignani, A.: Global protein function prediction from proteinCprotein interaction networks. Nat. Biotechnol. 21, 697–700 (2003)
Wang, H., Huang, H., Ding, C.: Image annotation using multi-label correlated Green’s function. In: IEEE International Conference on Computer Vision (2009)
Wang, H., Huang, H., Ding, C.: Image Annotation Using Bi-Relational Graph of Images and Semantic Labels. In: IEEE International Conference on Computer Vision and Pattern Recognition (2001)
Zha, Z., Mei, T., Wang, J., Wang, Z., Hua, X.: Graph-based semi-supervised learning with multi-label. In: IEEE International Conference on Multiamedia and Expo (2008)
Zhou, D., Bousquet, O., Lal, T.N., Weston, J., Scholkopf, B.: Learning with local and global consistency. In: Advances in Neural Information Processing Systems (NIPS), vol. 16, pp. 321–328. MIT Press, Cambridge (2004)
Zhu, X.: Semi-supervised learning literature survey. Technical Report 1530, Department of Computer Sciences, University of Wisconsin, Madison (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Jiang, J.Q. (2011). Learning Protein Functions from Bi-relational Graph of Proteins and Function Annotations. In: Przytycka, T.M., Sagot, MF. (eds) Algorithms in Bioinformatics. WABI 2011. Lecture Notes in Computer Science(), vol 6833. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23038-7_12
Download citation
DOI: https://doi.org/10.1007/978-3-642-23038-7_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-23037-0
Online ISBN: 978-3-642-23038-7
eBook Packages: Computer ScienceComputer Science (R0)