Abstract
O-glycosylation is the transfer of a sugar to a protein, and it can modulate the protein's function. To improve the prediction accuracy of O-glycosylation sites in proteins, we propose a method that combines kernel independent component analysis with a support vector machine (KICA + SVM). The experimental samples are encoded by sparse coding with window size w = 51; 48 kernel independent components (features) are extracted by kernel independent component analysis (KICA), and prediction (classification) is then performed in the feature space by a support vector machine (SVM). The experimental results show that KICA + SVM performs better than KPCA + SVM, ICA + SVM, and PCA + SVM. Furthermore, we examined the same protein sequences under various window sizes (w = 5, 7, 9, 11, 21, 31, 41, 51) and used the sum rule to combine all the pre-classifiers to improve prediction performance. The results indicate that the ensemble of KICA + SVM classifiers outperforms each individual pre-classifier. The prediction accuracy is about 90%.
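As an illustration of two steps named in the abstract, the sketch below shows one plausible way to sparse-encode a sequence window and to combine classifier outputs with the sum rule. It is not the author's implementation. The 21-symbol alphabet (20 residues plus a padding symbol `-`), the function names, and the example sequence and probabilities are all assumptions made for this sketch, and the KICA and SVM stages are omitted entirely.

```python
import numpy as np

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"   # 20 standard residues
ALPHABET = AMINO_ACIDS + "-"           # assumption: '-' pads windows that run past the sequence ends
INDEX = {aa: i for i, aa in enumerate(ALPHABET)}


def sparse_encode_window(sequence, center, w):
    """One-hot (sparse) encode the window of odd length w centred on `center`.

    Returns a flat vector of length w * 21 with exactly one 1 per window position.
    """
    if w % 2 == 0:
        raise ValueError("window size must be odd")
    half = w // 2
    encoded = np.zeros((w, len(ALPHABET)))
    for row, pos in enumerate(range(center - half, center + half + 1)):
        # Positions outside the sequence, and unknown symbols, map to the padding symbol.
        residue = sequence[pos] if 0 <= pos < len(sequence) else "-"
        encoded[row, INDEX.get(residue, INDEX["-"])] = 1.0
    return encoded.ravel()


def sum_rule(probabilities):
    """Combine classifiers by summing their class probabilities and taking the argmax.

    probabilities: array-like of shape (n_classifiers, n_samples, n_classes).
    """
    total = np.sum(np.asarray(probabilities, dtype=float), axis=0)
    return np.argmax(total, axis=1)


# Hypothetical sequence; index 3 is a serine (a candidate O-glycosylation residue).
seq = "MKTSPLTAAG"
x = sparse_encode_window(seq, center=3, w=5)
print(x.shape)  # (105,) because 5 positions * 21 symbols

# Hypothetical outputs of three pre-classifiers for two samples, columns = (negative, positive).
probs = [
    [[0.6, 0.4], [0.3, 0.7]],
    [[0.4, 0.6], [0.2, 0.8]],
    [[0.7, 0.3], [0.6, 0.4]],
]
print(sum_rule(probs))  # [0 1]
```

In an ensemble over window sizes, each pre-classifier would see the same site encoded with a different w, so the feature vectors differ in length while the class-probability outputs remain comparable and can be summed.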
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Chen, Z. (2015). Kernel Independent Component Analysis-Based Prediction on the Protein O-Glycosylation Sites Using Support Vectors Machine and Ensemble Classifiers. In: Huang, DS., Han, K. (eds) Advanced Intelligent Computing Theories and Applications. ICIC 2015. Lecture Notes in Computer Science, vol 9227. Springer, Cham. https://doi.org/10.1007/978-3-319-22053-6_67
DOI: https://doi.org/10.1007/978-3-319-22053-6_67
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-22052-9
Online ISBN: 978-3-319-22053-6
eBook Packages: Computer Science, Computer Science (R0)