Abstract
Support vector machines (SVM) have shown strong generalization ability in a number of application areas, including protein structure prediction. Bioinformatics techniques to protein secondary structure prediction mostly depend on the information available in amino acid sequence. In this study, a new sliding window scheme is introduced with multiple granular windows to form the protein data for training and testing SVM. Orthogonal encoding scheme coupled with BLOSUM62 matrix is used to make the prediction. The prediction of binary classifiers using multiple windows is compared with single window scheme, the results shows single window not to be good in all cases. New classifier is introduced for effective tertiary classification. The accuracy level of the new architectures are determined and compared with other studies. The tertiary architecture is better than most available techniques.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Rost, B., Sander, C.: Improved prediction of protein secondary structure by use of sequence profile and neural networks. Proc. Natl. Acad. Sci. U S A 90, 7558–7562 (1993)
Chandonia, J.M., Karplus, M.: New Method for accuracy prediction of protein secondary structure. Proteins 35, 293–306 (1999)
Hua, S., Sun, Z.: A Novel Method of Protein Secondary Structure Prediction with High Segment Overlap Measure: Support Vector Machine Approach. J. Mol. Biol. 308, 397–407 (2001)
Casbon, J.: Protein Secondary Structure Prediction with Support Vector Machines (2002)
Jones, D.T.: Protein Secondary Structure Prediction Based on Position-Specific-Scoring Matrices. J. Mol. Biol. 292, 195–202 (1999)
Hu, H., Yi, P.: Improved Secondary Structure Prediction Using Support Vector Machines with a New Encoding Scheme and an Advanced Tertiary Classifier. IEEE - Transaction on Nanobioscience 3(4) (2004)
Henikoff, S., Henikoff, J.G.: Amino acid substitution matrices from protein blocks. PNAS 89, 10915–10919 (1992)
Vapnik, V., Corter, C.: Support vector networks. Machine Learning 20, 273–293 (1995)
Christianini, N., Shawe-Taylor, J.: An introduction to Support Vector Machines. Cambridge University Press, Cambridge (2000)
Burges, C.J.C.: A Tutorial on Support Vector Machines for Pattern Recognition (1998), http://www.kernel-machines.org/papers/Burges98.ps.gz
Joachims, T.: SVM light (2002), http://www.cs.cornell.edu/People/tj/svm_light
Kim, H., Park, H.: Protein Secondary Structure Prediction Based on an Improved Support Vector Machines Approach. Protein Eng. 16, 553–560 (2003)
Reyaz-Ahmed, A., Zhang, Y.-Q.: Protein Secondary Structure Prediction Using Genetic Neural Support Vector Machines. In: Proc. of IEEE 7th International Conference on BioInformatics and BioEngineering, Boston, October 14-17, pp. 1355–1359 (2007)
Tang, Y.C., Jin, B., Zhang, Y.-Q.: Granular Support Vector Machines with Association Rules Mining for Protein Homology Prediction, Artificial Intelligence in Medicine. Special Issue on Computational Intelligence Techniques in Bioinformatics 35(1-2), 121–134 (2005)
Jin, B., Zhang, Y.-Q., Wang, B.H.: Granular Kernel Trees with Parallel Genetic Algorithms for Drug Activity Comparisons. International Journal of Data Mining and Bioinformatics 1(3), 270–285 (2007)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Reyaz-Ahmed, A., Zhang, Y. (2008). A New SVM-Based Decision Fusion Method Using Multiple Granular Windows for Protein Secondary Structure Prediction. In: Wang, G., Li, T., Grzymala-Busse, J.W., Miao, D., Skowron, A., Yao, Y. (eds) Rough Sets and Knowledge Technology. RSKT 2008. Lecture Notes in Computer Science(), vol 5009. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-79721-0_46
Download citation
DOI: https://doi.org/10.1007/978-3-540-79721-0_46
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-79720-3
Online ISBN: 978-3-540-79721-0
eBook Packages: Computer ScienceComputer Science (R0)