Abstract
Proteins are complex molecules, each comprised of its own combination of twenty different amino acids. Protein secondary structure is a polypeptide that has formed an arrangement of amino acids that are located next to one another in a linear fashion. Protein secondary structure prediction refers to the prediction of the conformational state of each amino acid residue of a protein sequence as one of the three possible states, namely helices, strands, or coils, denoted as H, E, and C, respectively. Protein sequence is the only resource that provides the information to survive denaturing process, so it is essential to find the secondary structure of a protein sequence. The existing methodology uses only one hydrophobicity scale called Kyte-Doolittle whereas in this paper three scales such as, Kyte-Doolittle scale, Hopp-Woods scale and Rose scale are used for protein secondary structure prediction. This Paper formulates secondary structure prediction task as sequence labeling and a new coding scheme is introduced with multiple windows to predict secondary structure of proteins using hydrophobicity scales. Protein sequences with their physical and chemical properties are learned using SVMhmm that creates a learned model, which is then used to predict protein secondary structure of an unknown primary sequence. It is reported 77.11% accuracy based on Q3 measures, when SVMhmm is used.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Salamov, A.A., Solovyev, V.V.: Protein secondary structure prediction using local alignments. J. Mol. Biol. 268, 31–36 (1997)
Chou, P.Y., Fasman, G.D.: Prediction of secondary structure of proteins from their amino acid sequence. Advance in Enzymology and Related Areas of Molecular Biology 47, 45–148 (1978)
Asai, K., Hayamizu, K.I., Handa, S.: Prediction of Protein Secondary Structure by the Hidden Markov Model. Computer Application in Biosciences 7, 141–146 (1999)
Weng, B., Xuan, G., Kolodzey, J., Barner, K.E.: Empirical mode decomposition as a tool for DNA sequence analysis from terahertz spectroscopy measurements. In: IEEE International Workshop on Genomic Signal Processing and Statistics, May 28-30, pp. 63–64 (2006)
Qian, N., Sejnowski, T.J.: Predicting the secondary structure of globular proteins using neural network models. Journal of Molecular Biology 202, 865–884 (1988)
Kim, H., Park, H.: Protein secondary structure prediction based on an improved support vector machines approach. Protein Engineering 16, 553–560 (2003)
Wang, J.T.L., Zaki, M.J., Toivonen, H., Shasha, D.E. (eds.): Data mining in bioinformatics, 1st edn., vol. XI, p. 340 (2005), 110 illus
Kabsch, W., Sander, C.: Dictionary of protein secondary structure: Pattern recognition of hydrogen-bonded and geometrical features. Biopolymers 22, 2577–2637 (1983)
Sander, C., Schneider, R.: Database of homology-derived protein structures and the structural meaning of sequence alignment. Proteins 9(1), 56–68 (1991)
Zvelebil, M.J., Barton, G.L., Taylor, W.R., Sternberg, M.J.E.: Prediction of protein secondary structure and active sites using alignment of homologous sequences. Journal of Molecular Biology 195, 957–961 (1987)
Cornette, J.L., Cease, K.B., Margalit, H., Spouge, J.L., Berzofsky, J.A., DeLisi, C.: Hydrophobicity scales and computational techniques for detecting amphipathic structures in proteins. J. Mol. Biol. 195(3), 659–685 (1987)
Kyte, J., Doolittle, R.F.: A simple method for displaying the hydropathic character of a protein. J. Mol. Biol. 157(1), 105–132 (1982)
Hopp, T.P., Woods, K.R.: A computer program for predicting protein antigenic determinants. Mol. Immunol. 20(4), 483–489 (1983)
Rose, G.D., Geselowitz, A.R., Lesser, G.J., Lee, R.H., Zehfus, M.H.: Hydrophobicity of amino acid residues in globular proteins. Science 229(4716), 834–838 (1985)
Bystroff, C., Thorsson, V., Baker, D.: HMMSTR: A hidden Markov model for local sequence-structure correlations in proteins. J. Mol. Biol. 301, 173–190 (2000)
Hu, H., Yi, P.: Improved Secondary Structure Prediction Using Support Vector Machines with a New Encoding Scheme and an Advanced Tertiary Classifier. IEEE – Transaction on Nanobioscience 3(4) (2004)
Gotoh, O.: Multiple Sequence Alignment: Algorithms and Applications. Advances in Biophysics 36(1), 159–206 (1999)
Maetin, J., Gibrat, J.F., Rodolphe: Analysis of an optimal hidden Markov model for secondary structure prediction. BMV Structural Biology 6, 25–44 (2006)
Jain, L.C., Sato-Ilic, M., Virvou, M., Tsihrintzis, G.A., Balas, V.E., Abeynayake, C.: Computational Intelligence Paradigms: Innovative Applications, 282 p (June 12, 2008) ISBN: 3540794735
Wang, L.-H., Liu, J., Li, Y.-F., Zhou, H.-B.: Predicting Protein Secondary Structure by a Support Vector Machine Based on a New Coding Scheme. Genome Informatics 15(2), 181–190 (2004)
Nageswara Rao, P.V., et al.: Protein Secondary Structure Prediction using Pattern Recognition Neural Network. International Journal of Engineering Science and Technology 2(6), 1752–1757 (2010)
Tan, Y.H., Huang, H., Kihara, D.: Statistical potentialbased amino acid similarity matrices for aligning distantly related protein sequences. Proteins: Structure, Function, and Bioinformatics 64(3), 587–600 (2006)
Peng, J., Xu, J.: Low-homology protein threading. Bioinformatics 26(12), i294–i300 (2010)
Wang, G., Dunbrack, J., Roland, L.: PISCES: a protein sequence culling server. Bioinformatics 19(12), 1589–1591 (2003)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer India Pvt. Ltd.
About this paper
Cite this paper
Vinodhini, R., Vijaya, M.S. (2012). Label Sequence Learning Based Protein Secondary Structure Prediction Using Hydrophobicity Scales. In: Deep, K., Nagar, A., Pant, M., Bansal, J. (eds) Proceedings of the International Conference on Soft Computing for Problem Solving (SocProS 2011) December 20-22, 2011. Advances in Intelligent and Soft Computing, vol 131. Springer, New Delhi. https://doi.org/10.1007/978-81-322-0491-6_56
Download citation
DOI: https://doi.org/10.1007/978-81-322-0491-6_56
Publisher Name: Springer, New Delhi
Print ISBN: 978-81-322-0490-9
Online ISBN: 978-81-322-0491-6
eBook Packages: EngineeringEngineering (R0)