Label Sequence Learning Based Protein Secondary Structure Prediction Using Hydrophobicity Scales

Vinodhini, R.; Vijaya, M. S.

doi:10.1007/978-81-322-0491-6_56


Part of the book series: Advances in Intelligent and Soft Computing (AINSC, volume 131)


Abstract

Proteins are complex molecules, each composed of its own combination of twenty different amino acids. Protein secondary structure is the local arrangement adopted by amino acids that lie next to one another along the linear polypeptide chain. Protein secondary structure prediction assigns the conformational state of each amino acid residue of a protein sequence to one of three possible states, namely helices, strands, or coils, denoted H, E, and C, respectively. The protein sequence is the only source of information that survives the denaturing process, so it is essential to determine secondary structure from the sequence. The existing methodology uses only one hydrophobicity scale, Kyte-Doolittle, whereas this paper uses three: the Kyte-Doolittle, Hopp-Woods, and Rose scales. The paper formulates secondary structure prediction as a sequence labeling task and introduces a new coding scheme with multiple windows to predict the secondary structure of proteins from hydrophobicity scales. Protein sequences, together with their physical and chemical properties, are learned using SVM^hmm, producing a model that is then used to predict the secondary structure of an unknown primary sequence. An accuracy of 77.11% on the Q₃ measure is reported when SVM^hmm is used.
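The abstract does not spell out the coding scheme, so the following is only a minimal sketch of the two ideas it names: turning each residue into a window of hydrophobicity values, and scoring predictions with Q₃. The function names `window_features` and `q3_accuracy`, the zero padding at the sequence ends, and the window size are illustrative assumptions, not the authors' implementation. Only the Kyte-Doolittle scale is shown; the Hopp-Woods and Rose scales would presumably enter as further lookup tables of the same shape.

```python
# Kyte-Doolittle hydropathy index per amino acid (Kyte & Doolittle, 1982).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def window_features(sequence, window=5, scale=KYTE_DOOLITTLE, pad=0.0):
    """Return one feature vector per residue.

    Each vector holds the scale values of the residues in a window
    centred on that residue. Positions beyond the sequence ends are
    filled with `pad` (an assumption made for this sketch).
    """
    if window < 1 or window % 2 == 0:
        raise ValueError("window must be a positive odd number")
    half = window // 2
    values = [scale[aa] for aa in sequence]
    padded = [pad] * half + values + [pad] * half
    return [padded[i:i + window] for i in range(len(sequence))]


def q3_accuracy(predicted, observed):
    """Q3: percentage of residues whose predicted state (H, E or C)
    equals the observed state."""
    if len(predicted) != len(observed) or len(observed) == 0:
        raise ValueError("sequences must be non-empty and of equal length")
    correct = sum(p == o for p, o in zip(predicted, observed))
    return 100.0 * correct / len(observed)


if __name__ == "__main__":
    features = window_features("AIL", window=3)
    print(features)
    print(q3_accuracy("HHEC", "HHCC"))
```

Calling `window_features` several times with different `window` values and concatenating the per-residue vectors is one plausible reading of "multiple windows"; the resulting vectors, paired with H/E/C labels, are the kind of input a sequence labeler such as SVM^hmm would train on.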



Author information

Authors and Affiliations

PSGR Krishnammal College for Women, Coimbatore, 641 004, Tamil Nadu, India
R. Vinodhini
G.R.G School of Applied Computer Technology, Coimbatore, 641 004, Tamil Nadu, India
M. S. Vijaya

Corresponding author

Correspondence to R. Vinodhini.

Editor information

Editors and Affiliations

Department of Mathematics, Indian Institute of Technology Roorkee, Uttarakhand, Roorkee, 247667, India
Kusum Deep
Department of Mathematics and Computer Science, Liverpool Hope University, Hope Park, Liverpool, L16 9JD, United Kingdom
Atulya Nagar
Department of Paper Technology, Indian Institute of Technology Roorkee, Uttarakhand, Roorkee, 247667, India
Millie Pant
ABV-Indian Institute of Information Technology and Management, 109, E-Block, Gwalior, 474010, India
Jagdish Chand Bansal


About this paper

Cite this paper

Vinodhini, R., Vijaya, M.S. (2012). Label Sequence Learning Based Protein Secondary Structure Prediction Using Hydrophobicity Scales. In: Deep, K., Nagar, A., Pant, M., Bansal, J. (eds) Proceedings of the International Conference on Soft Computing for Problem Solving (SocProS 2011) December 20-22, 2011. Advances in Intelligent and Soft Computing, vol 131. Springer, New Delhi. https://doi.org/10.1007/978-81-322-0491-6_56


DOI: https://doi.org/10.1007/978-81-322-0491-6_56
Publisher Name: Springer, New Delhi
Print ISBN: 978-81-322-0490-9
Online ISBN: 978-81-322-0491-6
eBook Packages: Engineering, Engineering (R0)
