Skip to main content

Part of the book series: Advances in Intelligent and Soft Computing ((AINSC,volume 131))

  • 2906 Accesses

Abstract

Proteins are complex molecules, each comprised of its own combination of twenty different amino acids. Protein secondary structure is a polypeptide that has formed an arrangement of amino acids that are located next to one another in a linear fashion. Protein secondary structure prediction refers to the prediction of the conformational state of each amino acid residue of a protein sequence as one of the three possible states, namely helices, strands, or coils, denoted as H, E, and C, respectively. Protein sequence is the only resource that provides the information to survive denaturing process, so it is essential to find the secondary structure of a protein sequence. The existing methodology uses only one hydrophobicity scale called Kyte-Doolittle whereas in this paper three scales such as, Kyte-Doolittle scale, Hopp-Woods scale and Rose scale are used for protein secondary structure prediction. This Paper formulates secondary structure prediction task as sequence labeling and a new coding scheme is introduced with multiple windows to predict secondary structure of proteins using hydrophobicity scales. Protein sequences with their physical and chemical properties are learned using SVMhmm that creates a learned model, which is then used to predict protein secondary structure of an unknown primary sequence. It is reported 77.11% accuracy based on Q3 measures, when SVMhmm is used.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 259.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 329.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Salamov, A.A., Solovyev, V.V.: Protein secondary structure prediction using local alignments. J. Mol. Biol. 268, 31–36 (1997)

    Article  Google Scholar 

  2. Chou, P.Y., Fasman, G.D.: Prediction of secondary structure of proteins from their amino acid sequence. Advance in Enzymology and Related Areas of Molecular Biology 47, 45–148 (1978)

    Google Scholar 

  3. Asai, K., Hayamizu, K.I., Handa, S.: Prediction of Protein Secondary Structure by the Hidden Markov Model. Computer Application in Biosciences 7, 141–146 (1999)

    Google Scholar 

  4. Weng, B., Xuan, G., Kolodzey, J., Barner, K.E.: Empirical mode decomposition as a tool for DNA sequence analysis from terahertz spectroscopy measurements. In: IEEE International Workshop on Genomic Signal Processing and Statistics, May 28-30, pp. 63–64 (2006)

    Google Scholar 

  5. Qian, N., Sejnowski, T.J.: Predicting the secondary structure of globular proteins using neural network models. Journal of Molecular Biology 202, 865–884 (1988)

    Article  Google Scholar 

  6. Kim, H., Park, H.: Protein secondary structure prediction based on an improved support vector machines approach. Protein Engineering 16, 553–560 (2003)

    Article  Google Scholar 

  7. Wang, J.T.L., Zaki, M.J., Toivonen, H., Shasha, D.E. (eds.): Data mining in bioinformatics, 1st edn., vol. XI, p. 340 (2005), 110 illus

    Google Scholar 

  8. Kabsch, W., Sander, C.: Dictionary of protein secondary structure: Pattern recognition of hydrogen-bonded and geometrical features. Biopolymers 22, 2577–2637 (1983)

    Article  Google Scholar 

  9. Sander, C., Schneider, R.: Database of homology-derived protein structures and the structural meaning of sequence alignment. Proteins 9(1), 56–68 (1991)

    Article  Google Scholar 

  10. Zvelebil, M.J., Barton, G.L., Taylor, W.R., Sternberg, M.J.E.: Prediction of protein secondary structure and active sites using alignment of homologous sequences. Journal of Molecular Biology 195, 957–961 (1987)

    Article  Google Scholar 

  11. Cornette, J.L., Cease, K.B., Margalit, H., Spouge, J.L., Berzofsky, J.A., DeLisi, C.: Hydrophobicity scales and computational techniques for detecting amphipathic structures in proteins. J. Mol. Biol. 195(3), 659–685 (1987)

    Article  Google Scholar 

  12. Kyte, J., Doolittle, R.F.: A simple method for displaying the hydropathic character of a protein. J. Mol. Biol. 157(1), 105–132 (1982)

    Article  Google Scholar 

  13. Hopp, T.P., Woods, K.R.: A computer program for predicting protein antigenic determinants. Mol. Immunol. 20(4), 483–489 (1983)

    Article  Google Scholar 

  14. Rose, G.D., Geselowitz, A.R., Lesser, G.J., Lee, R.H., Zehfus, M.H.: Hydrophobicity of amino acid residues in globular proteins. Science 229(4716), 834–838 (1985)

    Article  Google Scholar 

  15. Bystroff, C., Thorsson, V., Baker, D.: HMMSTR: A hidden Markov model for local sequence-structure correlations in proteins. J. Mol. Biol. 301, 173–190 (2000)

    Article  Google Scholar 

  16. Hu, H., Yi, P.: Improved Secondary Structure Prediction Using Support Vector Machines with a New Encoding Scheme and an Advanced Tertiary Classifier. IEEE – Transaction on Nanobioscience 3(4) (2004)

    Google Scholar 

  17. Gotoh, O.: Multiple Sequence Alignment: Algorithms and Applications. Advances in Biophysics 36(1), 159–206 (1999)

    Article  Google Scholar 

  18. Maetin, J., Gibrat, J.F., Rodolphe: Analysis of an optimal hidden Markov model for secondary structure prediction. BMV Structural Biology 6, 25–44 (2006)

    Article  Google Scholar 

  19. Jain, L.C., Sato-Ilic, M., Virvou, M., Tsihrintzis, G.A., Balas, V.E., Abeynayake, C.: Computational Intelligence Paradigms: Innovative Applications, 282 p (June 12, 2008) ISBN: 3540794735

    Google Scholar 

  20. Wang, L.-H., Liu, J., Li, Y.-F., Zhou, H.-B.: Predicting Protein Secondary Structure by a Support Vector Machine Based on a New Coding Scheme. Genome Informatics 15(2), 181–190 (2004)

    Google Scholar 

  21. Nageswara Rao, P.V., et al.: Protein Secondary Structure Prediction using Pattern Recognition Neural Network. International Journal of Engineering Science and Technology 2(6), 1752–1757 (2010)

    Google Scholar 

  22. Tan, Y.H., Huang, H., Kihara, D.: Statistical potentialbased amino acid similarity matrices for aligning distantly related protein sequences. Proteins: Structure, Function, and Bioinformatics 64(3), 587–600 (2006)

    Article  Google Scholar 

  23. Peng, J., Xu, J.: Low-homology protein threading. Bioinformatics 26(12), i294–i300 (2010)

    Article  Google Scholar 

  24. Wang, G., Dunbrack, J., Roland, L.: PISCES: a protein sequence culling server. Bioinformatics 19(12), 1589–1591 (2003)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to R. Vinodhini .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer India Pvt. Ltd.

About this paper

Cite this paper

Vinodhini, R., Vijaya, M.S. (2012). Label Sequence Learning Based Protein Secondary Structure Prediction Using Hydrophobicity Scales. In: Deep, K., Nagar, A., Pant, M., Bansal, J. (eds) Proceedings of the International Conference on Soft Computing for Problem Solving (SocProS 2011) December 20-22, 2011. Advances in Intelligent and Soft Computing, vol 131. Springer, New Delhi. https://doi.org/10.1007/978-81-322-0491-6_56

Download citation

  • DOI: https://doi.org/10.1007/978-81-322-0491-6_56

  • Publisher Name: Springer, New Delhi

  • Print ISBN: 978-81-322-0490-9

  • Online ISBN: 978-81-322-0491-6

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics