Skip to main content

A New Encoding Scheme to Improve the Performance of Protein Structural Class Prediction

  • Conference paper
Advances in Natural Computation (ICNC 2005)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 3611))

Included in the following conference series:

Abstract

Based on the concept of coarse-grained description, a new encoding scheme with grouped weight for protein sequence is presented in this paper. By integrating the new scheme with the component-coupled algorithm, the overall prediction accuracy of protein structural class is significantly improved. For the same training dataset consisting of 359 proteins, the overall prediction accuracy achieved by the new method is 7% higher than that based solely on the amino-acid composition for the jackknife test. Especially for α + β the increase of prediction accuracy can achieve 15%. For the jackknife test, the overall prediction accuracy by the proposed scheme can reach 91.09%, which implies that a significant improvement has been achieved by making full use of the information contained in the protein sequence. Furthermore, the experimental analysis shows that the improvement depends on the size of the training dataset and the number of groups.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 119.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Anfinsen, C.B.: Principles that govern the folding of protein chains. Science 181, 223–230 (1973)

    Article  Google Scholar 

  2. Levitt, M., Chothia, C.: Structure patterns in globular proteins. Nature 262, 552–557 (1976)

    Article  Google Scholar 

  3. Chou, K.C., Zhang, C.T.: Prediction of protein structural classes. Crit. Rev. Biochem. Mol. Biol. 30, 275–349 (1995)

    Article  Google Scholar 

  4. Nakashima, H., Nishikawa, K., Ooi, T.: The folding type of a protein is relevant to the amino acid composition. J. Biochem. 99, 152–162 (1986)

    Google Scholar 

  5. Chou, K.C., Maggiora, G.M.: Domain structural class prediction. Protein Engineering 11, 523–538 (1998)

    Article  Google Scholar 

  6. Bu, W.S., Feng, Z.P., Zhang, Z.D., Zhang, C.T.: Prediction of protein (domain) structural classes based on amino-acid index. Eur. J. Biochem. 266, 1043–1049 (1999)

    Article  Google Scholar 

  7. Li, X.Q., Luo, L.F.: The definition and recognition of protein structural class. Progress in Biochemistry and Biophysics 29, 124–127 (2002)

    Google Scholar 

  8. Li, X.Q., Luo, L.F.: The recognition of protein structural class. Progress in Biochemistry and Biophysics 29, 938–941 (2002)

    Google Scholar 

  9. Wang, Z.X., Yuan, Z.: How good is prediction of protein structural class by the component-coupled method? Proteins 38, 165–175 (2000)

    Article  Google Scholar 

  10. Cai, Y.D., Liu, X.J., Xu, X.B., Zhou, G.P.: Support Vector Machines for predicting protein structural class. BMC Bioinformatics 2, 3 (2001)

    Article  Google Scholar 

  11. Luo, R.Y., Feng, Z.P., Liu, J.K.: Prediction of protein structural class by amino acid and ploypeptide composition. Eur. J. Biochem. 269, 4219–4225 (2002)

    Article  Google Scholar 

  12. He, P.A., Wang, J.: Numerical characterization of DNA primary sequence. Internet Electronic Journal of Molecular Design 1, 668–674 (2002)

    Google Scholar 

  13. Lin, J.C., Yang, K.C.: Biochemistry, pp. 6–7. Liaoning science and technology press, Shenyang (1996)

    Google Scholar 

  14. Baldi, P., Brunak, S., Chauvin, Y., Andersen, C.A., Nielsen, H.: Assessing the accuracy of prediction algorithms for classification: an overview. Bioinformatics 16, 412–424 (2000)

    Article  Google Scholar 

  15. Loredana, L.C., Steven, E.B., Tim, J.P.H., Cyrus, C., Alexey, G.M.: SCOP dataset in 2002: refinements accommodate structural genomics. Nucleic Acids Research 30, 264–267 (2002)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Zhang, ZH., Wang, ZH., Wang, YX. (2005). A New Encoding Scheme to Improve the Performance of Protein Structural Class Prediction. In: Wang, L., Chen, K., Ong, Y.S. (eds) Advances in Natural Computation. ICNC 2005. Lecture Notes in Computer Science, vol 3611. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11539117_157

Download citation

  • DOI: https://doi.org/10.1007/11539117_157

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-28325-6

  • Online ISBN: 978-3-540-31858-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics