Abstract
Based on the concept of coarse-grained description, a new encoding scheme with grouped weight for protein sequence is presented in this paper. By integrating the new scheme with the component-coupled algorithm, the overall prediction accuracy of protein structural class is significantly improved. For the same training dataset consisting of 359 proteins, the overall prediction accuracy achieved by the new method is 7% higher than that based solely on the amino-acid composition for the jackknife test. Especially for α + β the increase of prediction accuracy can achieve 15%. For the jackknife test, the overall prediction accuracy by the proposed scheme can reach 91.09%, which implies that a significant improvement has been achieved by making full use of the information contained in the protein sequence. Furthermore, the experimental analysis shows that the improvement depends on the size of the training dataset and the number of groups.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Anfinsen, C.B.: Principles that govern the folding of protein chains. Science 181, 223–230 (1973)
Levitt, M., Chothia, C.: Structure patterns in globular proteins. Nature 262, 552–557 (1976)
Chou, K.C., Zhang, C.T.: Prediction of protein structural classes. Crit. Rev. Biochem. Mol. Biol. 30, 275–349 (1995)
Nakashima, H., Nishikawa, K., Ooi, T.: The folding type of a protein is relevant to the amino acid composition. J. Biochem. 99, 152–162 (1986)
Chou, K.C., Maggiora, G.M.: Domain structural class prediction. Protein Engineering 11, 523–538 (1998)
Bu, W.S., Feng, Z.P., Zhang, Z.D., Zhang, C.T.: Prediction of protein (domain) structural classes based on amino-acid index. Eur. J. Biochem. 266, 1043–1049 (1999)
Li, X.Q., Luo, L.F.: The definition and recognition of protein structural class. Progress in Biochemistry and Biophysics 29, 124–127 (2002)
Li, X.Q., Luo, L.F.: The recognition of protein structural class. Progress in Biochemistry and Biophysics 29, 938–941 (2002)
Wang, Z.X., Yuan, Z.: How good is prediction of protein structural class by the component-coupled method? Proteins 38, 165–175 (2000)
Cai, Y.D., Liu, X.J., Xu, X.B., Zhou, G.P.: Support Vector Machines for predicting protein structural class. BMC Bioinformatics 2, 3 (2001)
Luo, R.Y., Feng, Z.P., Liu, J.K.: Prediction of protein structural class by amino acid and ploypeptide composition. Eur. J. Biochem. 269, 4219–4225 (2002)
He, P.A., Wang, J.: Numerical characterization of DNA primary sequence. Internet Electronic Journal of Molecular Design 1, 668–674 (2002)
Lin, J.C., Yang, K.C.: Biochemistry, pp. 6–7. Liaoning science and technology press, Shenyang (1996)
Baldi, P., Brunak, S., Chauvin, Y., Andersen, C.A., Nielsen, H.: Assessing the accuracy of prediction algorithms for classification: an overview. Bioinformatics 16, 412–424 (2000)
Loredana, L.C., Steven, E.B., Tim, J.P.H., Cyrus, C., Alexey, G.M.: SCOP dataset in 2002: refinements accommodate structural genomics. Nucleic Acids Research 30, 264–267 (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zhang, ZH., Wang, ZH., Wang, YX. (2005). A New Encoding Scheme to Improve the Performance of Protein Structural Class Prediction. In: Wang, L., Chen, K., Ong, Y.S. (eds) Advances in Natural Computation. ICNC 2005. Lecture Notes in Computer Science, vol 3611. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11539117_157
Download citation
DOI: https://doi.org/10.1007/11539117_157
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28325-6
Online ISBN: 978-3-540-31858-3
eBook Packages: Computer ScienceComputer Science (R0)