Abstract
Protein subcellular location prediction with computational method is still a hot spot in bioinformatics. In this paper, we present a new method to predict protein subcellular location, which based on pseudo amino acid composition and immune genetic algorithm. Hydrophobic patterns of amino acid couples and approximate entropy are introduced to construct pseudo amino acid composition. Immune Genetic algorithm (IGA) is applied to find the fittest weight factors for pseudo amino acid composition, which are crucial in this method. As such, high success rates are obtained by both self-consistency test and jackknife test. More than 80% predictive accuracy is achieved in independent dataset test. The result demonstrates that this new method is practical. And, the method illuminates that the hydrophobic patterns of protein sequence influence its subcellular location.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Nakai, K., Kanehisa, M.: A Knowledge Base for Predicting Protein Localization Sites in Eukaryotic Cells. Genomics 14, 897–911 (1992)
Reinhardt, A., Hubbard, T.: Using Neural Networks for Prediction of the Subcellular Location of Proteins. Nucleic Acids Res. 26, 2230–2236 (1998)
Yuan, Z.: Prediction of Protein Subcellular Locations using Markov Chain Models. FEBS Letter 451, 23–26 (1999)
Park, K.J., Kanehisa, M.: Prediction of Protein Subcellular Locations Support Vector Machines using Compositions Amino Acids and Amino Acid Pairs. Bioinformatics 19, 1656–1663 (2003)
Chou, K.C., Cai, Y.D.: Using Functional Domain Composition and Support Vector Machines for Prediction of Protein Subcellular Location. Journal of biological chemistry 277, 45765–45769 (2002)
Hua, S., Sun, Z.: Support Vector Machine Approach for Protein Subcellular Localization Prediction. Bioinformatics 17, 721–728 (2001)
Cai, Y.D., Liu, X.J., Xu, X.B., Chou, K.C.: Support Vector Machines for Prediction of Protein Subcellular Llocation by Incorporating Quasi-sequence-order Effect. J. Cell Biochemistry 84, 343–348 (2002)
Pan, Y.X., Zhang, Z.Z., Guo, Z.M., Feng, G.Y., Huang, Z.D., He, L.: Application of Pseudo Amino Acid Composition for Predicting Protein Subcellular Location: Stochastic Signal Processing Approach. Journal of Protein Chemistry 22, 395–402 (2003)
Cai, Y.D., Chou, K.C.: Nearest Neighbor Algorithm for Predicting Protein Subcellular Location by Combining Function Domain Composition and Pseudo Amino Acid Composition. Biochem. Biophys. Res. Comm. 305, 407–411 (2003)
Xiao, X., Shao, S.H., Ding, Y.S., Huang, Z.D., Huang, Y.S., Chou, K.C.: Using Complexity Measure Factor to Predict Protein Subcellular Location. Amino Acid 28, 57–61 (2005)
Wang, M., Yang, J., Xu, Z.J., Chou, K.C.: Weight-support Vector Machines for Prediction Membrane Protein Type Based on Pseudo-amino Acid Composition. Protein Engineering Design and Selection 17, 509–516 (2004)
Lim, V.I.: Algorithms for the Prediction of A-helical and 0-structural Regions in Globular Proteins. J. Mol. Biol. 88, 873–894 (1974)
Dill, K.A.: Dominant Forces in Protein Folding. Biochemistry 29, 7133–7155 (1990)
Sadovsky, M.G.: The Method to Compare Nucleotide Sequence based on Minimum Entropy Principle. Bull Math. Biol. 65, 309–322 (2003)
Pincus, S.M.: Approximate Entropy as a Measure of System Complexity. PNAS 88, 2297–2301 (1991)
Chou, K.C., Elrod, D.W.: Protein Subcellular Location Prediction. Protein Eng. 12, 183–190 (1999)
Schiffer, M., Edmundson, A.: Use of Helical Wheels to Represent the Structures of Proteins and to Identify Segments with Helical Potential. Biophys. J. 7, 121–133 (1967)
Rose, G.D., Geselowitz, A.R., Lesser, G.J., Lee, R.H., Zehfus, M.H.: Hydrophobic of Amino Acid Residue in Globular Proteins. Science 229, 834–838 (1985)
Hong, B., Tang, Q.Y., Yang, F.S.: Apen and Cross-ApEn: Property, Fast Algorithm and Preliminary Application to the Study of EEG and Cognition. Signal Process 15, 100–108 (1999)
Chou, K.C.: Prediction of Protein Cellular Attributes using Pseudo-amino-acid Composition. Protein: struct. Funct. Genet. 43, 246–255 (2001)
Zhou, G.P.: An Intriguing Controversy over Protein Structural Class Prediction. J. Protein Chem. 17, 729–738 (1998)
Zhou, G.P., Assa-Munt, N.: Some Insight into Protein Structural Class Prediction. Protein: Structure, Function, and Genetics 50, 44–48 (2001)
Chou, K.C., Zhang, C.T.: Review: Prediction of Protein Structural Classes. Crit. Rev. Biochem. Mol Biol. 30, 275–349 (1995)
Cedano, J., Aloy, P., P’erez-Pons, J.A., Querol, E.: Relation between Amino Acid Composition and Cellular Location of Protein. J. Mol. Biol. 266, 594–600 (1997)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zhang, T., Ding, Y., Shao, S. (2006). Protein Subcellular Location Prediction Based on Pseudo Amino Acid Composition and Immune Genetic Algorithm. In: Huang, DS., Li, K., Irwin, G.W. (eds) Computational Intelligence and Bioinformatics. ICIC 2006. Lecture Notes in Computer Science(), vol 4115. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11816102_57
Download citation
DOI: https://doi.org/10.1007/11816102_57
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-37277-6
Online ISBN: 978-3-540-37282-0
eBook Packages: Computer ScienceComputer Science (R0)