Abstract
It is widely recognized that the information for determining the final subcellular localization of proteins is found in their amino acid sequences. In this work we present new features extracted from the full length protein sequence to incorporate more biological information. Features are based on the occurrence frequency of di-peptides - traditional, higher order. Naïve Bayes classification along with correlation-based feature selection method is proposed to predict the subcellular location of apoptosis protein sequences. Our system makes predictions with an accuracy of 83% using Naïve Bayes classification alone and 86% using Naïve Bayes classification with correlation-based feature selection. This result shows that the new feature vector is promising, and helps in increasing the prediction accuracy.
Keywords
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Potten, C., Apoptosis, J.W.: The Life and Death of Cells (Developmental & Cell Biology Series)
Emanuelsson, O., Nielsen, H., Brunak, S., Gunnar, H.: Predicting Subcellular Localization of Proteins Based on Their N-Terminal Amino Acid Sequence. J. Molecular Biology. 300, 1005–1006 (2000)
Chou, K.C.: A New Branch of Proteomics Prediction of Protein Cellular Attributes. Gene Cloning and Expression Technologies 4, 57–70 (2002)
Huang, J., Shi, F.: Support Vector Machines for Predicting Apoptosis Proteins Types. Acta Bioinformatics 53, 39–47 (2005)
Zhou, G.P., Doctor, K.: Subcelluar Location of Apoptosis Proteins. Proteins: Structure, Function, and Genetic 50, 44–48 (2003)
Chou, K.C.: Prediction of Protein Cellular Attributes using Pseudoamino Acid Composition. Proteins: Structure. Functions. Genetics. 43(3), 246–255 (2001)
Chou, K.C., Cai, Y.D.: Using Functional Domain Composition and Support Vector Machines for Prediction of Protein Subcellular Location. J. Bio. Chem 227(48), 45765–45769 (2002)
Chou, K.C., Cai, Y.D.: Predicting Subcellular Localization of Proteins by Hybridizing Functional Domain Composition and Pseudo-amino acid Composition. J. Cell Biochem. 91(3), 1197–1203 (2004)
Feng, Z.P.: Prediction of the Subcellular Location of Prokaryotic Proteins based on a New Representation of the Amino acid Composition. Biopolymers 58(4) (2001)
Cherian, B.S., Nair, A.S.: Protein Location Prediction using Atomic Composition of the Amino acid Sequence. Biochemical and Biophysical Research Communications 391, 1670–1674 (2010)
Zhang, L., Liao, B., Li, D., Zhu, W.: A Novel Representation for Apoptosis Protein Subcellular Localization Prediction Using Support Vector Machine. Journal of Theoretical Biology 259, 361–365 (2009)
Kumar Kandaswamy, K., Pugalenthi, G., Moller, S.: Prediction of Apoptosis Protein Locations with Genetic Algorithms and Support Vector Machines Through a New Mode of Pseudo Amino Acid Composition. Protein Peptide Letters 17(12) (2010)
Ding, Y.S., Zhang, T.L.: Using Chou’s Pseudo Amino Acid Composition to Predict Subcellular Localization of Apoptosis Proteins: An Approach with Immune Genetic Algorithm Based Ensemble Classifier. Pattern Recognition Letters 29, 1887–1892 (2008)
Hall, M.A., Holmes, G.: Benchmarking Attribute Selection Techniques for Discrete Class Data Mining. IEEE Transactions on Knowledge and Data Engineering 15, 1–16 (2003)
Ding, Y., Cai, Y., Zhang, G., Xu, W.: The Influence of Dipeptide Composition on Protein Thermostability. FEBS Letters 569, 284–288 (2004)
Song, C., Shi, F.: Prediction of Subcellular Localization of Apoptosis Proteins by Dipeptide Composition. JDCTA: International Journal of Digital Content Technology and its Applications 4(1.4), 32–36 (2010), doi:10.4156/jdcta
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Govindan, G., Nair, A.S. (2011). New Feature Vector for Apoptosis Protein Subcellular Localization Prediction. In: Abraham, A., Lloret Mauri, J., Buford, J.F., Suzuki, J., Thampi, S.M. (eds) Advances in Computing and Communications. ACC 2011. Communications in Computer and Information Science, vol 190. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-22709-7_30
Download citation
DOI: https://doi.org/10.1007/978-3-642-22709-7_30
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-22708-0
Online ISBN: 978-3-642-22709-7
eBook Packages: Computer ScienceComputer Science (R0)