Abstract
Mitochondria are essential subcellular organelles found in eukaryotic cells. Knowing information on a protein’s subcellular or sub-subcellular location provides in-depth insights about the microenvironment where it interacts with other molecules and is crucial for inferring the protein’s function. Therefore, it is important to predict the submitochondrial localization of mitochondrial proteins. In this study, we introduced MitoBSKnn, a K-nearest neighbor method based on a bit-score weighted Euclidean distance, which is calculated from an extended version of pseudo-amino acid composition. We then improved the method by applying a heuristic feature selection process. Using the selected features, the final method achieved a 93% overall accuracy on the benchmarking dataset, which is higher than or comparable to other state-of-art methods. On a larger recently curated dataset, the method also achieved a consistent performance of 90% overall accuracy. MitoBSKnn is available at http://edisk.fandm.edu/jing.hu/mitobsknn/mitobsknn.html.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Henze, K., Martin, W.: Evolutionary Biology: Essence of Mitochondria. Nature 426, 127–128 (2003)
McBride, H.M., Neuspiel, M., Wasiak, S.: Mitochondria: More Than Just a Powerhouse. Curr. Biol. 16, R551–R560 (2006)
Gottlieb, R.A.: Programmed cell death. Drug News Perspect 13, 471–476 (2000)
Gardner, A., Boles, R.G.: Is a “Mitochondrial Psychiatry” in the Future? A Review. Curr. Psychiatry Review 1, 255–271 (2005)
Lesnefsky, E.J., Moghaddas, S., Tandler, B., Kerner, J., Hoppel, C.L.: Mitochondrial Dysfunction in Cardiac Disease: Ischemia—Reperfusion, Aging, and Heart failure. J. Mol. Cell Cardiol. 33, 1065–1089 (2001)
Du, P., Li, Y.: Prediction of protein submitochondria locations by hybridizing pseudo-amino acid composition with various physicochemical features of segmented sequence. BMC Bioinformatics 7, 518 (2006)
Nanni, L., Lumini, A.: Genetic Programming for Creating Chou’s Pseudo Amino Acid Based Features for Submitochondria Localization. Amino Acids 34, 653–660 (2008)
Zeng, Y.H., Guo, Y.Z., Xiao, R.Q., Yang, L., Yu, L.Z., Li, M.L.: Using The Augmented Chou’s Pseudo Amino Acid Composition for Predicting Protein Submitochondria Locations Based on Auto Covariance Approach. J. Theor. Biol. 259, 366–372 (2009)
Shi, S.P., Qiu, J.D., Sun, X.Y., Huang, J.H., Huang, S.Y., Suo, S.B., Liang, R.P., Zhang, L.: Identify Submitochondria and Subchloroplast Locations with Pseudo Amino Acid Composition: Approach from the Strategy of Discrete Wavelet Transform Feature Extraction. Biochim. Biophys. Acta 1813, 424–430 (2011)
Zakeri, P., Moshiri, B., Sadeghi, M.: Prediction of Protein Submitochondria Locations Based on Data Fusion of Various Features of Sequences. J. Theor. Biol. 269, 208–216 (2011)
Fan, G.L., Li, Q.Z.: Predicting Protein Submitochondria Locations by Combining Different Descriptors Into the General Form of Chou’s Pseudo Amino Acid Composition. Amino Acids 43, 545–555 (2012)
Lin, H., Chen, W., Yuan, L.F., Li, Z.Q., Ding, H.: Using Over-Represented Tetrapeptides to Predict Protein Submitochondria Locations. Acta Biotheor. 61, 259–268 (2013)
Mei, S.: Multi-kernel Transfer Learning Based on Chou’s PseAAC Formulation for Protein Submitochondria Localization. J. Theor. Biol. 293, 121–130 (2012)
Du, P., Yu, Y.: SubMito-PSPCP: Predicting Protein Submitochondrial Locations by Hybrid-izing Positional Specific Physicochemical Properties with Pseudoamino Acid Compositions. Biomed Res. Int. 2013, 263829 (2013)
Chou, K.C.: Prediction of Protein Cellular Attributes Using Pseudo-Amino Acid Composition. Proteins 43, 246–255 (2001)
Chou, K.C., Cai, Y.D.: Prediction and Classification of Protein Subcellular Location-Sequence-Order Effect and Pseudo Amino Acid Composition. J. Cell Biochem. 90, 1250–1260 (2003)
Li, W., Jaroszewski, L., Godzik, A.: Clustering of Highly Homologous Sequences to Reduce the Size of Large Protein Databases. Bioinformatics 17, 282–283 (2001)
Kawashima, S., Kanehisa, M.: AAindex: Amino Acid Index Database. Nucleic Acids Res. 28, 374 (2000)
Altschul, S.F., Madden, T.L., Schäffer, A.A., Zhang, J., Zhang, Z., Miller, W., Lipman, D.J.: Gapped BLAST and PSI-BLAST: A New Generation of Protein Database Search Programs. Nucleic Acids Res. 25, 3389–3402 (1997)
Pierleoni, A., Martelli, P.L., Fariselli, P., Casadio, R.: BaCelLo: A Balanced Subcellular Localization Predictor. Bioinformatics 22, e408–e416 (2006)
Hu, J., Ng, P.C.: Predicting the Effects of Frameshifting Indels. Genome Biology 13, R9 (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Hu, J., Yan, X. (2014). Predicting Protein Submitochondrial Locations Using a K-Nearest Neighbors Method Based on the Bit-Score Weighted Euclidean Distance. In: Basu, M., Pan, Y., Wang, J. (eds) Bioinformatics Research and Applications. ISBRA 2014. Lecture Notes in Computer Science(), vol 8492. Springer, Cham. https://doi.org/10.1007/978-3-319-08171-7_5
Download citation
DOI: https://doi.org/10.1007/978-3-319-08171-7_5
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-08170-0
Online ISBN: 978-3-319-08171-7
eBook Packages: Computer ScienceComputer Science (R0)