Skip to main content

Predicting Protein Submitochondrial Locations Using a K-Nearest Neighbors Method Based on the Bit-Score Weighted Euclidean Distance

  • Conference paper
Book cover Bioinformatics Research and Applications (ISBRA 2014)

Part of the book series: Lecture Notes in Computer Science ((LNBI,volume 8492))

Included in the following conference series:

  • 2178 Accesses

Abstract

Mitochondria are essential subcellular organelles found in eukaryotic cells. Knowing information on a protein’s subcellular or sub-subcellular location provides in-depth insights about the microenvironment where it interacts with other molecules and is crucial for inferring the protein’s function. Therefore, it is important to predict the submitochondrial localization of mitochondrial proteins. In this study, we introduced MitoBSKnn, a K-nearest neighbor method based on a bit-score weighted Euclidean distance, which is calculated from an extended version of pseudo-amino acid composition. We then improved the method by applying a heuristic feature selection process. Using the selected features, the final method achieved a 93% overall accuracy on the benchmarking dataset, which is higher than or comparable to other state-of-art methods. On a larger recently curated dataset, the method also achieved a consistent performance of 90% overall accuracy. MitoBSKnn is available at http://edisk.fandm.edu/jing.hu/mitobsknn/mitobsknn.html.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Henze, K., Martin, W.: Evolutionary Biology: Essence of Mitochondria. Nature 426, 127–128 (2003)

    Article  Google Scholar 

  2. McBride, H.M., Neuspiel, M., Wasiak, S.: Mitochondria: More Than Just a Powerhouse. Curr. Biol. 16, R551–R560 (2006)

    Google Scholar 

  3. Gottlieb, R.A.: Programmed cell death. Drug News Perspect 13, 471–476 (2000)

    Google Scholar 

  4. Gardner, A., Boles, R.G.: Is a “Mitochondrial Psychiatry” in the Future? A Review. Curr. Psychiatry Review 1, 255–271 (2005)

    Article  Google Scholar 

  5. Lesnefsky, E.J., Moghaddas, S., Tandler, B., Kerner, J., Hoppel, C.L.: Mitochondrial Dysfunction in Cardiac Disease: Ischemia—Reperfusion, Aging, and Heart failure. J. Mol. Cell Cardiol. 33, 1065–1089 (2001)

    Article  Google Scholar 

  6. Du, P., Li, Y.: Prediction of protein submitochondria locations by hybridizing pseudo-amino acid composition with various physicochemical features of segmented sequence. BMC Bioinformatics 7, 518 (2006)

    Article  Google Scholar 

  7. Nanni, L., Lumini, A.: Genetic Programming for Creating Chou’s Pseudo Amino Acid Based Features for Submitochondria Localization. Amino Acids 34, 653–660 (2008)

    Article  Google Scholar 

  8. Zeng, Y.H., Guo, Y.Z., Xiao, R.Q., Yang, L., Yu, L.Z., Li, M.L.: Using The Augmented Chou’s Pseudo Amino Acid Composition for Predicting Protein Submitochondria Locations Based on Auto Covariance Approach. J. Theor. Biol. 259, 366–372 (2009)

    Article  Google Scholar 

  9. Shi, S.P., Qiu, J.D., Sun, X.Y., Huang, J.H., Huang, S.Y., Suo, S.B., Liang, R.P., Zhang, L.: Identify Submitochondria and Subchloroplast Locations with Pseudo Amino Acid Composition: Approach from the Strategy of Discrete Wavelet Transform Feature Extraction. Biochim. Biophys. Acta 1813, 424–430 (2011)

    Article  Google Scholar 

  10. Zakeri, P., Moshiri, B., Sadeghi, M.: Prediction of Protein Submitochondria Locations Based on Data Fusion of Various Features of Sequences. J. Theor. Biol. 269, 208–216 (2011)

    Article  Google Scholar 

  11. Fan, G.L., Li, Q.Z.: Predicting Protein Submitochondria Locations by Combining Different Descriptors Into the General Form of Chou’s Pseudo Amino Acid Composition. Amino Acids 43, 545–555 (2012)

    Article  Google Scholar 

  12. Lin, H., Chen, W., Yuan, L.F., Li, Z.Q., Ding, H.: Using Over-Represented Tetrapeptides to Predict Protein Submitochondria Locations. Acta Biotheor. 61, 259–268 (2013)

    Article  Google Scholar 

  13. Mei, S.: Multi-kernel Transfer Learning Based on Chou’s PseAAC Formulation for Protein Submitochondria Localization. J. Theor. Biol. 293, 121–130 (2012)

    Article  Google Scholar 

  14. Du, P., Yu, Y.: SubMito-PSPCP: Predicting Protein Submitochondrial Locations by Hybrid-izing Positional Specific Physicochemical Properties with Pseudoamino Acid Compositions. Biomed Res. Int. 2013, 263829 (2013)

    Google Scholar 

  15. Chou, K.C.: Prediction of Protein Cellular Attributes Using Pseudo-Amino Acid Composition. Proteins 43, 246–255 (2001)

    Article  Google Scholar 

  16. Chou, K.C., Cai, Y.D.: Prediction and Classification of Protein Subcellular Location-Sequence-Order Effect and Pseudo Amino Acid Composition. J. Cell Biochem. 90, 1250–1260 (2003)

    Article  Google Scholar 

  17. Li, W., Jaroszewski, L., Godzik, A.: Clustering of Highly Homologous Sequences to Reduce the Size of Large Protein Databases. Bioinformatics 17, 282–283 (2001)

    Article  Google Scholar 

  18. Kawashima, S., Kanehisa, M.: AAindex: Amino Acid Index Database. Nucleic Acids Res. 28, 374 (2000)

    Article  Google Scholar 

  19. Altschul, S.F., Madden, T.L., Schäffer, A.A., Zhang, J., Zhang, Z., Miller, W., Lipman, D.J.: Gapped BLAST and PSI-BLAST: A New Generation of Protein Database Search Programs. Nucleic Acids Res. 25, 3389–3402 (1997)

    Article  Google Scholar 

  20. Pierleoni, A., Martelli, P.L., Fariselli, P., Casadio, R.: BaCelLo: A Balanced Subcellular Localization Predictor. Bioinformatics 22, e408–e416 (2006)

    Google Scholar 

  21. Hu, J., Ng, P.C.: Predicting the Effects of Frameshifting Indels. Genome Biology 13, R9 (2008)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Hu, J., Yan, X. (2014). Predicting Protein Submitochondrial Locations Using a K-Nearest Neighbors Method Based on the Bit-Score Weighted Euclidean Distance. In: Basu, M., Pan, Y., Wang, J. (eds) Bioinformatics Research and Applications. ISBRA 2014. Lecture Notes in Computer Science(), vol 8492. Springer, Cham. https://doi.org/10.1007/978-3-319-08171-7_5

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-08171-7_5

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-08170-0

  • Online ISBN: 978-3-319-08171-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics