Skip to main content

Predicting Subcellular Localization of Proteins Using Support Vector Machine with N-Terminal Amino Composition

  • Conference paper
Book cover Advanced Data Mining and Applications (ADMA 2005)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3584))

Included in the following conference series:

Abstract

Prediction of protein subcellular localization is one of the hot research topics in bioinformatics. In this paper, several support vector machines (SVM) with a new presented coding scheme method based on N-terminal amino compositions are first trained to discriminate between proteins destined for the mitochondrion, the chloroplast, the secretory pathway, and ‘other’ localizations. Then a decision unit is used to make the final prediction based on several SVMs’ outputs. Tested on redundancy-reduced sets, the proposed method reached 89.6 % (plant) and 91.9% (non-plant) total accuracies, which, to the best of our knowledge, are the highest ever reported using the same data sets.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Jensen, L.J., Gupta, R., Blom, N., et al.: Prediction of human protein function from post-translational modifications and localization features. J. Mol. Biol. 319, 1257–1265 (2002)

    Article  Google Scholar 

  2. Chou, K.C., Elrod, D.: Protein subcellular location prediction. Protein Eng. 12, 107–118 (1999)

    Article  Google Scholar 

  3. Reinhardt, A., Hubbard, T.: Using neural networks for prediction of the subcellular location of proteins. Nucleic Acids Res. 26, 2230–2236 (1998)

    Article  Google Scholar 

  4. Hua, S., Sun, Z.: Support vector machine approach for protein subcellular localization prediction. Bioinformatics 17, 721–728 (2001)

    Article  Google Scholar 

  5. Nakashima, H., Nishikawa, K.: Discrimination of intracellular and extracellular proteins using amino acid composition and residue-pair frequencies. J. Mol. Biol. 238, 54–61 (1994)

    Article  Google Scholar 

  6. Nakai, K.: Protein sorting signals and prediction of subcellular localization. Adv. Protein Chem. 54, 277–344 (2000)

    Article  Google Scholar 

  7. Emanuelsson, O., Nielsen, H., Brunak, S., et al.: Predicting subcellular localization of proteins based on their N-terminal amino acid sequence. J. Mol. Biol. 300, 1005–1016 (2000)

    Article  Google Scholar 

  8. Nielsen, H., Engelbrecht, J., Brunak, S., et al.: Identification of prokaryotic and eukaryotic signal peptides and prediction of their cleavage sites. Protein Eng. 10, 1–6 (1997)

    Article  Google Scholar 

  9. Vapnik, V.: The Nature of Statistical Learning Theory. Springer, New York (1995)

    MATH  Google Scholar 

  10. Emanuelsson, O.: Predicting protein subcellular localisation from amino acid sequence information. Briefings in Bioinformatics 3, 361–376 (2002)

    Article  Google Scholar 

  11. Westion, J., Watkins, C.: Multi-class support vector machines. In: Verleysen, M. (ed.) Proceedings of ESANN 1999, Brussels. D. Facto Press (1999)

    Google Scholar 

  12. Hsu, C.-W., Lin, C.-J.: A comparison of methods for multi-class support vector machines. IEEE Transactions on Neural Networks 13, 415–425 (2002)

    Article  Google Scholar 

  13. Matthews, B.W.: Comparison of predicted and observed secondary structure of T4 phage lysozyme. Biochim. Biophys. Acta 405, 442–451 (1975)

    Google Scholar 

  14. Rost, B., Sander, C.: Prediction of secondary structure at better than 70% accuracy. J. Mol. Biol. 232, 584–599 (1993)

    Article  Google Scholar 

  15. Bhasin, M., Raghava, G.P.S.: ESLpred: SVM Based Method for Subcellular Localization of Eukaryotic Proteins using Dipeptide Composition and PSI-BLAST. Nucleic Acids Reasearch 32, W383 - W389 (2004)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Li, Yf., Liu, J. (2005). Predicting Subcellular Localization of Proteins Using Support Vector Machine with N-Terminal Amino Composition. In: Li, X., Wang, S., Dong, Z.Y. (eds) Advanced Data Mining and Applications. ADMA 2005. Lecture Notes in Computer Science(), vol 3584. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11527503_73

Download citation

  • DOI: https://doi.org/10.1007/11527503_73

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-27894-8

  • Online ISBN: 978-3-540-31877-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics