Skip to main content

Accurate Prediction of Protein Hot Spots Residues Based on Gentle AdaBoost Algorithm

  • Conference paper
  • First Online:
Intelligent Computing Theories and Application (ICIC 2016)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9771))

Included in the following conference series:

Abstract

Hot spots are critical for protein interactions. Since the experimental method to measure protein hot spots are very cost and time-consuming, computational method is an option to predict hot spots in protein interfaces. In this work, we use Gentle AdaBoost to identify hot spot without feature selection. For all algorithm, ASEdb and BID are used as separate training and test dataset. We extract sequential and structural information of protein, which constitutes 178 dimensional feature vectors. Comparing with other algorithms, our proposed algorithm obtained satisfactory experimental results either on training dataset or test dataset, which yeilds F1 score of 0.79 and 0.69 on training dataset and test dataset, respectively.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Chothia, C., Janin, J.: Principles of protein-protein recognition. Nature 256(5520), 705–708 (1975)

    Article  Google Scholar 

  2. Janin, J.: Principles of protein-protein recognition from structure to thermodynamics. Biochimie 77(7), 497–505 (1995)

    Article  Google Scholar 

  3. Kann, M.G.: Protein interactions and disease: computational approaches to uncover the etiology of diseases. Briefings Bioinform. 8(5), 333–346 (2007)

    Article  Google Scholar 

  4. Moreira, I.S., Fernandes, P.A., Ramos, M.J.: Hot spots—a review of the protein–protein interface determinant amino-acid residues. Proteins: Struct., Funct., Bioinf. 68(4), 803–812 (2007)

    Article  Google Scholar 

  5. Bogan, A.A., Thorn, K.S.: Anatomy of hot spots in protein interfaces. J. Mol. Biol. 280(1), 1–9 (1998)

    Article  Google Scholar 

  6. Wells, J.A.: Systematic mutational analyses of protein-protein interfaces. Methods Enzymol. 202, 390–411 (1991)

    Article  Google Scholar 

  7. Thorn, K.S., Bogan, A.A.: ASEdb: a database of alanine mutations and their effects on the free energy of binding in protein interactions. Bioinformatics 17(3), 284–285 (2001)

    Article  Google Scholar 

  8. Fischer, T., et al.: The binding interface database (BID): a compilation of amino acid hot spots in protein interfaces. Bioinformatics 19(11), 1453–1454 (2003)

    Article  Google Scholar 

  9. Cho, K-i, Kim, D., Lee, D.: A feature-based approach to modeling protein–protein interaction hot spots. Nucleic Acids Res. 37(8), 2672–2687 (2009)

    Article  Google Scholar 

  10. Xia, J.F., Zhao, X.M., Song, J., Huang, D.S.: APIS: accurate prediction of hot spots in protein interfaces by combining protrusion index with solvent accessibility. BMC Bioinformatics 11, 174 (2010)

    Article  Google Scholar 

  11. Zhu, X., Mitchell, J.C.: KFC2: a knowledge-based hot spot prediction method based on interface solvation, atomic density, and plasticity features. Proteins: Struct. Funct. Bioinf. 79(9), 2671–2683 (2011)

    Article  Google Scholar 

  12. Demirkır, C., Sankur, B.: Face detection using look-up table based gentle AdaBoost. In: Kanade, T., Jain, A., Ratha, N.K. (eds.) AVBPA 2005. LNCS, vol. 3546, pp. 339–345. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  13. Friedman, J., Hastie, T., Tibshirani, R.: Additive logistic regression: a statistical view of boosting (with discussion and a rejoinder by the authors). Ann. Stat. 28(2), 337–407 (2000)

    Article  MathSciNet  MATH  Google Scholar 

  14. Tuncbag, N., Gursoy, A., Keskin, O.: Identification of computational hot spots in protein interfaces: combining solvent accessibility and inter-residue potentials improves the accuracy. Bioinformatics 25(12), 1513–1520 (2009)

    Article  Google Scholar 

  15. Breiman, L., et al.: Classification and Regression Trees. CRC Press, Boca Raton (1984)

    MATH  Google Scholar 

  16. Darnell, S.J., Page, D., Mitchell, J.C.: An automated decision-tree approach to predicting protein interaction hot spots. Proteins-Struct. Funct. Bioinf. 68(4), 813–823 (2007)

    Article  Google Scholar 

  17. Keskin, O., et al.: Protein-protein interactions: structurally conserved residues at protein-protein interfaces. Biophys. J. 86(1), 267a (2004)

    Google Scholar 

Download references

Acknowledgments

This work was supported by National Natural Science Foundation of China under grant nos. 61271098 and 61032007, and Provincial Natural Science Research Program of Higher Education Institutions of Anhui Province under grant no. KJ2012A005.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jun Zhang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Sun, Z., Zhang, J., Zheng, CH., Wang, B., Chen, P. (2016). Accurate Prediction of Protein Hot Spots Residues Based on Gentle AdaBoost Algorithm. In: Huang, DS., Bevilacqua, V., Premaratne, P. (eds) Intelligent Computing Theories and Application. ICIC 2016. Lecture Notes in Computer Science(), vol 9771. Springer, Cham. https://doi.org/10.1007/978-3-319-42291-6_74

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-42291-6_74

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-42290-9

  • Online ISBN: 978-3-319-42291-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics