Skip to main content

BDSCyto: An Automated Approach for Identifying Cytokines Based on Best Dimension Searching

  • Conference paper
  • First Online:
PRICAI 2016: Trends in Artificial Intelligence (PRICAI 2016)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9810))

Included in the following conference series:

  • 2563 Accesses

Abstract

We proposed an automated method for distinguishing cytokines from other proteins according to their primary sequences. Two strategies were employed to extract features from protein sequences. The first one is a single method, which includes autocorrelation and pseudo amino acid composition extracted feature methods based on composition and physical–chemical properties of proteins; while the second one is an optimal dimension searching method. Moreover, we developed BDSCyto as a web server to help researchers in classifying protein sequences efficiently and accurately. BDSCyto reduces the processing time and offers high accuracy by a series of efficient methods and multithreading technology based on Spark for large-scale data. Currently, numerous methods exceed 90 % accuracy in cytokine protein prediction, which is better than the existing single methods. BDSCyto is an open-source project and can be freely accessed by the public at http://bdscyto.sinaapp.com/.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Huang, N., Chen, H., Sun, Z.: CTKPred: an SVM-based method for the prediction and classification of the cytokine superfamily. Protein Eng. Des. Selection 18(8), 365–368 (2005)

    Article  Google Scholar 

  2. Zou, Q., et al.: BinMemPredict: a web server and software for predicting membrane protein types. Curr. Proteomics 888(1), 2–9 (2013)

    Article  Google Scholar 

  3. Altschul, S.F., et al.: Basic local alignment search tool. J. Mol. Biol. 215(3), 403–410 (1990)

    Article  Google Scholar 

  4. Pearson, W.R.: Searching protein sequence libraries: comparison of the sensitivity and selectivity of the Smith-Waterman and FASTA algorithms. Genomics 11(3), 635–650 (1991)

    Article  Google Scholar 

  5. Liu, B., et al.: Prediction of protein binding sites in protein structures using hidden Markov support vector machine. BMC Bioinformatics 10(2), 1–14 (2009)

    Google Scholar 

  6. Chen, L., et al.: Hierarchical classification of protein folds using a novel ensemble classifier. PLoS ONE 8(2), e56499 (2013)

    Article  Google Scholar 

  7. Zou, Q., et al.: Identifying multi-functional enzyme with hierarchical multi-label classifier. J. Comput. Theor. Nanosci. 10(4), 1038–1043 (2013)

    Article  Google Scholar 

  8. Zeng, X., et al.: Identification of cytokine via an improved genetic algorithm. Front. Comput. Sci. 9(4), 643–651 (2015)

    Article  Google Scholar 

  9. Dong, Q., Zhou, S., Guan, J.: A new taxonomy-based protein fold recognition approach based on autocross-covariance transformation. Bioinformatics 25(20), 2655–2662 (2009)

    Article  Google Scholar 

  10. Kawashima, S., Kanehisa, M.: AAindex: amino acid index database. Nucleic Acids Res. 28(1), 374 (2000)

    Article  Google Scholar 

  11. Cao, D.-S., Xu, Q.-S., Liang, Y.-Z.: propy: a tool to generate various modes of Chou’s PseAAC. Bioinformatics 29(7), 960–962 (2013)

    Article  Google Scholar 

  12. Lin, H., et al.: iPro54-PseKNC: a sequence-based predictor for identifying sigma-54 promoters in prokaryote with pseudo k-tuple nucleotide composition. Nucleic Acids Res. 42(21), 12961–12972 (2014)

    Article  Google Scholar 

  13. Liu, B., et al.: Pse-in-One: a web server for generating various modes of pseudo components of DNA, RNA, and protein sequences. Nucleic Acids Res. W1, W65–W71 (2015)

    Article  Google Scholar 

  14. Cai, R.C., Zhang, Z.J., Hao, Z.F.: Causal gene identification using combinatorial V-structure search. Neural Networks 43, 63–71 (2013)

    Article  MATH  Google Scholar 

  15. Zou, Q., et al.: A novel features ranking metric with application to scalable visual and bioinformatics data classification. Neurocomputing 173, 346–354 (2016)

    Article  Google Scholar 

  16. Lin, C., et al.: LibD3C: ensemble classifiers with a clustering and dynamic selection strategy. Neurocomputing 123, 424–435 (2014)

    Article  Google Scholar 

  17. Huang, Y., et al.: CD-HIT suite: a web server for clustering and comparing biological sequences. Bioinformatics 26(5), 680–682 (2010)

    Article  Google Scholar 

  18. Zou, Q., et al.: An approach for identifying cytokines based on a novel ensemble classifier. Biomed. Res. Int. 2013(8), 616–617 (2013)

    Google Scholar 

  19. Cai, C.Z., et al.: SVM-Prot: web-based support vector machine software for functional classification of a protein from its primary sequence. Nucleic Acids Res. 31(13), 3692–3697 (2003)

    Article  Google Scholar 

Download references

Acknowledgments

The work was supported by the Natural Science Foundation of China (No. 61370010, 61572384, 61402545).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Quan Zou .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Zou, Q., Wan, S., Han, B., Zhan, Z. (2016). BDSCyto: An Automated Approach for Identifying Cytokines Based on Best Dimension Searching. In: Booth, R., Zhang, ML. (eds) PRICAI 2016: Trends in Artificial Intelligence. PRICAI 2016. Lecture Notes in Computer Science(), vol 9810. Springer, Cham. https://doi.org/10.1007/978-3-319-42911-3_60

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-42911-3_60

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-42910-6

  • Online ISBN: 978-3-319-42911-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics