Abstract
We proposed an automated method for distinguishing cytokines from other proteins according to their primary sequences. Two strategies were employed to extract features from protein sequences. The first one is a single method, which includes autocorrelation and pseudo amino acid composition extracted feature methods based on composition and physical–chemical properties of proteins; while the second one is an optimal dimension searching method. Moreover, we developed BDSCyto as a web server to help researchers in classifying protein sequences efficiently and accurately. BDSCyto reduces the processing time and offers high accuracy by a series of efficient methods and multithreading technology based on Spark for large-scale data. Currently, numerous methods exceed 90 % accuracy in cytokine protein prediction, which is better than the existing single methods. BDSCyto is an open-source project and can be freely accessed by the public at http://bdscyto.sinaapp.com/.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Huang, N., Chen, H., Sun, Z.: CTKPred: an SVM-based method for the prediction and classification of the cytokine superfamily. Protein Eng. Des. Selection 18(8), 365–368 (2005)
Zou, Q., et al.: BinMemPredict: a web server and software for predicting membrane protein types. Curr. Proteomics 888(1), 2–9 (2013)
Altschul, S.F., et al.: Basic local alignment search tool. J. Mol. Biol. 215(3), 403–410 (1990)
Pearson, W.R.: Searching protein sequence libraries: comparison of the sensitivity and selectivity of the Smith-Waterman and FASTA algorithms. Genomics 11(3), 635–650 (1991)
Liu, B., et al.: Prediction of protein binding sites in protein structures using hidden Markov support vector machine. BMC Bioinformatics 10(2), 1–14 (2009)
Chen, L., et al.: Hierarchical classification of protein folds using a novel ensemble classifier. PLoS ONE 8(2), e56499 (2013)
Zou, Q., et al.: Identifying multi-functional enzyme with hierarchical multi-label classifier. J. Comput. Theor. Nanosci. 10(4), 1038–1043 (2013)
Zeng, X., et al.: Identification of cytokine via an improved genetic algorithm. Front. Comput. Sci. 9(4), 643–651 (2015)
Dong, Q., Zhou, S., Guan, J.: A new taxonomy-based protein fold recognition approach based on autocross-covariance transformation. Bioinformatics 25(20), 2655–2662 (2009)
Kawashima, S., Kanehisa, M.: AAindex: amino acid index database. Nucleic Acids Res. 28(1), 374 (2000)
Cao, D.-S., Xu, Q.-S., Liang, Y.-Z.: propy: a tool to generate various modes of Chou’s PseAAC. Bioinformatics 29(7), 960–962 (2013)
Lin, H., et al.: iPro54-PseKNC: a sequence-based predictor for identifying sigma-54 promoters in prokaryote with pseudo k-tuple nucleotide composition. Nucleic Acids Res. 42(21), 12961–12972 (2014)
Liu, B., et al.: Pse-in-One: a web server for generating various modes of pseudo components of DNA, RNA, and protein sequences. Nucleic Acids Res. W1, W65–W71 (2015)
Cai, R.C., Zhang, Z.J., Hao, Z.F.: Causal gene identification using combinatorial V-structure search. Neural Networks 43, 63–71 (2013)
Zou, Q., et al.: A novel features ranking metric with application to scalable visual and bioinformatics data classification. Neurocomputing 173, 346–354 (2016)
Lin, C., et al.: LibD3C: ensemble classifiers with a clustering and dynamic selection strategy. Neurocomputing 123, 424–435 (2014)
Huang, Y., et al.: CD-HIT suite: a web server for clustering and comparing biological sequences. Bioinformatics 26(5), 680–682 (2010)
Zou, Q., et al.: An approach for identifying cytokines based on a novel ensemble classifier. Biomed. Res. Int. 2013(8), 616–617 (2013)
Cai, C.Z., et al.: SVM-Prot: web-based support vector machine software for functional classification of a protein from its primary sequence. Nucleic Acids Res. 31(13), 3692–3697 (2003)
Acknowledgments
The work was supported by the Natural Science Foundation of China (No. 61370010, 61572384, 61402545).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Zou, Q., Wan, S., Han, B., Zhan, Z. (2016). BDSCyto: An Automated Approach for Identifying Cytokines Based on Best Dimension Searching. In: Booth, R., Zhang, ML. (eds) PRICAI 2016: Trends in Artificial Intelligence. PRICAI 2016. Lecture Notes in Computer Science(), vol 9810. Springer, Cham. https://doi.org/10.1007/978-3-319-42911-3_60
Download citation
DOI: https://doi.org/10.1007/978-3-319-42911-3_60
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-42910-6
Online ISBN: 978-3-319-42911-3
eBook Packages: Computer ScienceComputer Science (R0)