Abstract
Classification of protein sequences is an important method to predict the structure and function of novel protein sequences. The determination of protein function has a very important role in promoting both disease prevention and drug development. With the continuous development of bioinformatics and the large accumulation of related data, the functional prediction of unknown proteins using scientific computational methods has become an important research topic in bioinformatics in the post-genomic era, so the classification algorithm of protein sequences has also become one of the primary tasks of the current life science research. In this paper, we try to use two classical classification algorithms, LetNet-5 and VGG16, to study the classification problem of protein sequences.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Zheng, M., Kahrizi, S.: Protein molecular defect detection method based on a neural network algorithm. Cell Mol. Biol. (Noisy-le-Grand, France) 66(7), 76 (2020). https://doi.org/10.14715/cmb/2020.66.7.13
Cheng, J., Tegge, A.N., Baldi, P.: Machine learning methods for protein structure prediction. IEEE Rev. Biomed. Eng. 1(2008), 41–49 (2008). https://doi.org/10.1109/RBME.2008.2008239
Gharib, T.F., Salah, A., Salem, M.: PSISA: an algorithm for indexing and searching protein structure using suffix arrays. In: Annual Conference on Computers World Scientific and Engineering Academy and Society (WSEAS) (2008)
Wei, Z.: A summary of research and application of deep learning. Int. Core J. Eng. 5(9), 167–169 (2019)
Xiao, Y., et al.: Assessment of differential gene expression in human peripheral nerve injury. BMC Genomics 3(1), 28 (2002)
Gupta, R., et al.: Time-series approach to protein classification problem: WaVe-GPCR: wavelet variant feature for identification and classification of GPCR. Eng. Med. Biol. Mag. IEEE 28(4), 32–37 (2009)
Hasan, M.M., Manavalan, B., Shoombuatong, W., et al.: i6mA-Fuse: improved and robust prediction of DNA 6mA sites in the Rosaceae genome by fusing multiple feature representation. Plant Mol. Biol. 103(1), 225–234 (2020)
Nazari, I., Tahir, M., Tayara, H., et al.: iN6-Methyl (5-step): identifying RNA N6-methyladenosine sites using deep learning mode via Chou’s 5-step rules and Chou’s general PseKNC. Chemomet. Intell. Lab. Syst. 193, 103811 (2019)
Oubounyt, M., Louadi, Z., Tayara, H., et al.: Deep learning models based on distributed feature representations for alternative splicing prediction. IEEE Access PP(99), 1 (2018)
Tahir, M., Hayat, M., Chong, K.T.: A convolution neural network-based computational model to identify the occurrence sites of various RNA modifications by fusing varied features. Chemom. Intell. Lab. Syst. (2021). https://doi.org/10.1016/j.chemolab.2021.104233
Tahir, M., Tayara, H., Hayat, M., et al.: kDeepBind: prediction of RNA-Proteins binding sites using convolution neural network and k-gram features. Chemom. Intell. Lab. Syst. 208(7457) (2021)
Zhang, Y., Hamada, M.: DeepM6ASeq: prediction and characterization of m6A-containing sequences using deep learning. BMC Bioinform. 19, S19 (2018). https://doi.org/10.1186/s12859-018-2516-4
Tahir, M., Hayat, M.: iNuc-STNC: a sequence-based predictor for identification of nucleosome positioning in genomes by extending the concept of SAAC and Chou’s PseAAC. Mol. BioSyst. 12(8), 2587 (2016)
Acknowledgement
This work was supported by the Natural Science Foundation of China (No. 61902337), the fundamental Research Funds for the Central Universities, 2020QN89, Xuzhou science and technology plan project, KC19142, KC21047, Jiangsu Provincial Natural Science Foundation (No. SBK2019040953), Natural Science Fund for Colleges and Universities in Jiangsu Province (No. 19KJB520016) and Young talents of science and technology in Jiangsu. Zheng Tao, Zhen Yang, and Baitong Chen can be treated as the co-first authors.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Tao, Z., Yang, Z., Chen, B., Bao, W., Cheng, H. (2022). Protein Sequence Classification with LetNet-5 and VGG16. In: Huang, DS., Jo, KH., Jing, J., Premaratne, P., Bevilacqua, V., Hussain, A. (eds) Intelligent Computing Theories and Application. ICIC 2022. Lecture Notes in Computer Science, vol 13394. Springer, Cham. https://doi.org/10.1007/978-3-031-13829-4_60
Download citation
DOI: https://doi.org/10.1007/978-3-031-13829-4_60
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-13828-7
Online ISBN: 978-3-031-13829-4
eBook Packages: Computer ScienceComputer Science (R0)