skip to main content
10.1145/3412841.3442067acmconferencesArticle/Chapter ViewAbstractPublication PagessacConference Proceedingsconference-collections
poster

Protein secondary structure prediction based on fusion of machine learning classifiers

Published: 22 April 2021 Publication History

Abstract

Protein secondary structure prediction plays an important role in protein folding and function classification. Although the works available in the literature present good results, protein secondary structure prediction is still an open problem. In this work, we present and discuss a fusion strategy using four different classifiers. The fusion is composed of bidirectional recurrent networks, random forests, Inception-v4 blocks and Inception recurrent networks. In order to evaluate our model, we used CB6133 dataset as training and testing. The fusion achieved 76.4% of Q8 accuracy using the amino acid sequence and similarity information on CB6133, surpassing state-of-the-art approaches.

References

[1]
Alessio Ceroni, Paolo Frasconi, and Gianluca Pollastri. 2005. Learning protein secondary structure from sequential and relational data. Neural Networks 18, 8 (2005), 1029--1039.
[2]
Kyunghyun Cho, Bart Van Merriënboer, Caglar Gulcehre, Dzmitry Bahdanau, Fethi Bougares, Holger Schwenk, and Yoshua Bengio. 2014. Learning phrase representations using RNN encoder-decoder for statistical machine translation. Computing Research Repository abs/1406.1078 (2014), 1--15.
[3]
Peter Chou and Gerald Fasman. 1974. Prediction of protein conformation. Biochemistry 13, 2 (1974), 222--245.
[4]
Iddo Drori, Isht Dwivedi, Pranav Shrestha, Jeffrey Wan, Yueqi Wang, Yunchu He, Anthony Mazza, Hugh Krogh-Freeman, Dimitri Leggas, Kendal Sandridge, Linyong Nan, Kaveri Thakoor, Chinmay Joshi, Sonam Goenka, Chen Keasar, and Itsik Peer. 2018. High quality prediction of protein Q8 secondary structure by diverse neural network architectures. Computing Research Repository abs/1811.07143 (2018), 1--10.
[5]
Yanbu Guo, Weihua Li, Bingyi Wang, Huiqing Liu, and Dongming Zhou. 2019. DeepACLSTM: deep asymmetric convolutional long short-term memory neural models for protein secondary structure prediction. BMC Bioinformatics 20, 1 (2019), 341.
[6]
Yanbu Guo, Bingyi Wang, Weihua Li, and Bei Yang. 2018. Protein secondary structure prediction improved by recurrent neural networks integrated with two-dimensional convolutional neural networks. Journal of Bioinformatics and Computational Biology 16, 5 (2018), 1850021.
[7]
Leandro Takeshi Hattori, Cesar Manuel Vargas Benitez, and Heitor Silvério Lopes. 2017. A deep bidirectional long short-term memory approach applied to the protein secondary structure prediction problem. In 4th Latin American Conference on Computational Intelligence (LA-CCI). IEEE, Arequipa, Arequipa, Peru, 1--6.
[8]
John Henry Holland. 1992. Adaptation in natural and artificial systems: an introductory analysis with applications to biology, control, and artificial intelligence. MIT press, London, England.
[9]
Alexander Rosenberg Johansen, Casper Kaae Sønderby, Søren Kaae Sønderby, and Ole Winther. 2017. Deep recurrent conditional random field network for protein secondary prediction. In 8th Conference on Bioinformatics, Computational Biology, and Health Informatics (ACM BCB). ACM, Boston, MA, US, 73--78.
[10]
David T Jones. 1999. Protein secondary structure prediction based on position-specific scoring matrices. Journal of Molecular Biology 292, 2 (1999), 195--202.
[11]
Yihui Liu, Jinyong Cheng, Yuming Ma, and Yehong Chen. 2017. Protein secondary structure prediction based on two dimensional deep convolutional neural networks. In 3rd International Conference on Computer and Communications (ICCC). IEEE, Chengdu, China, 1995--1999.
[12]
Gabriel Bianchin de Oliveira, Helio Pedrini, and Zanoni Dias. 2020. Ensemble of bidirectional recurrent networks and random forests for protein secondary structure prediction. In 27th International Conference on Systems, Signals and Image Processing (IWSSIP). IEEE, Rio de Janeiro, RJ, Brazil, 311--316.
[13]
Gabriel Bianchin de Oliveira, Helio Pedrini, and Zanoni Dias. 2020. Fusion of BLAST and Ensemble of Classifiers for Protein Secondary Structure Prediction. In 33rd Conference on Graphics, Patterns and Images (SIBGRAPI). IEEE, Porto de Galinhas, PE, Brazil, 308--315.
[14]
Aminur Rab Ratul, Marcel Turcotte, M Hamed Mozaffari, and WonSook Lee. 2019. Prediction of 8-state protein secondary structures by 1D-Inception and BD-LSTM. bioRxiv 1, 1 (2019), 1--14.
[15]
Andrew W Senior, Richard Evans, John Jumper, James Kirkpatrick, Laurent Sifre, Tim Green, Chongli Qin, Augustin Žídek, Alexander WR Nelson, Alex Bridgland, Hugo Penedones, Stig Petersen, Karen Simonyan, Steve Crossan, Pushmeet Kohli, David T. Jones, David Silver, Koray Kavukcuoglu, and Demis Hassabis. 2020. Improved protein structure prediction using potentials from deep learning. Nature 577 (2020), 1--5.
[16]
Søren Kaae Sønderby and Ole Winther. 2014. Protein secondary structure prediction with long short term memory networks. Computing Research Repository 1412.7828 (2014), 1--6.
[17]
Christian Szegedy, Sergey Ioffe, Vincent Vanhoucke, and Alex Alemi. 2016. Inception-v4, inception-resnet and the impact of residual connections on learning. Computing Research Repository abs/1602.07261 (2016), 1--12.
[18]
Yuedong Yang, Jianzhao Gao, Jihua Wang, Rhys Heffernan, Jack Hanson, Kuldip Paliwal, and Yaoqi Zhou. 2018. Sixty-five years of the long march in protein secondary structure prediction: the final stretch? Briefings in Bioinformatics 19, 3 (2018), 482--494.
[19]
Buzhong Zhang, Jinyan Li, and Qiang Lü. 2018. Prediction of 8-state protein secondary structures by a novel deep learning architecture. BMC Bioinformatics 19, 1 (2018), 293.
[20]
Jian Zhou and Olga Troyanskaya. 2014. Deep Supervised and Convolutional Generative Stochastic Network for Protein Secondary Structure Prediction. In 31st International Conference on Machine Learning (ICML). PMLR, New York, NY, USA, 745--753.
[21]
Jiyun Zhou, Hongpeng Wang, Zhishan Zhao, Ruifeng Xu, and Qin Lu. 2018. CNNH_PSS: protein 8-class secondary structure prediction by convolutional neural network with highway. BMC Bioinformatics 19, 4 (2018), 60.
[22]
Zhun Zhou, Bingru Yang, and Wei Hou. 2010. Association classification algorithm based on structure sequence in protein secondary structure prediction. Expert Systems with Applications 37, 9 (2010), 6381--6389.

Cited By

View all
  • (2021)Ensemble of Template-Free and Template-Based Classifiers for Protein Secondary Structure PredictionInternational Journal of Molecular Sciences10.3390/ijms22211144922:21(11449)Online publication date: 23-Oct-2021

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
SAC '21: Proceedings of the 36th Annual ACM Symposium on Applied Computing
March 2021
2075 pages
ISBN:9781450381048
DOI:10.1145/3412841
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 22 April 2021

Check for updates

Author Tags

  1. fusion classifiers
  2. machine learning
  3. neural networks
  4. protein secondary structure prediction

Qualifiers

  • Poster

Funding Sources

  • CNPq
  • FAPESP

Conference

SAC '21
Sponsor:
SAC '21: The 36th ACM/SIGAPP Symposium on Applied Computing
March 22 - 26, 2021
Virtual Event, Republic of Korea

Acceptance Rates

Overall Acceptance Rate 1,650 of 6,669 submissions, 25%

Upcoming Conference

SAC '25
The 40th ACM/SIGAPP Symposium on Applied Computing
March 31 - April 4, 2025
Catania , Italy

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)3
  • Downloads (Last 6 weeks)1
Reflects downloads up to 05 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2021)Ensemble of Template-Free and Template-Based Classifiers for Protein Secondary Structure PredictionInternational Journal of Molecular Sciences10.3390/ijms22211144922:21(11449)Online publication date: 23-Oct-2021

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media