A new intelligent prediction system model-the compound pyramid model

Yang, BingRu; Qu, Wu; Wang, LiJun; Zhou, Ying

doi:10.1007/s11432-011-4442-1

Research Paper
Published: 25 February 2012

Science China Information Sciences, volume 55, pages 723–736 (2012)

Abstract

A current trend in intelligent systems research is to specialize a general intelligent prediction system into an individualized one for use in a particular field. Protein structure prediction is a challenging, internationally studied problem. In this paper, we propose a new intelligent prediction system model, designed as a multi-layer compound pyramid model, for predicting protein secondary structure. The model comprises four independent intelligent interfaces and several knowledge discovery methods. Domain knowledge is applied throughout the model, and the effective attributes are selected by Causal Cellular Automata. Furthermore, a high-purity structure database is constructed for training. On the RS126 dataset, the overall three-state per-residue accuracy, Q₃, reached 83.99%, while on the CB513 dataset, Q₃ reached 85.58%. On the CASP8 sequences, the results are superior to those produced by other methods, such as Psipred, Jpred, APSSP2 and BehairPred. These results confirm that our method generalizes well and that it provides a model for the construction of other intelligent systems.
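
For readers unfamiliar with the metric, Q₃ is conventionally the percentage of residues whose predicted state among helix (H), strand (E) and coil (C) matches the observed state. The following is a minimal sketch of that calculation, not the authors' evaluation code; the function name `q3_accuracy` and the example strings are illustrative assumptions.

```python
def q3_accuracy(observed: str, predicted: str) -> float:
    """Return the three-state per-residue accuracy (Q3) as a percentage.

    Both arguments are strings over the alphabet H (helix), E (strand),
    C (coil), aligned residue by residue. Illustrative helper only.
    """
    if len(observed) != len(predicted):
        raise ValueError("observed and predicted must have the same length")
    if not observed:
        raise ValueError("sequences must be non-empty")
    allowed = set("HEC")
    if (set(observed) | set(predicted)) - allowed:
        raise ValueError("states must be one of H, E, C")
    # Count positions where the predicted state equals the observed state.
    correct = sum(o == p for o, p in zip(observed, predicted))
    return 100.0 * correct / len(observed)


# Made-up example: 14 residues, 2 of which are mispredicted.
observed = "CCHHHHHCCEEEEC"
predicted = "CCHHHHCCCEEECC"
q3 = q3_accuracy(observed, predicted)
print(f"Q3 = {q3:.2f}%")
```

For a whole dataset such as RS126 or CB513, the same ratio is typically taken over all residues of all chains rather than averaged per chain; which convention the paper uses is not stated in the abstract.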

Author information

Authors and Affiliations

School of Information Engineering, University of Science and Technology Beijing, Beijing, 100083, China
BingRu Yang, Wu Qu, LiJun Wang & Ying Zhou

Corresponding author

Correspondence to BingRu Yang.

About this article

Cite this article

Yang, B., Qu, W., Wang, L. et al. A new intelligent prediction system model-the compound pyramid model. Sci. China Inf. Sci. 55, 723–736 (2012). https://doi.org/10.1007/s11432-011-4442-1

Received: 22 July 2010
Accepted: 20 September 2010
Published: 25 February 2012
Issue Date: March 2012
DOI: https://doi.org/10.1007/s11432-011-4442-1
