Introduction
Proteins are complex structures synthesised by living organisms. They are actually a fundamental type of molecules and can perform a large number of functions in cell biology. Proteins can assume catalytic roles and accelerate or inhibit chemical reactions in our body. They can assume roles of transportation of smaller molecules, storage, movement, mechanical support, immunity and control of cell growth and differentiation [25]. All of these functions rely on the 3D-structure of the protein. The process of going from a linear sequence of amino acids, that together compose a protein, to the protein’s 3D shape is named protein folding. Anfinsen’s work [29] has proven that primary structure determines the way protein folds. Protein folding is so important that whenever it does not occur correctly it may produce diseases such as Alzheimer’s, Bovine Spongiform Encephalopathy (BSE), usually known as mad cows disease, Creutzfeldt-Jakob (CJD) disease, a Amyotrophic Lateral Sclerosis (ALS), Huntingtons syndrome, Parkinson disease, and other diseases related to cancer.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Salamov, A.A., Solovyev, V.V.: Prediction of protein structure by combining nearest-neighbor algorithms and multiple sequence alignments. J.Mol Biol 247, 11–15 (1995)
Aha, D., Kibler, D.: Instance-based learning algorithms. Machine Learning 6, 37–66 (1991)
Rost, B.: Phd: predicting 1d protein structure by profile based neural networks. Meth.in Enzym. 266, 525–539 (1996)
Blader, M., Zhang, X., Matthews, B.: Structural basis of aminoacid alpha helix propensity. Science 11, 1637–1640 (1993), http://www.ncbi.nlm.nih.gov/pubmed/850300
Breiman, L.: Random forests. Machine Learning 45(2), 5–32 (2001)
Breiman, L., Friedman, J.H., Olshen, R.A., Stone, C.J.: Classification and Regression Trees. Wadsworth International Group, Belmont (1984)
Frishman., D., Argos, P.: Seventy-five percent accuracy in protein secondary structure prediction. Proteins 27, 329–335 (1997)
Jones, T.D.: Protein secondary structure prediction based on position-specific scoring matrices. Journal of Molecular Biology 292, 195–202 (1999)
Kneller, D., Cohen, F.E., Langridge, R.: Improvements in protein secondary structure prediction by an enhanced neural network. Journal of Molecular Biology 216, 441–457 (1990)
Fonseca, N., Camacho, R., aes, A.M.: A study on amino acid pairing at the n- and c-termini of helical segments in proteins. PROTEINS: Structure, Function, and Bioinformatics 70(1), 188–196 (2008)
Freund, Y., Mason, L.: The alternating decision tree learning algorithm. In: Proceeding of the Sixteenth International Conference on Machine Learning, Bled, Slovenia, pp. 124–133 (1999)
Wang, G., Dunbrack Jr., R.L.: Pisces: a protein sequence culling server. Bioinformatics 19, 1589–1591 (2003)
Gama, J.: Functional trees. Machine Learning 55(3), 219–250 (2004)
Cuff, J.A., Clamp, M.E., Siddiqui, A.S., Finlay, M., Barton, J.G., Sternberg, M.J.E.: Jpred: a consensus secondary structure prediction server. J. Bioinformatics 14(10), 892–893 (1998)
John, G.H., Langley, P.: Estimating continuous distributions in bayesian classifiers. In: Eleventh Conference on Uncertainty in Artificial Intelligence, pp. 338–345. Morgan Kaufmann, San Mateo (1995)
Richardson, J., Richardson, D.C.: Amino acid preferences for specific locations at the ends of α-helices. Science 240, 1648–1652 (1988)
King, R., Sternberg, M.: A machine learning approach for the protein secondary structure. Journal of Molecular Biology 214, 171–182 (1990)
King, R., Sternberg, M.: Identification and application of the concepts important for accurate and reliable protein secondary structure prediction. Protein Sci. 5, 2298–2310 (1996)
Krittanai, C., Johnson, W.C.: The relative order of helical propensity of amino acids changes with solvent environment. Proteins: Structure, Function, and Genetics 39(2), 132–141 (2000)
Landwehr, N., Hall, M., Frank, E.: Logistic model trees. Machine Learning 95(1-2), 161–205 (2005)
Muggleton, S. (ed.): Inductive Logic Programming. Academic Press, London (1992)
Muggleton, S., Feng, C.: Efficient induction of logic programs. In: Proceedings of the First Conference on Algorithmic Learning Theory, Ohmsha, Tokyo (1990)
Qian, N., Sejnowski, T.J.: Predicting the secondary structure of globular proteins using neural network models. Journal of Molecular Biology 202, 865–884 (1988)
Petsko, G.A., Petsko, G.A.: Protein Stucture and Function (Primers in Biology). New Science Press Ltd. (2007)
Pietzsch, J.: The importance of protein folding. Horizon Symposia (2009), http://www.nature.com/horizon/proteinfolding/background/importance.html
Chou, P.Y., Fasman, G.D.: Prediction of secondary structure of proteins from their amino acid sequence. Advances in Enzymology and Related Areas of Molecular Biology 47, 45–148 (1978)
Quinlan, R.: C4.5: Programs for Machine Learning. Morgan Kaufmann Publishers, San Mateo (1993)
Saraiva, L., Lopes, L.: Universidade Nova de Lisboa, Instituto de Tecnologia Química e Biológica (2007), http://www.cienciaviva.pt/docs/cozinha12.pdf
Sela, M., White, F.H., Anfinsen, C.B.: Reductive cleavage of disulfide bridges in ribonuclease. Science 125, 691–692 (1957)
Sternberg, M., Lewis, R., King, R., Muggleton, S.: Modelling the structure and function of enzymes by machine learning. Proceedings of the Royal Society of Chemistry: Faraday Discussions 93, 269–280 (1992)
Witten, I.H., Frank, E.: Data Mining: Practical machine learning tools and techniques, 2nd edn. Morgan Kaufmann, San Francisco (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Camacho, R. et al. (2010). Predicting the Start of Protein α-Helices Using Machine Learning Algorithms. In: Rocha, M.P., Riverola, F.F., Shatkay, H., Corchado, J.M. (eds) Advances in Bioinformatics. Advances in Intelligent and Soft Computing, vol 74. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-13214-8_5
Download citation
DOI: https://doi.org/10.1007/978-3-642-13214-8_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-13213-1
Online ISBN: 978-3-642-13214-8
eBook Packages: EngineeringEngineering (R0)