Abstract
This paper addresses the classification task of data mining (a form of supervised learning) in the context of an important bioinformatics problem, namely the prediction of protein functions. This problem is cast as a hierarchical classification problem, where the protein functions to be predicted correspond to classes that are arranged in a hierarchical structure, in the form of a class tree. The main contribution of this paper is to propose a new Artificial Immune System that creates a new representation for proteins, in order to maximize the predictive accuracy of a hierarchical classification algorithm applied to the corresponding protein function prediction problem.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Andrews, P.: opt-aiNet source code in Java, last modified October 2005 (Personal communication, 10 July 2007)
Andrews, P.S., Timmis, J.: On Diversity and Artificial Immune Systems: Incorporating a Diversity Operator into aiNet. In: Apolloni, B., Marinaro, M., Nicosia, G., Tagliaferri, R. (eds.) WIRN 2005 and NAIS 2005. LNCS, vol. 3931, pp. 293–306. Springer, Heidelberg (2006)
Bissantz, C.: Conformational changes of G protein-coupled receptors during their activation by agonist binding. J. Recept. Signal. Transduct. Res. 23, 123–153 (2003)
Brownlee, J. WEKA Classification Algorithms. Version 1.6. (retrieved December 2006), http://sourceforge.net/projects/wekaclassalgos (2006)
Chothia, C., Finkelstein, A.V.: The Classification and Origins of Protein Folding Patterns. Annual Review of Biochemistry 59, 1007–1035 (1990)
Christopoulos, A., Kenakin, T.G.: Protein-coupled receptor allosterism and complexing. Pharmacology Review 54, 323–374 (2002)
Cui, J., Han, L.Y., Li, H., Ung, C.Y., Tang, Z.Q., Zheng, C.J., Cao, Z.W., Chen, Y.Z.: Computer prediction of allergen proteins from sequence-derived protein structural and physicochemical properties. Mollecular Immunology 44, 514–520 (2007)
Davies, M.N., Secker, A., Freitas, A.A., Mendao, M., Timmis, J., Flower, D.R.: On the hierarchical classification of G Protein-Coupled Receptors. Bioinformatics 23(23), 3113–3118 (2007)
de Castro, L., Von Zuben, F.: Learning and Optimization Using the Clonal Selection Principle. IEEE Transactions on Evolutionary Computation, Special Issue on Artificial Immune Systems 6(3), 239–251 (2001)
de Castro, L.N., Timmis, J.: An artificial immune network for multimodal optimisation. In: Congress on Evolutionary Computation (CEC 2002). Part of the 2002 IEEE World Congress on Computational Intelligence, Honolulu, Hawaii, USA, pp. 699–704. IEEE, Los Alamitos (2002)
de Castro, L.N., Timmis, J.: Artificial Immune Systems: A New Computational Intelligence Approach. Springer, Heidelberg (2002)
Gether, U., Asmar, F., Meinild, A.K., Rasmussen, S.G.: Structural basis for activation of G-protein-coupled receptors. Pharmacological Toxicology 91, 304–312 (2002)
Klabunde, T., Hessler, G.: Drug Design Strategies for Targeting G-Protein Coupled Receptors. Chem. Bio. Chem. 3, 928–944 (2002)
Secker, A., Davies, M.N., Freitas, A.A., Timmis, J., Mendao, M., Flower, D.R.: An Experimental Comparison of Classification Algorithms for the Hierarchical Prediction of Protein Function. Expert Update (Magazine of the British Computer Society’s Specialist Group on AI), Special Issue on the 3rd UK KDD (Knowledge Discovery and Data Mining) Symposium 9(3), 17–22 (2007)
Secker, A., Davies, M.N., Freitas, A.A., Timmis, J., Mendao, M., Flower, D.R.: An Experimental Comparison of Classification Algorithms for the Hierarchical Prediction of Protein Function. In: 3rd UK Data mining and Knowledge Discovery Symposium (UKKDD 2007), Canterbury, pp. 13–18 (2007)
Timmis, J., Edmonds, C.: A Comment on opt-AINet: An Immune Network Algorithm for Optimisation. In: Deb, K., et al. (eds.) GECCO 2004. LNCS, vol. 3102, pp. 308–317. Springer, Heidelberg (2004)
Witten, I.H., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques, 2nd edn. Morgan Kaufmann, San Francisco (2005)
Zhang, Z.H., Tammi, M.T., Zhang, G.L., Tong, J.C.: Prediction of protein allergenicity using local description of amino acid sequence (unpublished) (2005)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Secker, A., Davies, M.N., Freitas, A.A., Timmis, J., Clark, E., Flower, D.R. (2008). An Artificial Immune System for Evolving Amino Acid Clusters Tailored to Protein Function Prediction. In: Bentley, P.J., Lee, D., Jung, S. (eds) Artificial Immune Systems. ICARIS 2008. Lecture Notes in Computer Science, vol 5132. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-85072-4_22
Download citation
DOI: https://doi.org/10.1007/978-3-540-85072-4_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-85071-7
Online ISBN: 978-3-540-85072-4
eBook Packages: Computer ScienceComputer Science (R0)