Abstract
Artificial neural networks have been applied, as solutions, in several different problems within the field of bioinformatics. Similarly, pattern discovery algorithms have also been used to uncover hidden motifs in protein sequences, which have further contributed to understanding the problem of classification of different protein sequences. G-protein coupled receptors (GPCRs) represent one of the largest protein families in the Human Genome. Most of these receptors are major targets for drug discovery and development; therefore, they are of interest to the pharmaceutical industry. The technique used in this article combines both: neural network and pattern discovery methods to develop a protein prediction procedure in relation to its functional class, more specifically, to predict the GPCR protein. Vilo [2] proposed an algorithm to extract patterns of regular expressions from known GPCR protein sequences. Our contribution in this article is to combine these patterns as features for a neural network. We select patterns through the PCA (Principal Component Analysis) procedure and produce a learning machine for the prediction of the GPCR super class.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Moller, S., Vilo, J., Croning, D.R.: Prediction of coupling specificity of G protein coupled receptors to their G proteins. Bioinformatics 17, S174–S181 (2001)
Vilo, J.: Discovering Frequent Patterns from Strings. Department of Computer Science, University of Helsinki (1998)
Wang, J.T.L., Shapiro, B.A., Shasha, D.: Pattern Discovery in Biomolecular Data: Tools, Techniques, and Applications. Oxford University Press, USA (1999)
Cao, J., Panetta, R., Yue, S., Steyaert, A., Young-Bellido, M., Ahmad, S.: A naive Bayes model to predict coupling between seven transmembrane domain receptors and G-proteins. Bioinformatics 19, 234–240 (2003)
Sgourakis, N.G., Bagos, P.G., Hamodrakas, S.J.: Prediction of the coupling specificity of GPCRs to four families of G-proteins using hidden Markov models and artificial neural networks. Bioinformatics 21, 4101–4106 (2005)
Sreekumar, K.R., Huang, Y., Pausch, M.H., Gulukota, K.: Predicting GPCR–G-protein coupling using hidden Markov models. Bioinformatics 20, 3490–3499 (2004)
Baldi, P., Brunak, S.: Bioinformatics: The Machine Learning Approach. The MIT Press, Cambrigde (1998)
Press, W.H., Teukolsky, S.A., Vetterling, W.T., Flannery, B.P.: Numerical Recipies in C++: The Art of Scientific Computing. Cambridge University Press (2002)
Haykin, S.: Neural Networks: A Comprehensive Foundation. Prentice Hall (1998)
Riedmiller, M., Braun, H.: A direct adaptive method for faster backpropagation learning: the Rprop algorithm. In: Proceedings of ICNN, San Francisco (1993)
Fawcett, T.: ROC Graphs: Notes and Practical Considerations for Researchers. HP Laboratories (March 16, 2004)
Karchin, R., Karplus, K., Haussler, D.: Classifying G-protein coupled receptors with support vector machines. Bioinformatics 18, 147–159 (2002)
Nascimento, F., Ren, T.I., Cavalcanti, G.D.C.: A SVM for GPCR Protein Prediction Using Pattern Discovery. In: Proceedings of HIS (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ren, T.I., Calvalcanti, G.D.C., Nascimento Junior, F., Espadas, G. (2012). A Neural Network Based Approach for GPCR Protein Prediction Using Pattern Discovery. In: Yin, H., Costa, J.A.F., Barreto, G. (eds) Intelligent Data Engineering and Automated Learning - IDEAL 2012. IDEAL 2012. Lecture Notes in Computer Science, vol 7435. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32639-4_56
Download citation
DOI: https://doi.org/10.1007/978-3-642-32639-4_56
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-32638-7
Online ISBN: 978-3-642-32639-4
eBook Packages: Computer ScienceComputer Science (R0)