Abstract
It is estimated that 20% of genes in the human genome encode for integral membrane proteins (IMPs) and some estimates are much higher. IMPs control a broad range of events essential to the proper functioning of cells, tissues and organisms and are the most common target of clinically useful drugs [1]. However there is a dearth of high-resolution 3D structural information on the IMPs. Therefore good prediction methods of IMPs structures are to be highly valued. In this paper we apply Conditional Random Fields (CRFs) to build a probabilistic model to solve the membrane protein helix prediction problem. The advantage of CRFs is that it allows seamless and principled integration of biological domain knowledge into the model. Our results show that the CRF model outperforms other well known helix prediction approaches on several important measures.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Chen, C.P., Rost, B.: State-of-the-art in membrane protein prediction. Applied Bioinformatics 1, 21–35 (2002)
Lafferty, J., McCallum, A., Pereira, F.: Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In: Proc. 18th International Conf. on Machine Learning, pp. 282–289. Morgan Kaufmann, San Francisco (2001)
Wallach, H.M.: Conditional random fields: An introduction. Technical Report MS-CIS-04-21, University of Pennsylvania (2004)
Li, S.: Markov random field modeling in computer vision. Springer, New York (1995)
Berger, A.: The improved iterative scaling algorithm: A gentle introduction. Technical report, Carnegie Mellon University (1997)
Buehler, E.C., Ungar, L.H.: Maximum entropy methods for biological sequence modeling. In: BIOKDD, pp. 60–64 (2001)
Sternberg, M.J.: Protein Structure Prediction: A Practical Approach. Oxford University Press, Oxford (1996)
Moller, S., Kriventseva, E.V., Apweiler, R.: A collection of well characterized integral membrane proteins. Bioinformatics 16, 1159–1160 (2000)
Chen, C.P., Kernytsky, A., Rost, B.: Transmembrane helixpredictions revisited. Protein Science 11, 2774–2791 (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lukov, L., Chawla, S., Church, W.B. (2005). Conditional Random Fields for Transmembrane Helix Prediction. In: Ho, T.B., Cheung, D., Liu, H. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2005. Lecture Notes in Computer Science(), vol 3518. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11430919_20
Download citation
DOI: https://doi.org/10.1007/11430919_20
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-26076-9
Online ISBN: 978-3-540-31935-1
eBook Packages: Computer ScienceComputer Science (R0)