Abstract
In biological problems such as protein sequence family identification and profile building, the additivity assumption of probability measures is poorly suited to modeling HMM-based profiles because homologous sequences of the same family are highly interdependent. Fuzzy measure theory, an extension of classical additive measure theory, is obtained by replacing the additivity requirement of classical measures with the weaker properties of monotonicity, continuity and semi-continuity. The strong correlations and sequence preferences involved in protein structures make models based on fuzzy measures suitable candidates for building profiles of a given family, since fuzzy measures can handle uncertainty better than classical methods. In this paper we investigate different fuzzy measures (S-decomposable, λ and belief measures) for building profile models of protein sequences. The proposed fuzzy measure models have been tested on the globin and kinase families. The results establish the superiority of fuzzy measure theory over classical probability measures for biological sequence problems.
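To make the contrast with additivity concrete, the following is a minimal sketch of a Sugeno λ-measure, one of the three families named above. It uses the standard textbook definition, in which disjoint sets combine as g(A ∪ B) = g(A) + g(B) + λ·g(A)·g(B) and λ > -1 is fixed by requiring the whole set to have measure 1. It is not the authors' implementation: the function names `solve_lambda` and `lambda_measure` and the density values are hypothetical, chosen only for illustration.

```python
from math import prod


def solve_lambda(densities, tol=1e-12):
    """Return the Sugeno lambda (> -1) solving prod(1 + lam*g_i) = 1 + lam.

    Assumes at least two densities, each strictly between 0 and 1.
    """
    if len(densities) < 2 or not all(0.0 < g < 1.0 for g in densities):
        raise ValueError("need at least two densities in (0, 1)")

    total = sum(densities)
    if abs(total - 1.0) < 1e-6:
        return 0.0  # densities already sum to 1: the measure is additive

    def f(lam):
        return prod(1.0 + lam * g for g in densities) - (1.0 + lam)

    if total > 1.0:
        lo, hi = -1.0 + 1e-9, -1e-9  # the nonzero root lies in (-1, 0)
    else:
        lo, hi = 1e-9, 1.0  # the nonzero root lies in (0, inf)
        while f(hi) <= 0.0:
            hi *= 2.0

    # Plain bisection on the bracketing interval.
    f_lo = f(lo)
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        f_mid = f(mid)
        if (f_mid > 0.0) == (f_lo > 0.0):
            lo, f_lo = mid, f_mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def lambda_measure(subset_densities, lam):
    """Measure of a subset, given the densities of its elements."""
    if lam == 0.0:
        return sum(subset_densities)
    return (prod(1.0 + lam * g for g in subset_densities) - 1.0) / lam


# Illustrative densities only (not taken from the paper).
densities = [0.2, 0.3, 0.1]
lam = solve_lambda(densities)
pair = lambda_measure(densities[:2], lam)
print(f"lambda = {lam:.4f}, g(pair) = {pair:.4f}, g(all) = {lambda_measure(densities, lam):.4f}")
```

Because these densities sum to less than 1, λ comes out positive and the measure of a pair exceeds the sum of its parts, which is one way a non-additive measure can express interaction between elements that a probability measure would treat as independent contributions.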
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Bidargaddi, N.P., Chetty, M., Kamruzzaman, J. (2005). Evaluation of Fuzzy Measures in Profile Hidden Markov Models for Protein Sequences. In: Oliveira, J.L., Maojo, V., Martín-Sánchez, F., Pereira, A.S. (eds) Biological and Medical Data Analysis. ISBMDA 2005. Lecture Notes in Computer Science, vol 3745. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11573067_36
DOI: https://doi.org/10.1007/11573067_36
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29674-4
Online ISBN: 978-3-540-31658-9
eBook Packages: Computer Science (R0)