skip to main content
10.1145/3018896.3025147acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiccConference Proceedingsconference-collections
research-article

Hidden markov model optimized by PSO algorithm for gene sequence clustering

Published:22 March 2017Publication History

ABSTRACT

Gene sequence modeling and clustering is one of the most important problems in bioinformatics. Hidden Markov Models (HMMs) have been widely used to find similarity between sequences with large and various lengths. In this paper a novel gene sequence clustering method based on HMMs optimized by Particle Swarm Optimization (PSO) algorithm is introduced. In this approach, each gene sequence is described by a specific HMM, and then its probability to generate individual sequence is evaluated for each model. A hierarchical clustering algorithm based on a new definition of a distance measure, has been applied to find the best clusters. Experiments carried out on lung cancer related genes dataset show that the proposed approach can be successfully utilized for gene clustering.

References

  1. Al-Hajj R, Mokbel C, Likforman-Sulem L (2007) Combination of HMM-Based Classifiers for the Recognition of Arabic Handwritten Words. In 9th International Conference on Document Analysis and Recognition: 959--963. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Angeline P.J (1998) Evolutionary Optimization versus Particle Swarm Optimization: Philosophy and Performance Differences. In Evolutionary Programming VII 1447: 601--610. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Banu P.K, Andrews S (2015) Gene Clustering Using Metaheuristic Optimization Algorithms. 14 International Journal of Applied Metaheuristic Computing, 6(4): 14--38. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Bicego M, Murino V, Figueiredo M.A.T (2004) Similarity-based classification of sequences using hidden Markov models. Pattern Recognition Society 37: 2281 -- 2291. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Durbin R, Eddy S.R, Krogh A, Mitchison G (1998) Biological Sequence Analysis. Cambridge University Press.Google ScholarGoogle Scholar
  6. Ferles C, Stafylopatis A (2008) Sequence Clustering with the Self-Organizing Hidden Markov Model Map. In 8th IEEE International Conference on Bioinformatics and Bioengineering: 1--7.Google ScholarGoogle Scholar
  7. Goldberg D.E and Holland J.H (1988) Genetic Algorithms and Machine Learning. Machine Learning 3: 95--99. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Kennedy J and Eberhart R.C (1995) Particle Swarm Optimization. In Processing of the IEEE International Conference on Neural Networks 4: 1942--1948.Google ScholarGoogle ScholarCross RefCross Ref
  9. Krogh A, Brown M, Mian I.S, Sjolander K, Haussler D (1993) Hidden Markov Models in Computational Biology: Application to Protein Modeling. UCSC-CRL-93-32. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Lee K.F (1990) Context-Dependent Phonetic Hidden Markov Models for Speaker-Independent Continuous Speech Recognition. IEEE Transactions on Acoustics, Speech and Signal Processing 38: 599--609.Google ScholarGoogle ScholarCross RefCross Ref
  11. Li C, Biswas G (2000) A Bayesian Approach to Temporal Data Clustering using Hidden Markov Models. In Proceedings of the 17th International Conference on Machine Learning: 543--550. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Mesa A, Basterrech S, Guerberoff G, Alveraz-Valin F (2015) Hidden Markov Models for Gene Sequence Classification. Pattern Analysis and Application: 1--13. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Panuccio A, Bicego M, Murino V (2002) A Hidden Markov Model-Based Approach to Sequential Data Clustering. Structural Syntactic and Statistical Pattern Recognition 2396: 734--743. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Rabiner L.R (1989) A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition. In Proceedings of IEEE 77: 257--286.Google ScholarGoogle ScholarCross RefCross Ref
  15. Rabiner L.R, Lee C.H, Juang B.H, Wilpon J.G (1989) HMM Clustering for Connected Word Recognition. In Proceedings of IEEE ICASSP: 405--408.Google ScholarGoogle ScholarCross RefCross Ref
  16. Theodoridis S and Koutroumbas K (1999) Pattern Recognition. Academic Press. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Vignes M, Forbes F (2009) Gene Clustering via Integrated Markov Models Combining Individual and Pairwise Features. In IEEE/ACM Transactions on Computational Biology and Bioinformatics 6: 260--270. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Xue L, Yin J, Ji Z, Jiang L (2006) A Particle Swarm Optimization for Hidden Markov Model Training. In Proceedings of 8th International Conference on Signal Processing.Google ScholarGoogle Scholar
  19. Zhang Z.Y, Li T, Ding C, Ren X.W, Zhang X.S (2010) Binary matrix factorization for analyzing gene expression data. Data Mining And Knowledge Discovery 20: 28--52. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. http://healthfinder.gov/orgs/HR3150.htm, visited Nov.2011.Google ScholarGoogle Scholar

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image ACM Other conferences
    ICC '17: Proceedings of the Second International Conference on Internet of things, Data and Cloud Computing
    March 2017
    1349 pages
    ISBN:9781450347747
    DOI:10.1145/3018896

    Copyright © 2017 ACM

    © 2017 Association for Computing Machinery. ACM acknowledges that this contribution was authored or co-authored by an employee, contractor or affiliate of a national government. As such, the Government retains a nonexclusive, royalty-free right to publish or reproduce this article, or to allow others to do so, for Government purposes only.

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 22 March 2017

    Permissions

    Request permissions about this article.

    Request Permissions

    Check for updates

    Qualifiers

    • research-article

    Acceptance Rates

    ICC '17 Paper Acceptance Rate213of590submissions,36%Overall Acceptance Rate213of590submissions,36%
  • Article Metrics

    • Downloads (Last 12 months)2
    • Downloads (Last 6 weeks)0

    Other Metrics

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader