skip to main content
10.1145/2557977.2558005acmconferencesArticle/Chapter ViewAbstractPublication PagesicuimcConference Proceedingsconference-collections
research-article

Identification of bacillus species using support vector machine and codon pair relative frequency

Published: 09 January 2014 Publication History

Abstract

In this paper, we proposed new approach to identify Bacillus species by using a new feature -- codon pair relative frequency -- and support vector machine (SVM). Our problem is how to use the information from some genes of specie to identify what kind of the specie is. This problem can be applied to not only research the evolutionary process but also predict the kind of specie for damaged samples. First gene database of sixteen Bacillus species is collected from National Center for Biotechnology Information (NCBI) website. Then, we extract codon pair relative frequency feature of each gene for each species. Finally, SVM "one-against-rest" method is applied to train these feature vectors. By using the proposed method we gained good results in identification for our database.

References

[1]
Smith, T. F., Waterman, M. S., 1981. Identification of common molecular subsequences. J. Mol. Biol. 147, 195--197.
[2]
Altschul, S. F., Gish, W., Miller, W., Myers, E. W., Lipman, D. J., 1990. Basic local alignment search tool. J. Mol. Biol. 215, 403--410.
[3]
Lipman, D. J., Pearson, W. R., 1985. Rapid and sensitive protein similarity searches. Science 227, 1435--1441.
[4]
Pearson, W. R., Lipman, D. J., 1988. Improved tools for biological sequence comparison. Proc. Natl. Acad. Sci. U. S. A. 85, 2444--2448.
[5]
Gina, M. C., Adrian, S., 2012. Codon Evolution Mechanisms and Models. Chapter 13, Oxford University Press, New York, ISBN 978-0-19-960116-5.
[6]
Raab, D., Graf, M., Notka, F., Schodl, T., Wagner, R., 2010. The GeneOptimizer Algorithm: using a sliding window approach to cope with the vast sequence space in multiparameter DNA sequence optimization. Syst. Synth. Biol. 4, 215--225.
[7]
Sharp, P. M., Li, W. H., 1987. The codon Adaptation Index--a measure of directional synonymous codon usage bias, and its potential applications. Nucleic Acids Res. 15, 1281--1295.
[8]
Moura, G., Pinheiro, M., Arrais, J., Gomes, A. C., Carreto, L., Freitas, A., Oliveira, J. L., Santos, M. A. S., 2007. Large Scale Comparative Codon-Pair Context Analysis Unveils General Rules that Fine-Tune Evolution of mRNA Primary Structure. PLoS ONE 2(9), e847.
[9]
Ma, J., Nguyen, M. N., Pang, G. W. L., Rajapakse, J. C. Gene classification using codon usage patterns and SVMs. Proceedings of 2005 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology (CIBCB 2005), La Jolla, USA, pp. 435--442, Nov 14 -- 15, 2005.
[10]
Ma, J., Minh N. Nguyen, and Jagath C. Rajapakse. 2009. Gene Classification Using Codon Usage and Support Vector Machines. IEEE/ACM Trans. Comput. Biol. Bioinformatics 6, 1 (January 2009), 134--143. DOI= http://dx.doi.org/10.1109/TCBB.2007.70240.
[11]
Hsu, Chih-Wei and Lin, Chih-Jen. 2002. A comparison of methods for multiclass support vector machines. Trans. Neur. Netw. 13, 2 (March 2002), 415--425. DOI= http://dx.doi.org/10.1109/72.991427.

Cited By

View all

Index Terms

  1. Identification of bacillus species using support vector machine and codon pair relative frequency

          Recommendations

          Comments

          Information & Contributors

          Information

          Published In

          cover image ACM Conferences
          ICUIMC '14: Proceedings of the 8th International Conference on Ubiquitous Information Management and Communication
          January 2014
          757 pages
          ISBN:9781450326445
          DOI:10.1145/2557977
          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Sponsors

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          Published: 09 January 2014

          Permissions

          Request permissions for this article.

          Check for updates

          Author Tags

          1. SVM
          2. codon pair
          3. gene classifies
          4. gene feature
          5. identification of bacillus species

          Qualifiers

          • Research-article

          Conference

          ICUIMC '14
          Sponsor:

          Acceptance Rates

          ICUIMC '14 Paper Acceptance Rate 116 of 407 submissions, 29%;
          Overall Acceptance Rate 251 of 941 submissions, 27%

          Contributors

          Other Metrics

          Bibliometrics & Citations

          Bibliometrics

          Article Metrics

          • 0
            Total Citations
          • 63
            Total Downloads
          • Downloads (Last 12 months)2
          • Downloads (Last 6 weeks)0
          Reflects downloads up to 13 Jan 2025

          Other Metrics

          Citations

          Cited By

          View all

          View Options

          Login options

          View options

          PDF

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader

          Media

          Figures

          Other

          Tables

          Share

          Share

          Share this Publication link

          Share on social media