Skip to main content

Evaluation and Fuzzy Classification of Gene Finding Programs on Human Genome Sequences

  • Conference paper
Fuzzy Systems and Knowledge Discovery (FSKD 2005)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3614))

Included in the following conference series:

Abstract

This paper presents an evaluation of the four of the more common gene finding programs. The evaluation was conducted on a new data set consisting of only human genome sequences extracted from GenBank. Newest sequences were used to avoid overlap with the training sets of the gene-finding programs. The results of this evaluation are then used to classify the gene finding programs using fuzzy logic. The programs are classified into three fuzzy sets of high, mediocre and low accuracy. The results are then presented in the form of words so as to be easily understood by humans.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Allen, J.E., Pertea, M., Salzberg, S.L.: Computational gene prediction using multiple sources of evidence. Genome Research 14(1) (2004)

    Google Scholar 

  2. Baxevanis, A.: Predictive Methods using DNA and Protein Sequences Current Topics in Genome Analysis, National Institute of Health, US (2000) (last accessed February 2005), Available online at http://www.genome.gov/Pages/Hyperion/COURSE2000/Pdf/baxevanis_lec1.pdf

  3. Burset, M., Guigo, R.: Evaluation of gene structure prediction programs. Genomics 34, 353–367 (1996)

    Article  Google Scholar 

  4. Dong, S., Searls, D.B.: Gene structure prediction by lin- guistic methods. Genomics 23, 540–551 (1994)

    Article  Google Scholar 

  5. Driese, K.L.: A Vegetation Map for the Catskill Park, NY, Derived from Multi-temporal Landsat Imagery and GIS Data Northeastern Naturalist (2004)

    Google Scholar 

  6. Duret, L., Mouchiroud, D., Gautier, C.: Statistical analysis of vertebrate sequences reveals that long genes are scarce in GC-rich isochores. J. Mol. Evol. 40, 308–317 (1995)

    Article  Google Scholar 

  7. Guigo, et al.: An Assessment of Gene Prediction Accuracy in Large DNA Sequences Genome Research.  10(10) (2000)

    Google Scholar 

  8. Guigo, et al.: Prediction of gene structure. J. Mol. Biol. 226, 141–157 (1992)

    Article  Google Scholar 

  9. Haussler, D.: Computational Genefinding. Trends in Biochemical Sciences, Supplementary Guide to Bioinformatics, 12–15 (1998)

    Google Scholar 

  10. Li, W.: Computational Gene Recognition Programs (2004) (last accessed November 2004), Available at http://www.nslij-genetics.org/gene/programs.html

  11. Rogic, S.: Evaluating and Improving the Accuracy of Computational Gene Finding on Mammalian DNA Sequences, MSc. thesis, University of British Columbia, Canada (2000) (last accessed November 2004), Available online at http://www.cs.ubc.ca/~rogic/MSCthesisWeb.pdf

  12. Roubos, J.A., Setnes, M., Abonyi, J.: Learning Fuzzy Classification Rules From Data Developments in Soft Computing, pp. 108–115. Springer, Heidelberg (2001)

    Google Scholar 

  13. Snyder, E.E., Stormo, G.D.: Identification of coding regions in genomic DNA sequences: An application of dynamic programming and neural networks. Nucleic Acids Res. 21, 607–613 (1993)

    Article  Google Scholar 

  14. Solovyev, V.V., Salamov, A.A., Lawrence, C.B.: Predicting internal exons by oligonucleotide composition and discriminant analysis of spliceable open reading frames. Nucleic Acids Res. 22, 5156–5163 (1994)

    Article  Google Scholar 

  15. Xu, Y., Mural, R.J., Uberbacher, E.C.: Constructing gene models from accurately predicted exons: An application of dynamic programming. Comput. Appl. Biosci. 10, 613–623 (1994b)

    Google Scholar 

  16. Zadeh, L.: Fuzzy Sets. Information and Control 8(3), 338–353 (1965)

    Article  MATH  MathSciNet  Google Scholar 

  17. Zhang, C., Zhang, R.: Evaluation of Gene-Finding algorithms by a Content-Balancing Accuracy Index. J. Biomol. Struct. Dyn. 19, 1045–1052 (2002)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Nagar, A., Purushothaman, S., Tawfik, H. (2005). Evaluation and Fuzzy Classification of Gene Finding Programs on Human Genome Sequences. In: Wang, L., Jin, Y. (eds) Fuzzy Systems and Knowledge Discovery. FSKD 2005. Lecture Notes in Computer Science(), vol 3614. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11540007_102

Download citation

  • DOI: https://doi.org/10.1007/11540007_102

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-28331-7

  • Online ISBN: 978-3-540-31828-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics