Skip to main content

Annotation of Human Genomic Sequence by Combining Existing Gene Prediction Tools Using Hybrid Approach

  • Conference paper
  • 1951 Accesses

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 306))

Abstract

Various methods and tools are used for genomic sequences annotation, each of which needs training data set and hence their accuracy is confined to specific type of organism. To surmount this problem, we proposed a hybrid method in which weighted annotated binary DNA sequences from different tools are convolved independently with multi scaled modified Gaussian function that generates set of multi scaled sequences for each tool. All the sequences of the same scale values from different tools are added based on each nucleotide position. Then this multi scaled sequences are normalized, scaled and combined together for each nucleotide position. By combining best predicted ranges among different predicted ranges from individual gene prediction tool, our proposed tool increases Exon level accuracy by 10 – 12 % whereas 2-4 % of missed and wrong exons can be identified in comparison to accuracy given by single gene predicting tool.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Burge, C., Karlin, S.: Prediction of complete gene structure in human genomic DNA. J. Mol. Biol. 268, 78–94 (1997)

    Article  Google Scholar 

  2. Krogh, A.: Two methods for improving performance of an HMM and their application for gene-finding. In: Gaasterland, T., et al. (eds.) Proceedings of the Fifth International Conference on Intelligent Systems for Molecular Biology, pp. 179–186. AAAI Press, Menlo Park (1997)

    Google Scholar 

  3. Solovyev, V.V., Salamov, A.A., Lawrence, C.B.: Identification of human gene structure using linear discriminant functions and dynamic programming. In: Rawling, C., et al. (eds.) Proceedings of the Third International Conference on Intelligent Systems for Molecular Biology, pp. 367–375. AAAI Press, Menlo Park (1995)

    Google Scholar 

  4. Lukashin, A.V., Borodovsky, M.: School of Biology and Schools of Biology and Mathematics, Georgia Institute of Technology, Atlanta, GA 30332-0230, USA GeneMark. hmm: New solutions for gene-finding. Nucleic Acids Res. 26, 1107–1115 (1998)

    Article  Google Scholar 

  5. Kulp, D., Haussler, D., Reese, M.G., Eeckman, F.H.: A generalized hidden markov model for the recognition of human genes in DNA. In: States, D., et al. (eds.) Proceedings of the Fourth International Conference on Intelligent Systems for Molecular Biology, pp. 134–142. AAAI Press, Menlo Park (1996)

    Google Scholar 

  6. Salzberg, S., Delcher, A., Fasman, K., Henderson, J.: A decision tree system for finding genes in DNA. J. Comp. Biol. 5, 667–680 (1998)

    Article  Google Scholar 

  7. Zhang, M.Q.: Identification of protein coding regions in the human genome by quadratic discriminant analysis. Proc. Natl. Acad. Sci. 94, 565–568 (1997)

    Article  Google Scholar 

  8. Uberbacher, E.C., Xu, Y., Mural, R.J.: Discovering and understanding genes in human DNA sequence using GRAIL. Methods Enzymol. 266, 259–281 (1996)

    Article  Google Scholar 

  9. Murakami, K., Takagi, T.: Gene recognition by combination of several gene-finding programs. Bioinformatics 14(8), 665–675 (1998)

    Article  Google Scholar 

  10. Rogic, S., Francis Ouellette, B.F., Mackworth, A.K.: Improving gene recognition accuracy by combining predictions from two gene-finding programs. Bioinformatics (2002)

    Google Scholar 

  11. Burset, M., Guigó, R.: Evaluation of gene structure prediction programs. Genomics 34(3), 353–367 (1996)

    Article  Google Scholar 

  12. Solovyev, V., Salamov, A.: The Gene-Finder computer tools for analysis of human and model organisms genome sequences. In: ISMB 1997 Proceedings (1997)

    Google Scholar 

  13. Rogic, S., Mackworth, A.K., Ouellette, F.B.F.: Evaluation of genefinding programs on mammalian sequences. Genome Res. 11, 817–832 (2001)

    Article  Google Scholar 

  14. Bors, A.G.: Introduction of the Radial Basis Function. Department of Computer science University of York (1994)

    Google Scholar 

  15. Mathé, C., Sagot, M., Schiex, T., Rouze, P.: Current methods of gene prediction, their strengths and weaknesses. Nucleic Acids Res. (2002)

    Google Scholar 

  16. Web, https://ccrma.stanford.edu/~jos/sasp/Gaussian_Function_Properties.html

  17. Web, http://mathworld.wolfram.com/GaussianFunction.html

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Saxena, A., Pitchaipillai, G., Vardawaj, P.K. (2012). Annotation of Human Genomic Sequence by Combining Existing Gene Prediction Tools Using Hybrid Approach. In: Parashar, M., Kaushik, D., Rana, O.F., Samtaney, R., Yang, Y., Zomaya, A. (eds) Contemporary Computing. IC3 2012. Communications in Computer and Information Science, vol 306. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32129-0_48

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-32129-0_48

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-32128-3

  • Online ISBN: 978-3-642-32129-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics