Abstract
Many methods and tools are used to annotate genomic sequences, but each requires a training data set, so its accuracy is confined to a specific type of organism. To overcome this problem, we propose a hybrid method in which weighted, annotated binary DNA sequences from different tools are convolved independently with a multi-scale modified Gaussian function, generating a set of multi-scale sequences for each tool. Sequences of the same scale from different tools are then added at each nucleotide position. The resulting multi-scale sequences are normalized, scaled, and combined at each nucleotide position. By combining the best predicted ranges among those produced by the individual gene prediction tools, the proposed tool increases exon-level accuracy by 10–12%, and 2–4% of missed and wrong exons can be identified, compared with the accuracy of a single gene prediction tool.
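The combination step described in the abstract can be illustrated with a minimal sketch. It makes several assumptions that are not taken from the paper: a plain unit-area Gaussian stands in for the "modified" Gaussian (whose form is not given here), and the function names, tool weights, scale values, peak normalization, and the 0.5 threshold are all hypothetical choices for illustration.

```python
import numpy as np


def gaussian_kernel(sigma, radius):
    """Unit-area Gaussian kernel (assumed stand-in for the paper's modified Gaussian)."""
    x = np.arange(-radius, radius + 1)
    k = np.exp(-(x ** 2) / (2.0 * sigma ** 2))
    return k / k.sum()


def combine_predictions(tracks, weights, sigmas=(2.0, 4.0, 8.0)):
    """Merge binary exon tracks (one row per tool) into one per-nucleotide score.

    tracks  : array-like of shape (n_tools, n_positions), 1 = predicted coding
    weights : one hypothetical trust weight per tool
    sigmas  : hypothetical scale values for the multi-scale smoothing
    """
    tracks = np.asarray(tracks, dtype=float)
    weights = np.asarray(weights, dtype=float)
    n = tracks.shape[1]
    combined = np.zeros(n)
    for sigma in sigmas:
        # Keep the kernel no longer than the sequence so mode="same" returns n values.
        radius = min(int(3 * sigma), (n - 1) // 2)
        kernel = gaussian_kernel(sigma, radius)
        # Convolve each weighted track independently, then add across tools per position.
        per_scale = sum(
            np.convolve(w * t, kernel, mode="same")
            for w, t in zip(weights, tracks)
        )
        # Normalize this scale to a peak of 1 (assumed normalization) before combining.
        peak = per_scale.max()
        if peak > 0:
            per_scale = per_scale / peak
        combined += per_scale
    # Average over scales so the final score stays in [0, 1].
    return combined / len(sigmas)


# Two hypothetical tools that disagree on the exon boundaries.
tool_a = np.zeros(200)
tool_a[50:100] = 1
tool_b = np.zeros(200)
tool_b[60:110] = 1

score = combine_predictions([tool_a, tool_b], weights=[0.6, 0.4])
exon_mask = score >= 0.5  # hypothetical cut-off for calling a position coding
```

Positions where both tools agree receive scores near 1, positions supported by only one tool receive intermediate scores, and the smoothing at several scales softens disagreements at exon boundaries rather than forcing a hard vote.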
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Saxena, A., Pitchaipillai, G., Vardawaj, P.K. (2012). Annotation of Human Genomic Sequence by Combining Existing Gene Prediction Tools Using Hybrid Approach. In: Parashar, M., Kaushik, D., Rana, O.F., Samtaney, R., Yang, Y., Zomaya, A. (eds) Contemporary Computing. IC3 2012. Communications in Computer and Information Science, vol 306. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32129-0_48
DOI: https://doi.org/10.1007/978-3-642-32129-0_48
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-32128-3
Online ISBN: 978-3-642-32129-0
eBook Packages: Computer Science (R0)