Abstract
In this paper we have design a computational model for prokaryotic and eukaryotic gene prediction by using the clustering algorithm. The input DNA (Deoxyribonucleic Acid) sequence is spliced and the open reading frames are identified. For identification of consensus sequences various data mining algorithm is applied for creation of clusters. This model saves the implementation time, as whole of the database is present online so the sequence to be predicted is just taken from any one of the available database. Several experiments have been done where the parameters of gene prediction are changed manually. The performance has been tested on different unknown DNA sequences found on the internet. The sequences having score greater than or equal to the threshold value are entered into one cluster and rest of the sequences having score less than the given threshold are entered into second cluster and GC (Guanine and cytosine)-content percentage is calculated.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Al Shahib, A., Rainer, B., Gilbert, D.R.: Predicting protein function by machine learning on amino acid sequences – a critical Evaluation. BMC Genomics 10, 1–10 (2007)
Au, W.H., Chan, K.C.C., Yao, X.: A Novel Evolutionary Data Mining Algorithm with Applications to Churn Prediction. IEEE Trans. Evolutionary Computation 7(6), 532–545 (2003)
Baker, D., Sali, A.: Protein structure prediction and structural genomics. Nucleic Acids Research 294(5540), 93–96 (2001)
Brunak, S., Engelbrecht, J., Knudsen, S.: Prediction of human mRNA donor and acceptor sites from the DNA sequence. Journal of Molecular Biology 220, 49–65 (1991)
Lakshmi, K.M., Steven, G.S.: Department of Bioinformatics and Computational Biology George Manson university, Lecture- Bioinformatics tools and applications, book reference, Vol. 21 (2004)
Myburgh, G.: Euokaryotic RNA Polymerase II start site detection using artifical neural networks, M.Tech thesis, University of Pretoria (2005)
Vladimir, Makarov: Computer programs for eukaryotic gene prediction, vol. 3(2), pp. 195–199. Henary Stewart Publications 1467-5463 (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kaur, S., Sheetal, A., Singh, P. (2011). Computational Model for Prokaryotic and Eukaryotic Gene Prediction. In: Mantri, A., Nandi, S., Kumar, G., Kumar, S. (eds) High Performance Architecture and Grid Computing. HPAGC 2011. Communications in Computer and Information Science, vol 169. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-22577-2_47
Download citation
DOI: https://doi.org/10.1007/978-3-642-22577-2_47
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-22576-5
Online ISBN: 978-3-642-22577-2
eBook Packages: Computer ScienceComputer Science (R0)