Abstract
Computational prediction of eukaryotic promoter is one of most elusive problems in DNA sequence analysis. Although considerable efforts have been devoted to this study and a number of algorithms have been developed in the last few years, their performances still need to further improve. In this work, we developed a new algorithm called PPFB for promoter prediction base on following hypothesis: promoter is determined by some motifs or word patterns and different promoters are determined by different motifs. We select most potential motifs (i.e. features) by divergence distance between two classes and constructed a classifier by feature boosting. Different from other classifier, we adopted a different training and classifying strategy. Computational results on large genomic sequences and comparisons with the several excellent algorithms showed that our method is efficient with better sensitivity and specificity.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Scherf, M., Klingenhoff, A., Werner, T.: Highlyspecific localization of promoter regions in large genomic sequences by Promoter Inspector: a novel context analysis approach. J. Mol. Biol. 297, 599–606 (2000)
Bajic, V.B., Seah, S.H., Chong, A., Krishnan, S.P.T., Koh, J.L.Y., Brusic, V.: Computer model for recognition of functional transcription start sites in polymerase II promoters of vertebrates. Journal of Molecular Graphics & Modeling 21, 323–332 (2003)
Davuluri, R.V., Grosse, I., Zhang, M.Q.: Computational identification of promoters and first exons in the human genome. Nat. Genet. 29, 412–417 (2001)
Down, T.A., Hubbard, T.J.: Computational detection and location of transcription start sites in mammalian genomic DNA. Genome Res. 12, 458–461 (2002)
Wu, S., Xie, X., Liew, A.W., Hong, Y.: Eukaryotic promoter prediction based on relative entropy and positional information. Physical Review E 75, 041908-1–041908-7 (2007)
Prestridge, D.S., Burks, C.: The density of transcriptional elements in promoter and non-promoter sequences. Hum. Mol. Genet. 2, 1449–1453 (1993)
Hutchinson, G.B.: The prediction of vertebrate promoter regions using differential hexamer frequency analysis. Comput. Appl. Biosci. 12, 391–398 (1996)
Cross, S.H., Clark, V.H., Bird, A.P.: Isolation of CpG islands from large genomic closnes. Nucleic Acids Res. 27, 2099–2107 (1999)
Ioshikhes, I.P., Zhang, M.Q.: Large-scale human promoter mapping using CpG islands. Nat. Genet. 26, 61–63 (2000)
Ponger, L., Mouchiroud, D.: CpGProD: identifying CpG islands associated with transcription start sites in large genomic mammalian sequences. Bioinformatics 18, 631–633 (2002)
Schapire, R., Singer, Y.: Improved boosting algorithms using confidence-rated predictions. In: Proc. Of Annual Conf. On Computational Learning Theory, pp. 80–91 (1998)
Sergios, T., Konstantinos, K.: Pattern Recognition, 2nd edn. Academic Press, San Diego (2003)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wu, S., Zeng, Q., Song, Y., Wang, L., Zhang, Y. (2008). A New Strategy for Pridicting Eukaryotic Promoter Based on Feature Boosting. In: Sun, F., Zhang, J., Tan, Y., Cao, J., Yu, W. (eds) Advances in Neural Networks - ISNN 2008. ISNN 2008. Lecture Notes in Computer Science, vol 5263. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-87732-5_54
Download citation
DOI: https://doi.org/10.1007/978-3-540-87732-5_54
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-87731-8
Online ISBN: 978-3-540-87732-5
eBook Packages: Computer ScienceComputer Science (R0)