Abstract
Previous research into the neural classification of E.coli promoters has focused on the use of raw DNA sequences and alignment methods. In this paper, we use sequence dependent structural profiles of DNA to train neural networks for promoter recognition. In addition to this, we evaluate the impact of different types of non-promoters used in training and testing on the classification accuracy. 872 E.coli promoters were used in addition to three types of non-promoters, random sequences with the same base frequency as the promoter sequences, genes selected from E.coli and random sequences with the same base frequencies as the gene non-promoters. Raw DNA sequences were then converted to stacking energy and GC-trinucleotide profiles. We found the promoter classification accuracy using structural profiles was comparable to other methods. However, our approach has the advantage of not requiring finding the -35 and -10 hexamers and alignment of the DNA. Overall, using non-promoters from coding regions and random sequences with the same base frequency as the gene non-promoter resulted in the best classification accuracy.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Borries Demeler and Guangwen Zhou. Neural Network optimization for E. coli promoter prediction. Nucl. Acids. Res., 19(7):1593–1599, April 1991.
Calvin B. Harley and Robert P. Reynolds. Analysis of E. coli promoter sequences. Nucl. Acids. Res., 15(5):2334–2361, 1987.
Diane K. Hawley and William R. McClure. Compilation and analysis of Escherichia coli promoter DNA sequences. Nucl. Acids. Res., 11(8):2237–2255, 1983.
Haym Hirsh and Michiel Noordewier. Using background knowledge to improve inductive learning of DNA sequences. In Artificial Intelligence for Applications, 1994., Proceedings of the Tenth Conference on, pages 351–357, March 1994.
Shlomit Lisser and Hanah Margalit. Compilation of E.coli mRNA promoter sequences. Nucl. Acids. Res., 21(7):1507–1516, April 1993.
A.V. Lukashin, V.V. Anshelevich, B.R. Amirikyan, A.I. Gragerov, and M.D. FrankKamenetskii. Neural network models for promoter recognition. Journal of Biomolecular Structure and Dynamics, 6:1123–1133, 1989.
Qicheng Ma, Jason T L Wang, and James R Gattiker. Mining Biomolecular Data Using Background Knowledge and Artificial Neural Networks in Handbook of Massive Data Sets, chapter 30, pages 1141–1168. Kluwer Academic Publishers, 2002.
Qicheng Ma, Jason T. L. Wang, Dennis Shasha, and Cathy H. Wu. DNA Sequence Classification via an Expectation Maximization Algorithm and Neural Networks: A Case Study. IEEE Transactions on Systems, Man and Cybernetics, part c, 31(4):468–475, November 2001.
Indu Mahadevan and Indira Ghosh. Analysis of E.coli Promoter Structures using Neural Networks. Nucl. Acids. Res., 22(11):2158–2165, June 1994.
Michael Connell O’Neil. Escherichia coli promoters: neural networks develop distinct descriptions in learning to search for promoters of different spacing classes. Nucl. Acids. Res., 20(13):3471–3477, 1992.
Rick L. Ornstein, Robert Rein, Donnal L. Breen, and Robert D. Macelroy. An optimized potential function for the calculation of nucleic acid interaction energies I. Base stacking. Biopolymers, 17(10):2341–2360, 1978.
Anders Gorm Pedersen and Jacob Engelbrecht. Investigations of Escherichia coli Promoter Sequences with Artificial Neural Networks: New Signals Discovered Upstream of the Transcriptional Startpoint. In Christopher J. Rawlings, Dominic A. Clark, Russ B. Altman, Lawrence Hunter, Thomas Lengauer, and Shoshana J. Wodak, editors, Proceedings of the Third International Conference on Intelligent Systems for Molecular Biology, Cambridge, United Kingdom. AAAI, July 1995.
Anders Gorm Pedersen, Lars Juhl Jensen, Soren Brunak, Hans-Henrik Staerfeldta, and David W. Ussery. A DNA structural atlas for Escherichia coli. Journal of Molecular Biology, 299(4):907–930, june 2000.
H Salgado, A Santos-Zavaleta, S Gama-Castro, D Milln-Zrate, E Daz-Peredo, F Snchez-Solano, E Prez-Rueda, C Bonavides-Martnez, and J Collado-Vides. RegulonDB (version 3.2): Transcriptional Regulation and Operon Organization in Escherichia coli K-12. Nucl. Acids. Res., 29(1):72–74, 2001.
Geoffrey G. Towell and Jude W. Shavlik. Knowledge-based artificial neural networks. Artificial Intelligance, 70:119–165, 1994.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Conilione, P.C., Wang, D. (2005). Neural Classification of E.coli Promoters Using Selected DNA Profiles. In: Abraham, A., Dote, Y., Furuhashi, T., Köppen, M., Ohuchi, A., Ohsawa, Y. (eds) Soft Computing as Transdisciplinary Science and Technology. Advances in Soft Computing, vol 29. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-32391-0_13
Download citation
DOI: https://doi.org/10.1007/3-540-32391-0_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-25055-5
Online ISBN: 978-3-540-32391-4
eBook Packages: EngineeringEngineering (R0)