Mining Regulatory Elements in Non-coding Regions of Arabidopsis thaliana

Li, Xi; Wang, Dianhui

doi:10.1007/978-3-642-16750-8_9

Xi Li^4,5 &
Dianhui Wang⁴

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 115))

Included in the following conference series:

International Conference on Computational Systems-Biology and Bioinformatics

831 Accesses

Abstract

Analysis of regulatory elements (DNA motifs) in non-coding regions is considered as one crucial step to understand the regulation mechanisms of genes with similar expression patterns. With the help of accumulated gene expression data and complete genome sequences, computational approaches have been developed in the past decade to accelerate the mining task. In previous studies, we proposed a DNA motif discovery framework, named as MODEC, which incorporated the evolutionary computation (EC) searching algorithm with data filtering techniques to favor the algorithm performance. With the attempt on exploring real-world motif mining problems, we apply both MODEC and a famous discovery algorithm MEME to predict regulatory elements in different non-coding regions of co-expressed genes from the model plant Arabidopsis thaliana. Results from both MODEC and MEME show that the targeted motif patterns can be found in the expected non-coding regions of the co-expressed gene groups. As the preliminary step of this work, we investigate whether different motif patterns can be detected in the specified non-coding regions of co-expressed genes with different functional categories. The similar prediction results from MODEC and MEME demonstrate the potential of MODEC in the field of practical motif discovery.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Galas, D.J., Schmitz, A.: DNAse footprinting: a simple method for the detection of protein-DNA binding specificity. Nucleic Acids Res. 5, 3157–3170 (1978)
Article CAS PubMed PubMed Central Google Scholar
van Helden, J., André, B., Collado-Vides, J.: Extracting regulatory sites from the upstream region of yeast genes by computational analysis of oligonucleotide frequencies. J. Mol. Biol. 281, 827–842 (1998)
Article PubMed Google Scholar
Bailey, T.L., Elkan, C.: Fitting a mixture model by expectation maximization to discover motifs in biopolymers. In: Proceedings of the Second International Conference on Intelligent Systems for Molecular Biology, pp. 28–36. AAAI Press, Menlo Park (1994)
Google Scholar
Tompa, M., Li, N., Bailey, T.L., et al.: Assessing computational tools for the discovery of transcription factor binding sites. Nature Biotechnology 23, 137–144 (2005)
Article CAS PubMed Google Scholar
Hu, J., Li, B., Kihara, D.: Limitations and potentials of current motif discovery algorithms. Nucleic Acids Res. 33, 4899–4913 (2005)
Article CAS PubMed PubMed Central Google Scholar
Chan, T.-M., Leung, K.-S., Lee, K.-H.: TFBS identification based on genetic algorithm with combined representations and adaptive post-processing. Bioinformatics 24, 341–349 (2008)
Article CAS PubMed Google Scholar
Li, L.P., Liang, Y., Bass, R.L.L.: GAPWM: a genetic algorithm method for optimizing a position weight matrix. Bioinformatics 23, 1188–1194 (2007)
Article CAS PubMed Google Scholar
Wei, Z., Jensen, S.T.: GAME: detecting cis-regulatory elements using a genetic algorithm. Bioinformatics 22, 1577–1584 (2006)
Article CAS PubMed Google Scholar
Li, X., Wang, D.H.: Computational Discovery of Regulatory DNA Motifs Using Evolutionary Computation. In: CEC-IEEE 2010: IEEE Congress on Evolutionary Computation (accepted 2010)
Google Scholar
Fiume, E., Christou, P., Giani, S., Breviario, D.: Introns are key regulatory elements of rice tubulin expression. Planta 218, 693–704 (2004)
Article CAS PubMed Google Scholar
Xie, X., Lu, J., Kulbokas, E.J., Golub, T.R., Mootha, V., Lindblad-Toh, K., Lander, E.S., Kellis, M.: Systematic discovery of regulatory motifs in human promoters and 3’ UTRs by comparison of several mammals. Nature 434, 338–345 (2005)
Article CAS PubMed PubMed Central Google Scholar
Meinke, D.W., Cherry, J.M., Dean, C., Rounsley, S.D., Koornneef, M.: Arabidopsis thaliana: a model plant for genome analysis. Science 282, 662–682 (1998)
Article CAS PubMed Google Scholar
Swarbreck, D., Wilks, C., Lamesch, P., Berardini, T.Z., Garcia-Hernandez, M., Foerster, H., Li, D., Meyer, T., Muller, R., Ploetz, L., et al.: The arabidopsis information resource (TAIR): gene structure and function annotation. Nucleic Acids Res. 36, D1009–D1014 (2008)
Article Google Scholar
Vandepoele, K., Quimbaya, M., Casneuf, T., De Veylder, L., Van de Peer, Y.: Unraveling transcriptional control in arabidopsis using cis-regulatory elements and coexpression networks. Plant Physiol. 150, 535–546 (2009)
Article CAS PubMed PubMed Central Google Scholar
Wang, D.H., Lee, N.K.: MISCORE: mismatch-based matrix similarity scores for DNA motif detection. In: Köppen, M., Kasabov, N., Coghill, G. (eds.) ICONIP 2008. LNCS, vol. 5506, pp. 478–485. Springer, Heidelberg (2008)
Chapter Google Scholar
Benos, P.V., Bulyk, M.L., Stormo, G.D.: Additivity in protein-DNA interactions: how good an approximation is it? Nucleic Acids Res. 30, 4442–4451 (2002)
Article CAS PubMed PubMed Central Google Scholar
Wang, D.H.: Characterization of regulatory motif models. Technical Report, La Trobe University, Australia (October 2009)
Google Scholar
Thijs, G., Lescot, M., Marchal, K., Rombauts, S., De Moor, B., Rouzé, P., Moreau, Y.: A higher-order background model improves the detection of promoter regulatory elements by Gibbs sampling. Bioinformatics 17, 1113–1122 (2001)
Article CAS PubMed Google Scholar
Galtier, N., Piganeau, G., Mouchiroud, D., Duret, L.: GC content evolution in mammalian genomes, the biased gene conversion hypothesis. Genetics 159, 907–911 (2001)
CAS PubMed PubMed Central Google Scholar
Mahony, S., Hendrix, D., Golden, A., Smith, T.J., Rokhsar, D.S.: Transcription factor binding site identification using the Self-Organizing Map. Bioinformatics 21, 1807–1814 (2005)
Article CAS PubMed Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science and Computer Engineering, La Trobe University, Melbourne, VIC, 3086, Australia
Xi Li & Dianhui Wang
Department of Primary Industries, Bioscience Research Division, Victorian AgriBiosciences Centre, Bundoora, VIC, 3083, Australia
Xi Li

Authors

Xi Li
View author publications
You can also search for this author in PubMed Google Scholar
Dianhui Wang
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Information Technology, King Mongkut’s University of Technology Thonburi, 126 Pracha U-Thit Rd, Bangmod, Thungkru, 10140, Bangkok, Thailand
Jonathan H. Chan
School of Computer Engineering, Nanyang Technological University, Block N4, 2b-39,Nanyang Avenue, 639798, Singapore
Yew-Soon Ong
Dept. of Computer Science, Yonsei University, 134 Shinchon-dong, Sudaemoon-ku, 120-749, Seoul, South Korea
Sung-Bae Cho

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Li, X., Wang, D. (2010). Mining Regulatory Elements in Non-coding Regions of Arabidopsis thaliana . In: Chan, J.H., Ong, YS., Cho, SB. (eds) Computational Systems-Biology and Bioinformatics. CSBio 2010. Communications in Computer and Information Science, vol 115. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16750-8_9

Download citation

DOI: https://doi.org/10.1007/978-3-642-16750-8_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-16749-2
Online ISBN: 978-3-642-16750-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics