Transcription Factor Binding Sites Prediction Based on Sequence Similarity

Sim, Jeong Seop; Park, Soo-Jun

doi:10.1007/11881599_131


Part of the book series: Lecture Notes in Computer Science (LNAI, volume 4223)

Included in the following conference series:

International Conference on Fuzzy Systems and Knowledge Discovery


Abstract

Sequence algorithms are widely used to study genomic sequences in areas such as DNA fragment assembly, genomic sequence similarity analysis, and motif search. In this paper, we propose an algorithm that predicts transcription factor binding sites from a given set of upstream gene region sequences using sequence algorithms, specifically suffix arrays and the Smith-Waterman algorithm.
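The abstract names two building blocks, suffix arrays and the Smith-Waterman algorithm, but this preview does not say how the paper combines them. The sketch below is therefore only a generic, textbook-style illustration of each piece and not the authors' method. The function names, the naive sort-based suffix array construction (not one of the linear-time constructions), and the scoring values (match 2, mismatch -1, gap -2) are all assumptions chosen for illustration.

```python
def build_suffix_array(text):
    """Naive suffix array: sort suffix start positions lexicographically.

    Illustrative only; this is not a linear-time construction.
    """
    return sorted(range(len(text)), key=lambda i: text[i:])


def find_occurrences(text, sa, pattern):
    """Return all start positions of `pattern` in `text`, using binary
    search over the suffix array `sa`."""
    m = len(pattern)

    # Lower bound: first suffix whose m-character prefix is >= pattern.
    lo, hi = 0, len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] < pattern:
            lo = mid + 1
        else:
            hi = mid
    start = lo

    # Upper bound: first suffix whose m-character prefix is > pattern.
    hi = len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] <= pattern:
            lo = mid + 1
        else:
            hi = mid

    return sorted(sa[start:lo])


def smith_waterman(a, b, match=2, mismatch=-1, gap=-2):
    """Best local alignment score of `a` and `b` with a linear gap penalty.

    Scoring parameters are hypothetical defaults, not values from the paper.
    """
    best = 0
    prev = [0] * (len(b) + 1)
    for i in range(1, len(a) + 1):
        cur = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            s = match if a[i - 1] == b[j - 1] else mismatch
            # Local alignment: a cell never drops below zero.
            cur[j] = max(0, prev[j - 1] + s, prev[j] + gap, cur[j - 1] + gap)
            best = max(best, cur[j])
        prev = cur
    return best
```

For example, `find_occurrences("banana", build_suffix_array("banana"), "ana")` locates the exact repeats of a short word, while `smith_waterman` scores how well two candidate regions align locally. One plausible use of the pair is to find exact shared words quickly with the suffix array and then score the surrounding regions with local alignment, but that pipeline is a guess rather than something the abstract states.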

This work was supported by INHA UNIVERSITY Research Grant (INHA-32744).



Author information

Authors and Affiliations

School of Computer Science and Engineering, Inha University, Incheon, Korea
Jeong Seop Sim
Electronics and Telecommunications Research Institute, Daejeon, Korea
Soo-Jun Park


Editor information

Editors and Affiliations

School of Electrical and Electronic Engineering, Nanyang Technological University, Block S1, Nanyang Avenue, 639798, Singapore
Lipo Wang
Life Science Research Center, School of Electronic Engineering, Xidian University, 710071, Xi’an, Shaanxi, China
Licheng Jiao
School of Electrical and Electronic Engineering, Xidian University, 710071, Xi’an, China
Guanming Shi
School of Information Technology and Electrical Engineering, The University of Queensland, 4072, Brisbane, Queensland, Australia
Xue Li
College of Mathematics and Information Science, Hebei Normal University, 050016, Shijiazhuang, Hebei, P.R. China
Jing Liu


Cite this paper

Sim, J.S., Park, S.-J. (2006). Transcription Factor Binding Sites Prediction Based on Sequence Similarity. In: Wang, L., Jiao, L., Shi, G., Li, X., Liu, J. (eds) Fuzzy Systems and Knowledge Discovery. FSKD 2006. Lecture Notes in Computer Science, vol 4223. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11881599_131


DOI: https://doi.org/10.1007/11881599_131
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-45916-3
Online ISBN: 978-3-540-45917-0
eBook Packages: Computer Science, Computer Science (R0)
