Abstract
The detection of various types of repeats is a fundamental and well studied problem in stringology. In this paper we present extensions to this problem with applications to bioinformatics. In this paper we consider the detection of all exact and approximate inverted repeats, as well as all exact and approximate weighted inverted repeats and give efficient algorithms for their computation.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Akutsu, T.: Dynamic programming algorithms for rna secondary structure prediction with pseudoknots. Discrete Applied Mathematics 104, 45–62 (2000)
Brown, M., Wilson, C.: Rna pseudoknot modeling using intersections of stochastic context free grammars with applications to database search. In: Pacific Symposium on Biocomputing, pp. 109–125 (1995)
Barton, S.P.P.C., Iliopoulos, C.S., Smyth, W.F.: Prefix tables & border arrays with k-mismatches & applications (2013) (submitted for publication)
Chen, J.-L., Greider, C.W.: Functional analysis of the pseudoknot structure in human telomerase rna. Proceedings of the National Academy of Sciences of the United States of America 102(23), 8080–8085 (2005)
Crochemore, M.: An optimal algorithm for computing the repetitions in a word. Information Processing Letters 12(5), 244–250 (1981)
Fischer, J.: Inducing the lcp-array. In: Dehne, F., Iacono, J., Sack, J.-R. (eds.) WADS 2011. LNCS, vol. 6844, pp. 374–385. Springer, Heidelberg (2011)
Gusfield, D.: Algorithms on strings, trees, and sequences: computer science and computational biology. Cambridge University Press, New York (1997)
Gusfield, D., Stoye, J.: Linear time algorithms for finding and representing all the tandem repeats in a string. Journal of Computer and System Sciences 69(4), 525–546 (2004)
Hsu, P.-H., Chen, K.-Y., Chao, K.-M.: Finding all approximate gapped palindromes. In: Dong, Y., Du, D.-Z., Ibarra, O. (eds.) ISAAC 2009. LNCS, vol. 5878, pp. 1084–1093. Springer, Heidelberg (2009)
Ilie, L., Navarro, G., Tinta, L.: The longest common extension problem revisited and applications to approximate string searching (2010)
Iliopoulos, C.S., Makris, C., Panagis, Y., Perdikuri, K.: Evangelos Theodoridis, and Athanasios Tsakalidis. The weighted suffix tree: An efficient data structure for handling molecular weighted sequences and its applications. Fundam. Inf. 71(2,3), 259–277 (2006)
Kandoth, C., Ercal, F., Frank, R.: A framework for automated enrichment of functionally significant inverted repeats in whole genomes. BMC Bioinformatics 11(suppl. 6), 1–10 (2010)
Kato, Y., Seki, H., Kasami, T.: Stochastic multiple context-free grammar for rna pseudoknot modeling. In: Proceedings of the Eighth International Workshop on Tree Adjoining Grammar and Related Formalisms, TAGRF 2006, Stroudsburg, PA, USA, pp. 57–64. Association for Computational Linguistics (2006)
Kolpakov, R., Kucherov, G.: Finding maximal repetitions in a word in linear time. In: Proceedings of the 1999 Symposium on Foundations of Computer Science, pp. 596–604. IEEE Computer Society (1999)
Lyngso, R.B., Pedersen, C.N.S.: RNA Pseudoknot Prediction in Energy-Based Models. Journal of Computational Biology 7(3-4), 409–427 (2000)
Main, M.G., Lorentz, R.J.: An o(n log n) algorithm for finding all repetitions in a string. Journal of Algorithms 5(3), 422–432 (1984)
Manacher, G.: A new linear-time “on-line” algorithm for finding the smallest initial palindrome of a string. J. ACM 22(3), 346351 (1975)
Nong, G., Zhang, S., Chan, W.H.: Linear Suffix Array Construction by Almost Pure Induced-Sorting. In: Data Compression Conference, pp. 193–202 (2009)
Pleij, C.W.A., Rietveld, K., Bosch, L.: A new principle of rna folding based on pseudoknotting. Nucleic. Acids Research 13(5), 1717–1731 (1985) C.W.A. Pleij, K. Rietveld, L. Bosch
Porto, A.H.L., Barbosa, V.C.: Finding approximate palindromes in strings (2002)
Stoye, J., Gusfield, D.: Simple and exible detection of contiguous repeats using a suffix tree. Theoretical Computer Science 270(12), 843–856 (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Barton, C., Iliopoulos, C.S., Mulder, N., Watson, B. (2013). Identification of All Exact and Approximate Inverted Repeats in Regular and Weighted Sequences. In: Iliadis, L., Papadopoulos, H., Jayne, C. (eds) Engineering Applications of Neural Networks. EANN 2013. Communications in Computer and Information Science, vol 384. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-41016-1_2
Download citation
DOI: https://doi.org/10.1007/978-3-642-41016-1_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-41015-4
Online ISBN: 978-3-642-41016-1
eBook Packages: Computer ScienceComputer Science (R0)