Abstract
We developed a dynamic programming approach of computing common sequence/structure patterns between two RNAs given by their sequence and secondary structures. Common patterns between two RNAs are meant to share the same local sequential and structural properties. Nucleotides which are part of an RNA are linked together due to their phosphodiester or hydrogen bonds. These bonds describe the way how nucleotides are involved in patterns and thus delivers a bond-preserving matching definition. Based on this definition, we are able to compute all patterns between two RNAs in time O(nm) and space O(nm), where n and m are the lengths of the RNAs, respectively. Our method is useful for describing and detecting local motifs and for detecting local regions of large RNAs although they do not share global similarities. An implementation is available in C++ and can be obtained by contacting one of the authors.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Backofen, R., Will, S.: Local sequence-structure motifs in RNA. Journal of Bioinformatics and Computational Biology, JBCB (2004) (accepted for publication)
Bafna, V., Muthukrishnan, S., Ravi, R.: Computing similarity between rna strings. In: Proc. 6th Symp. Combinatorical Pattern Matching, p. 16 (1995)
Eppstein, D.: Subgraph isomorphism in planar graphs and related problems. J. Graph Algorithms & Applications 3(3), 1–27 (1999)
Gendron, P., Gautheret, D., Major, F.: Structural ribonucleic acid motifs identification and classification. In: High Performance Computing Systems and Applications, Kluwer Academic Press, Dordrecht (1998)
Gramm, Guo, Niedermeier: Pattern matching for arc-annotated sequences. FSTTCS: Foundations of Software Technology and Theoretical Computer Science 22 (2002)
Hofacker, L., Priwitzer, B., Stadler, P.F.: Prediction of locally stable RNA secondary structures for genome-wide surveys. Bioinformatics 20(2), 186–190 (2004)
Höchsmann, M., Töller, T., Giegerich, R., Kurtz, S.: Local similarity in rna secondary structures. In: Proceedings of Computational Systems Bioinformatics, CSB 2003 (2003)
Jiang, T., Lin, G.-H., Ma, B., Zhang, K.: The longest common subsequence problem for arc-annotated sequences. In: Proceedings of the 11th Annual Symposium on Combinatorial Pattern Matching, CPM 2000 (2000)
Jiang, T., Lin, G., Ma, B., Zhang, K.: A general edit distance between RNA structures. Journal of Computational Biology 9(2), 371–388 (2002)
Morgenstern, B., Frech, K., Dress, A., Werner, T.: DIALIGN: finding local similarities by multiple sequence alignment. Bioinformatics 14(3), 290–294 (1998)
Ullmann, J.R.: An algorithm for subgraph isomorphism. Journal of the ACM 23(1), 31–42 (1976)
Viksna, J., Gilbert, D.: Pattern matching and pattern discovery algorithms for protein topologies. In: Gascuel, O., Moret, B.M.E. (eds.) WABI 2001. LNCS, vol. 2149, pp. 98–111. Springer, Heidelberg (2001)
Wang, J.T.-L., Shapiro, B.A., Shasha, D., Zhang, K., Currey, K.M.: An algorithm for finding the largest approximately common substructures of two trees. IEEE Transactions on Pattern Analysis and Machine Intelligence 20(8), 889–895 (1998)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Backofen, R., Siebert, S. (2004). Fast Detection of Common Sequence Structure Patterns in RNAs. In: Apostolico, A., Melucci, M. (eds) String Processing and Information Retrieval. SPIRE 2004. Lecture Notes in Computer Science, vol 3246. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30213-1_12
Download citation
DOI: https://doi.org/10.1007/978-3-540-30213-1_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23210-0
Online ISBN: 978-3-540-30213-1
eBook Packages: Springer Book Archive