Skip to main content

RNA Multiple Structural Alignment with Longest Common Subsequences

  • Conference paper
Computing and Combinatorics (COCOON 2005)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 3595))

Included in the following conference series:

Abstract

In this paper, we present a new model for RNA multiple sequence structural alignment based on the longest common subsequence. We consider both the off-line and on-line cases. For the off-line case, i.e., when the longest common subsequence is given as a linear graph with n vertices, we first present a polynomial O(n 2) time algorithm to compute its maximum nested loop. We then consider a slightly different problem – the Maximum Loop Chain problem and present a factor-2 approximation which runs in O(n 2.5) time. For the on-line case, i.e., given m RNA sequences of lengths n, compute the longest common subsequence of them such that this subsequence either induces a maximum nested loop or the maximum number of matches, we present efficient algorithms using dynamic programming when m is small.

This research is partially supported by EPSCOR Visiting Scholar’s Program and MSU Short-term Professional Development Program.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Cormen, T., Leiserson, C., Rivest, R., Stein, C.: Introduction to Algorithms, 2nd edn. MIT Press, Cambridge (2001)

    MATH  Google Scholar 

  2. Dayhoff, M.: Computer aids to protein sequence determination. J. Theoret. Biology 8(1), 97–112 (1965)

    Article  Google Scholar 

  3. Dayhoff, M.: Computer analysis of protein evolution. Scientific American 221(1), 86–95 (1969)

    Article  Google Scholar 

  4. Davydov, E., Batzoglu, S.: A computational model for RNA multiple structural alignment. In: Sahinalp, S.C., Muthukrishnan, S.M., Dogrusoz, U. (eds.) CPM 2004. LNCS, vol. 3109, pp. 254–269. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

  5. Deng, X., Li, G., Li, Z., Ma, B., Wang, L.: A PTAS for distinguishing (sub)string selection. In: Widmayer, P., Triguero, F., Morales, R., Hennessy, M., Eidenbenz, S., Conejo, R. (eds.) ICALP 2002. LNCS, vol. 2380, pp. 740–751. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  6. Eddy, S.R.: Computational genomics of noncoding RNA genes. Cell 109, 137–140 (2002)

    Article  Google Scholar 

  7. Goldman, D., Istrail, S., Papadimitriou, C.: Algorithmic aspects of protein structure similarity. In: Proc. 40th Ann. Symp. Foundations of Computer Science (FOCS 1999), pp. 512–522 (1999)

    Google Scholar 

  8. Gabow, H.: An efficient implementation of Edmond’s algorithm for maximum matching on graphs. J. ACM 23(2), 221–234 (1976)

    Article  MathSciNet  MATH  Google Scholar 

  9. Greenberg, R.I.: Bounds on the Number of the Longest Common Subsequence Problem. CoRR cs.DM/0301030 (2003)

    Google Scholar 

  10. Hirschberg, D.: The longest common subsequence problem. PhD Thesis, Princeton University (1975)

    Google Scholar 

  11. Hsu, W.J., Du, M.W.: Computing a longest common subsequence for a set of strings. BIT 24, 45–59 (1984)

    Article  MathSciNet  MATH  Google Scholar 

  12. Jiang, T., Li, M.: On the approximation of shortest common supersequences and longest common subsequences. SIAM J. Comput. 24(5), 1122–1139 (1995)

    Article  MathSciNet  MATH  Google Scholar 

  13. Lanctot, K., Li, M., Ma, B., Wang, S., Zhang, L.: Distinguishing string selection problems. In: Proc. 6th Ann. ACM-SIAM Symp. on Discrete Algorithms, pp. 633–642 (1999)

    Google Scholar 

  14. Li, M., Ma, B., Wang, L.: Finding similar regions in many strings. In: Proc. 31st ACM Symp. on Theory of Computing (STOC 1999), pp. 473–482 (1999)

    Google Scholar 

  15. Micali, S., Vazirani, V.: An O(_|V ||E|2) algorithm for finding maximum matching in general graphs. In: Proc. 21st Ann. Symp. Foundations of Computer Science (FOCS 1980), pp. 17–27 (1980)

    Google Scholar 

  16. Nussinov, R., Pieczenik, G., Griggs, J., Kleitman, D.: Algorithms for loop matching. SIAM J. Applied Math. 35, 68–82 (1978)

    Article  MathSciNet  MATH  Google Scholar 

  17. Rick, C.: Efficient Computation of All Longest Common Subsequences. In: Halldórsson, M.M. (ed.) SWAT 2000. LNCS, vol. 1851, pp. 407–418. Springer, Heidelberg (2000)

    Chapter  Google Scholar 

  18. Smith, T.F., Waterman, M.S.: Identification of common molecular subsequences. J. Molecular Biology 147, 195–197 (1981)

    Article  Google Scholar 

  19. Zucker, M., Stiegler, P.: Optimal computer folding of large RNA sequences using thermodynamics and auxiliary information. Nucleic Acids Research 9, 133–148 (1981)

    Article  Google Scholar 

  20. Zucker, M.: Computer prediction of RNA structure. Methods in Enzymology 180, 262–288 (1989)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Bereg, S., Zhu, B. (2005). RNA Multiple Structural Alignment with Longest Common Subsequences. In: Wang, L. (eds) Computing and Combinatorics. COCOON 2005. Lecture Notes in Computer Science, vol 3595. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11533719_6

Download citation

  • DOI: https://doi.org/10.1007/11533719_6

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-28061-3

  • Online ISBN: 978-3-540-31806-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics