Skip to main content

Constructing Structural Alignment of RNA Sequences by Detecting and Assessing Conserved Stems

  • Conference paper
Life System Modeling and Simulation (LSMS 2007)

Part of the book series: Lecture Notes in Computer Science ((LNBI,volume 4689))

Included in the following conference series:

  • 1604 Accesses

Abstract

The comparative methods for predicting RNA secondary structure can be facilitated by taking structural alignments of homologous sequences as input. However, it is very difficult to construct a well structural alignment of RNA sequences without knowing the secondary structures. In this paper, we present a stem-based method for constructing structural alignment of RNA sequences with unknown structures. The method can be summarized by: 1) we detect possible stems in the RNA sequence using the so-called position matrix with which some possibly paired positions are uncovered; 2) we detect conserved stems across multiple sequences by multiplying the position matrices; 3) we assess the conserved stems using the Signal-to-Noise and the new SCFG model; 4) we construct structural alignment of RNA sequences by incorporating conserved stems with Clustal W which is a popular program for multiple sequence alignment. We tested our method on data sets composed of known structural alignments which are downloaded from the Rfam database. The accuracy, measured as sensitivity and true positive rate, of our method is much greater than alignments by Clustal W.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Eddy, S.R.: Non-coding RNA genes and modern RNA world. Nat. Rev. Genet. 2, 919–929 (2001)

    Article  Google Scholar 

  2. Huttenhofer, A., Schattner, P., Polacek, N.: Non-coding RNAs: hope or hype. TRENDS in Genetics 21, 289–297 (2005)

    Article  Google Scholar 

  3. Pace, N.R., Thomas, B.C., Woese, C.R.: Probing RNA structure, function, and history by comparative analysis. In: Gesteland, R.F., Cech, T.R., Atkins, J.F. (eds.) The RNA World, 2nd edn., pp. 113–141. Cold Spring Harbor Laboratory Press, NY (1999)

    Google Scholar 

  4. Knudsen, B., Hein, J.: Pfold: RNA secondary structure prediction using stochastic context-free grammars. Nucleic Acids Research 31, 3423–3428 (2003)

    Article  Google Scholar 

  5. Hofacker, I., Fekete, M., Stadler, P.: Secondary structure prediction for aligned RNA sequences. Journal of Molecular Biology 319, 1059–1066 (2002)

    Article  Google Scholar 

  6. Ruan, J., Stormo, G., Zhang, W.: An iterated loop matching approach to the prediction of RNA secondary structures with pseudoknots. Bioinformatics 20, 58–66 (2004)

    Article  Google Scholar 

  7. Knight, R., et al.: BayesFold: rational 2 degrees folds that combine thermodynamic, covariation, and chemical data for aligned RNA sequences. RNA 10, 1323–1336 (2004)

    Article  MathSciNet  Google Scholar 

  8. Thompson, J., Higgins, D., Gibson, T.: CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, positions-specific gap penalties and weight matrix choice. Nucleic Acids Research 22, 4673–4680 (1994)

    Article  Google Scholar 

  9. Notredame, C., Higgins, D., Heringa, J.: T-Coffee: A novel method for fast and accurate multiple sequence alignment. Journal of Molecular Biology 302, 205–217 (2000)

    Article  Google Scholar 

  10. Sankoff, D.: Simultaneous solution of the RNA folding, alignment and protosequence problems. SIAM Journal on Applied Mathematics 45, 810–825 (1985)

    Article  MATH  MathSciNet  Google Scholar 

  11. Hofacker, I., Bernhart, S., Stadler, P.: Alignment of RNA base pairing probability matrices. Bioinformatics 20, 2222–2227 (2004)

    Article  Google Scholar 

  12. Mathews, D., Turner, D.: Dynalign: An algorithm for finding the secondary structure common to two RNA sequences. Journal of Molecular Biology 317(2), 191–203 (2002)

    Article  Google Scholar 

  13. Perriquet, O., Touzet, H., Dauchet, M.: Finding the common structure shared by two homologous RNAs. Bioinformatics 19, 108–116 (2003)

    Article  Google Scholar 

  14. Ji, Y., Xu, X., Stormo, G.: A graph theoretical approach for predicting common RNA secondary structure motifs including pseudoknots in unaligned sequences. Bioinformatics 20, 591–602 (2004)

    Article  Google Scholar 

  15. Bafna, V., Tang, H., Zhang, S.: Consensus folding of unaligned RNA sequences revisited. Journal of computational biology 13, 283–295 (2006)

    Article  MathSciNet  Google Scholar 

  16. Tabei, Y., et al.: SCARNA: fast and accurate structural alignment of RNA sequences by matching fixed-length stem fragments. Bioinformatics 22, 1723–1729 (2006)

    Article  Google Scholar 

  17. Durbin, R., et al.: Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids. Cambridge University press, Cambridge (1998)

    MATH  Google Scholar 

  18. Sam, G.J., Alex, B., et al.: Rfam: an RNA family database. Nucleic Acids Research 31, 439–441 (2003)

    Article  Google Scholar 

  19. Fang, X., et al.: The detection and assessment of possible RNA secondary structure using multiple sequence alignment. In: The 22nd Annual ACM Symposium on Applied Computing, Seoul, Korea, March 11-15, pp. 11–15. ACM Press, New York (2007)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Kang Li Xin Li George William Irwin Gusen He

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Fang, X., Luo, Z., Yuan, B., Wang, Z., Ding, F. (2007). Constructing Structural Alignment of RNA Sequences by Detecting and Assessing Conserved Stems. In: Li, K., Li, X., Irwin, G.W., He, G. (eds) Life System Modeling and Simulation. LSMS 2007. Lecture Notes in Computer Science(), vol 4689. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74771-0_24

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-74771-0_24

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-74770-3

  • Online ISBN: 978-3-540-74771-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics