Time and Space Efficient Algorithms for Constrained Sequence Alignment

Peng, Z. S.; Ting, H. F.

doi:10.1007/978-3-540-30500-2_22

Z. S. Peng²⁰ &
H. F. Ting²⁰

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 3317))

Included in the following conference series:

International Conference on Implementation and Application of Automata

612 Accesses
4 Citations

Abstract

In this paper, we study the constrained sequence alignment problem, which is a generalization of the classical sequence alignment problem with the additional constraint that some characters in the alignment must be positioned at the same columns. The problem finds important applications in Bioinformatics. Our major result is an O(ℓn²)-time and O(ℓn)-space algorithm for constructing an optimal constrained alignment of two sequences where n is the length of the longer sequence and ℓ is the length of the constraint. Our algorithm matches the best known time complexity and reduces the best known space complexity by a factor of n for solving the problem. We also apply our technique to design time and space efficient heuristic and approximation algorithm for aligning multiple sequences.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Tang, C.Y., Lu, C.L., Chang, M.D.T., Tsai, Y.T., Sun, Y.J., Chao, K.M., Chang, J.M., Chiou, Y.H., Wu, C.M., Chang, H.T., Chou, W.I.: Constrained multiple sequence alignment tool development and its application to RNase family alignment. In: Proceedings of the First IEEE Computer Society Bioinformatics Conference, pp. 127–137 (2002)
Google Scholar
Gusfield, D.: Efficient methods for multiple sequence alignment with guaranteed error bounds. Bulletin of Mathematical Biology 30, 141–154 (1993)
Google Scholar
Gusfield, D.: Algorithms on strings, trees, and sequence. Cambridge University Press, British (1999)
Google Scholar
Higgins, D., Sharpe, P.: CLUSTAL: a package for performing multiple sequence alignment on a microcomputer. Gene 73, 237–244 (1988)
Article Google Scholar
Corpet, F.: Multiple sequence alignment with hierarchical clustering. Nucleic Acids Research 16, 10881–10890 (1988)
Article Google Scholar
Chin, F.Y.L., Ho, N.L., Lam, T.W., Wong, W.H., Chan, M.Y.: Efficient constrained multiple sequence alignment with performance guarantee. In: Proceedings of the IEEE Computational Systems Bioinformatics Conference, pp. 337–346 (2003)
Google Scholar
Thompson, J.D., Higgins, D.G., Gibson, T.J.: CLUSTAL W: Improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Research 22(22), 4673–4680 (1994)
Article Google Scholar
Wang, L., Jiang, T.: On the complexity of multiple sequence alignment. Journal of Computational Biology 1, 337–348 (1994)
Article Google Scholar
Pevzner, P.A.: Multiple alignment, communication cost, and graph matching. SIAM Journal on Applied Mathematics 52, 1763–1779 (1992)
Article MATH MathSciNet Google Scholar
Bonizzoni, P., Vedova, G.D.: The complexity of multiple sequence alignment with SP-score that is a metric. Theoretical Computer Science 259, 63–79 (2001)
Article MATH MathSciNet Google Scholar
Needleman, S., Wunsch, C.: A general method applicable to the search for similarities in the amino acid sequence of two proteins. Journal of Molecular Evolution 48, 443–453 (1970)
Google Scholar
Bafna, V., Lawler, E.L., Pevzner, P.A.: Approximation algorithms for multiple sequence alignment. Theoretical Computer Science 182, 233–244 (1997)
Article MATH MathSciNet Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, The University of Hong Kong, Hong Kong
Z. S. Peng & H. F. Ting

Authors

Z. S. Peng
View author publications
You can also search for this author in PubMed Google Scholar
H. F. Ting
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science, University of Manitoba, R3T 2N2, Winnipeg, MB, Canada
Michael Domaratzki
Department of Mathematics, University of Turku, Finland
Alexander Okhotin
School of Computing, Queen’s University, K7L 3N6, Kingston, Ontario, Canada
Kai Salomaa
Department of Computer Science, University of Western Ontario, N6A 5B7, London, Ontario, Canada
Sheng Yu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Peng, Z.S., Ting, H.F. (2005). Time and Space Efficient Algorithms for Constrained Sequence Alignment. In: Domaratzki, M., Okhotin, A., Salomaa, K., Yu, S. (eds) Implementation and Application of Automata. CIAA 2004. Lecture Notes in Computer Science, vol 3317. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30500-2_22

Download citation

DOI: https://doi.org/10.1007/978-3-540-30500-2_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-24318-2
Online ISBN: 978-3-540-30500-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics