Abstract
We discuss a model for the evolutionary distance between two coding DNA sequences which specializes to the DNA/protein model proposed in Hein [4]. We discuss the DNA/protein model in details and present a quadratic time algorithm that computes an optimal alignment of two coding DNA sequences in the model under the assumption of affine gap cost. The algorithm solves a conjecture in [4] and we believe that the constant factor of the running time is sufficiently small to make the algorithm feasible in practice.
Supported by the ESPRIT Long Term Research Programme of the EU under project number 20244 (ALCOM-IT).
Basic Research in Computer Science, Centre of the Danish National Research Foundation.
Preview
Unable to display preview. Download preview PDF.
References
L. Arvestad. Aligning coding DNA in the presence of frame-shift errors. In Proceedings of the 8th Annual Symposium of Combinatorial Pattern Matching (CPM 97), volume 1264 of Lecture Notes in Computer Science, pages 180–190, 1997.
R. Durbin, R. Eddy, A. Krogh, and G. Mitchison. Biological Sequence Analysis: Probalistic Models of Proteins and Nucleic Acids. Cambrigde University Press, 1998.
O. Gotoh. An improved algorithm for matching biological sequences. Journal of Molecular Biology, 162:705–708, 1981.
J. Hein. An algorithm combining DNA and protein alignment. Journal of Theoretical Biology, 167:169–174, 1994.
J. Hein and J. Støvlbaek. Genomic alignment. Journal of Molecular Evolution, 38:310–316, 1994.
J. Hein and J. Støvlbæk. Combined DNA and protein alignment. Methods in Enzymology, 266:402–418, 1996.
D. S. Hirschberg. A linear space algortihm for computing maximal common subseqeunce. Communication of the ACM, 18(6):341–343, 1975.
Y. Hua, T. Jiang, and B. Wu. Aligning DNA sequences to minimize the change in protein. Accepted for CPM 98.
S. B. Needleman and C. D. Wunsch. A general method applicable to the search for similarities in the amino acid seqeunce of two proteins. Journal of Molecular Biology, 48:433–443, 1970.
H. Peltola, H. Söderlund, and E. Ukkonen. Algorithms for the search of amino acid patterns in nucleic acid sequences. Nuclear Acids Research, 14(1):99–107, 1986.
D. Sankoff. Matching sequences under deletion /insertion constraints. In Proceedings of the National Acadamy of Science USA, volume 69, pages 4–6, 1972.
P. H. Sellers. On the theory and computation of evolutionary distance. SIAM Journal of Applied Mathematics, 26:787–793, 1974.
R. A. Wagner and M. J. Fisher. The string to string correction problem. Journal of the ACM, 21:168–173, 1974.
Z. Zhang, W. R. Pearson, and W. Miller. Aligning a DNA sequence with a protein sequence. Journal of Computational Biology, 4(3):339–349, Fall 1997.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1998 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Pedersen, C.N.S., Lyngsø, R., Hein, J. (1998). Comparison of coding DNA. In: Farach-Colton, M. (eds) Combinatorial Pattern Matching. CPM 1998. Lecture Notes in Computer Science, vol 1448. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0030788
Download citation
DOI: https://doi.org/10.1007/BFb0030788
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-64739-3
Online ISBN: 978-3-540-69054-2
eBook Packages: Springer Book Archive