Abstract
In this paper, we propose the alignment of trees as a measure of the similarity between two labeled trees. Both ordered and unordered trees are considered. An algorithm is designed for ordered trees. The time complexity of this algorithm is O(¦T 1¦· s¦T 2· (deg(T 1) + deg(T 2))2), where ¦T i¦ is the number of nodes in T i and deg(T i ) is the degree of T i , i=1,2. The algorithm is faster than the best known algorithm for tree edit when deg(T 1) and deg(T 2) are smaller than the depths of T 1 and T 2. For unordered trees, we show that the alignment problem can be solved in polynomial time if the trees have a bounded degree and becomes NP-hard if one of the trees is allowed to have an arbitrary degree. In contrast, the edit problem for unordered trees is NP-hard even if both trees have a bounded degree [17]. Finally, multiple alignment of trees is discussed.
Supported in part by NSERC Research Grant OGP0046613.
Supported in part by NSERC Research Grant OGP0046373.
Preview
Unable to display preview. Download preview PDF.
References
M. R. Garey and D. S. Johnson, Computers and Intractability: A Guide to the Theory of NP-Completeness, W. H. Freeman, 1979.
D. Gusfield, Efficient methods for multiple sequence alignment with guaranteed error bounds, Bulletin of Mathematical Biology 55, pp. 141–154, 1993.
P. Kilpelainen and H. Mannila, Ordered and unordered tree inclusion, Report A-1991-4, Dept. of Comp. Science, University of Helsinki, August. 1991; to appear in SIAM J. on Computing.
S.-Y. Le, J. Owens, R. Nussinov, J.-H. Chen B. Shapiro and J. V. Maizel, RNA secondary structures: comparison and determination of frequently recurring substructures by consensus, Comp. Appl. Biosci. 5, 205–210, 1989.
S.-Y. Le, R. Nussinov, and J.V. Maizel, Tree graphs of RNA secondary structures and their comparisons, Computers and Biomedical Research, 22, 461–473, 1989.
S.Y. Lu, A tree-tree distance and its application to cluster analysis, IEEE Trans. Pattern Anal. Mach. Intelligence 1, 219–224, 1979.
D. Sankoff and J. Kruskal (Eds), Time Warps, String Edits, and Macromolecules: the Theory and Practice of Sequence Comparison, Addison Wesley, Reading Mass., 1983.
B. Shapiro, An algorithm for comparing multiple RNA secondary structures, Comput. Appl. Biosci. 387–393, 1988.
F.Y. Shih, Object representation and recognition using mathematical morphology model, J. System Integration, vol. 1, pp.235–256, 1991.
F.Y. Shih and O.R. Mitchell, Threshold decomposition of grayscale morphology into binary morphology, IEEE Trans. Pattern Anal. Mach. Intell., vol. PAMI-11, pp.31–42, 1989.
B. Shapiro and K. Zhang, Comparing multiple RNA secondary structures using tree comparisons, Comput. Appl. Biosci. vol. 6, no. 4, pp.309–318, 1990.
Y. Takahashi, Y. Satoh, H. Suzuki and S. Sasaki, Recognition of largest common structural fragment among a variety of chemical structures, Analytical Science, vol. 3, pp23–28, 1987.
K.C. Tai, The tree-to-tree correction problem, J. ACM, 26, 422–433, 1979.
L. Wang and T. Jiang, On the complexity of multiple sequence alignment, 1993, to appear in Journal of Computational Biology.
K. Zhang and T. Jiang, Some MAX SNP-hard results concerning unordered labeled trees, 1993, To appear in Information Processing Letters.
K. Zhang and D. Shasha, Simple fast algorithms for the editing distance between trees and related problems, SIAM J. Comput. 18, 1245–1262, 1989.
K. Zhang, R. Statman, and D. Shasha, On the editing distance between unordered labeled trees, Information Processing Letters, 42, 133–139, 1992.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1994 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Jiang, T., Wang, L., Zhang, K. (1994). Alignment of trees — An alternative to tree edit. In: Crochemore, M., Gusfield, D. (eds) Combinatorial Pattern Matching. CPM 1994. Lecture Notes in Computer Science, vol 807. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-58094-8_7
Download citation
DOI: https://doi.org/10.1007/3-540-58094-8_7
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-58094-2
Online ISBN: 978-3-540-48450-9
eBook Packages: Springer Book Archive