Skip to main content

A New Dissimilarity Measure Between Trees by Decomposition of Unit-Cost Edit Distance

  • Conference paper
  • 3159 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4881))

Abstract

Tree edit distance is a conventional dissimilarity measure between labeled trees. However, tree edit distance including unit-cost edit distance contains the similarity of label and that of tree structure simultaneously. Therefore, even if the label similarity between two trees that share many nodes with the same label is high, the high label similarity is hard to be recognized from their tree edit distance when their tree sizes or shapes are quite different. To overcome this flaw, we propose a novel method that obtains a label dissimilarity measure and a structural dissimilarity measure separately by decomposing unit-cost edit distance.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Moulton, V., Zuker, M., Steel, M., Pointon, R., Penny, D.: Metrics on RNA Secondary Structures. J. of Computational Biology 7, 277–292 (2000)

    Article  Google Scholar 

  2. Bille, P.: A Survey on Tree Edit Distance and Related Problems. Theoretical Computer Science 337, 217–239 (2005)

    Article  MATH  MathSciNet  Google Scholar 

  3. Zhang, K., Shasha, D.: Simple Fast Algorithms for the Editing Distance between Trees and Related Problems. SIAM J. on Computing 18, 1245–1262 (1989)

    Article  MATH  MathSciNet  Google Scholar 

  4. Shasha, D., Zhang, K.: Fast Algorithms for the Unit Cost Editing Distance between Trees. J. of Algorithms 11, 581–621 (1990)

    Article  MATH  MathSciNet  Google Scholar 

  5. Oommen, B.J., Loke, R.K.S.: On the Pattern Recognition of Noisy Subsequence Trees. IEEE Trans. on PAMI 23(9), 929–946 (2001)

    Google Scholar 

  6. Schlieder, T., Naumann, F.: Approximate Tree Embedding for Querying XML Data. In: Proc. of ACM SIGIR Workshop on XML and Information Retrieval (2000)

    Google Scholar 

  7. Pinter, R.Y., Rokhlenko, O., Tsur, D., Ziv-Ukelson, M.: Approximate Labelled Subtree Homeomorphism. In: Sahinalp, S.C., Muthukrishnan, S.M., Dogrusoz, U. (eds.) CPM 2004. LNCS, vol. 3109, pp. 59–73. Springer, Heidelberg (2004)

    Google Scholar 

  8. Sanz, I., Mesiti, M., Guerrini, G., Llavori, R.B.: Approximate Subtree Identification in Heterogeneous XML Documents Collections. In: Bressan, S., Ceri, S., Hunt, E., Ives, Z.G., Bellahsène, Z., Rys, M., Unland, R. (eds.) XSym 2005. LNCS, vol. 3671, pp. 192–206. Springer, Heidelberg (2005)

    Google Scholar 

  9. Bunke, H., Shearer, K.: A Graph Distance Metric based on the Maximal Common Subgraph. Pattern Recognition Letters 19, 255–259 (1998)

    Article  MATH  Google Scholar 

  10. http://sd.is.uec.ac.jp/~koga/IDEALdata.html

Download references

Author information

Authors and Affiliations

Authors

Editor information

Hujun Yin Peter Tino Emilio Corchado Will Byrne Xin Yao

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Koga, H., Saito, H., Watanabe, T., Yokoyama, T. (2007). A New Dissimilarity Measure Between Trees by Decomposition of Unit-Cost Edit Distance. In: Yin, H., Tino, P., Corchado, E., Byrne, W., Yao, X. (eds) Intelligent Data Engineering and Automated Learning - IDEAL 2007. IDEAL 2007. Lecture Notes in Computer Science, vol 4881. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-77226-2_65

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-77226-2_65

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-77225-5

  • Online ISBN: 978-3-540-77226-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics