Skip to main content

Efficient Learning of Unlabeled Term Trees with Contractible Variables from Positive Data

  • Conference paper
Inductive Logic Programming (ILP 2003)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2835))

Included in the following conference series:

Abstract

In order to represent structural features common to tree structured data, we propose an unlabeled term tree, which is a rooted tree pattern consisting of an unlabeled ordered tree structure and labeled variables. A variable is a labeled hyperedge which can be replaced with any unlabeled ordered tree of size at least 2. In this paper, we deal with a new kind of variable, called a contractible variable, that is an erasing variable which is adjacent to a leaf. A contractible variable can be replaced with any unlabeled ordered tree, including a singleton vertex. Let \(\mathcal{OTT}^{c}\) be the set of all unlabeled term trees t such that all the labels attaching to the variables of t are mutually distinct. For a term tree t in \(\mathcal{OTT}^{c}\), the term tree language L(t) of t is the set of all unlabeled ordered trees which are obtained from t by replacing all variables with unlabeled ordered trees. First we give a polynomial time algorithm for deciding whether or not a given term tree in \(\mathcal{OTT}^{c}\) matches a given unlabeled ordered tree. Next for a term tree t in \(\mathcal{OTT}^{c}\), we define the canonical term tree c(t) of t in \(\mathcal{OTT}^{c}\) which satisfies L(c(t))=L(t). And then for two term trees t and t’ in \(\mathcal{OTT}^{c}\), we show that if L(t)=L(t’) then c(t) is isomorphic to c(t’). Using this fact, we give a polynomial time algorithm for finding a minimally generalized term tree in \(\mathcal{OTT}^{c}\) which explains all given data. Finally we conclude that the class \(\mathcal{OTT}^{c}\) is polynomial time inductively inferable from positive data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Abiteboul, S., Buneman, P., Suciu, D.: Data on the Web: From Relations to Semistructured Data and XML. Morgan Kaufmann, San Francisco (2000)

    Google Scholar 

  2. Amoth, T.R., Cull, P., Tadepalli, P.: Exact learning of unordered tree patterns from queries. In: Proc. COLT-1999, pp. 323–332. ACM Press, New York (1999)

    Chapter  Google Scholar 

  3. Angluin, D.: Finding patterns common to a set of strings. Journal of Computer and System Science 21, 46–62 (1980)

    Article  MATH  MathSciNet  Google Scholar 

  4. Arimura, H., Sakamoto, H., Arikawa, S.: Efficient learning of semi-structured data from queries. In: Abe, N., Khardon, R., Zeugmann, T. (eds.) ALT 2001. LNCS (LNAI), vol. 2225, pp. 315–331. Springer, Heidelberg (2001)

    Chapter  Google Scholar 

  5. Matsumoto, S., Hayashi, Y., Shoudai, T.: Polynomial time inductive inference of regular term tree languages from positive data. In: Li, M. (ed.) ALT 1997. LNCS (LNAI), vol. 1316, pp. 212–227. Springer, Heidelberg (1997)

    Google Scholar 

  6. Matsumoto, S., Shoudai, T., Miyahara, T., Uchida, T.: Learning of finite unions of tree patterns with internal structured variables from queries. In: McKay, B., Slaney, J.K. (eds.) Canadian AI 2002. LNCS (LNAI), vol. 2557, pp. 523–534. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  7. Miyahara, T., Suzuki, Y., Shoudai, T., Uchida, T., Takahashi, K., Ueda, H.: Discovery of frequent tag tree patterns in semistructured web documents. In: Chen, M.-S., Yu, P.S., Liu, B. (eds.) PAKDD 2002. LNCS (LNAI), vol. 2336, pp. 341–355. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  8. Shoudai, T., Miyahara, T., Uchida, T., Matsumoto, S.: Inductive inference of regular term tree languages and its application to knowledge discovery. In: Information Modelling and Knowledge Bases XI, pp. 85–102. IOS Press, Amsterdam (2000)

    Google Scholar 

  9. Shoudai, T., Uchida, T., Miyahara, T.: Polynomial time algorithms for finding unordered tree patterns with internal variables. In: Freivalds, R. (ed.) FCT 2001. LNCS, vol. 2138, pp. 335–346. Springer, Heidelberg (2001)

    Chapter  Google Scholar 

  10. Suzuki, Y., Akanuma, R., Shoudai, T., Miyahara, T., Uchida, T.: Polynomial time inductive inference of ordered tree patterns with internal structured variables from positive data. In: Kivinen, J., Sloan, R.H. (eds.) COLT 2002. LNCS (LNAI), vol. 2375, pp. 169–184. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  11. Suzuki, Y., Shoudai, T., Miyahara, T., Uchida, T.: A polynomial time matching algorithm of structured ordered tree patterns for data mining from semistructured data. In: Matwin, S., Sammut, C. (eds.) ILP 2002. LNCS (LNAI), vol. 2583, pp. 270–284. Springer, Heidelberg (2003)

    Chapter  Google Scholar 

  12. Suzuki, Y., Shoudai, T., Uchida, T., Miyahara, T.: Ordered term tree languages which are polynomial time inductively inferable from positive data. In: Cesa-Bianchi, N., Numao, M., Reischuk, R. (eds.) ALT 2002. LNCS (LNAI), vol. 2533, pp. 188–202. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  13. Wang, K., Liu, H.: Discovering structural association of semistructured data. IEEE Trans. Knowledge and Data Engineering 12, 353–371 (2000)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2003 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Suzuki, Y., Shoudai, T., Matsumoto, S., Uchida, T. (2003). Efficient Learning of Unlabeled Term Trees with Contractible Variables from Positive Data. In: Horváth, T., Yamamoto, A. (eds) Inductive Logic Programming. ILP 2003. Lecture Notes in Computer Science(), vol 2835. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-39917-9_23

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-39917-9_23

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-20144-1

  • Online ISBN: 978-3-540-39917-9

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics