Skip to main content

Finding minimal generalizations for unions of pattern languages and its application to inductive inference from positive data

  • Conference paper
  • First Online:
STACS 94 (STACS 1994)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 775))

Included in the following conference series:

Abstract

A pattern is a string of constant symbols and variables. The language defined by a pattern p is the set of constant strings obtained from p by substituting nonempty constant strings for variables in p. In this paper we are concerning with polynomial time inference from positive data of the class of unions of a bounded number of pattern languages. We introduce a syntactic notion of minimal multiple generalizations (mmg for short) to study the inferability of classes of unions. If a pattern p is obtained from another pattern q by substituting nonempty patterns for variables in q, q is said to be more general than p. A set of patterns defines a union of their languages. A set Q of patterns is said to be more general than a set P of patterns if for any pattern p in P there exists a more general pattern q in Q than p. Clearly more general set of patterns defines larger unions. A k-minimal multiple generalization (k-mmg) of a set S of strings is a minimally general set of at most k patterns that defines a union containing S. The syntactic notion of minimality enables us to efficiently compute a candidate for a semantically minimal concept. We present a general methodology for designing an efficient algorithm to find a k-mmg. Under some conditions an mmg can be used as an appropriate hypothesis for inductive inference from positive data. As results several classes of unions of pattern languages are shown to be polynomial time inferable from positive data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. S. Arikawa, S. Kuhara, S. Miyano, A. Shinohara, and T. Shinohara. A learning algorithm for elementary formal systems and its experiments on identification of transmembrane domains. In Proc. of the 25th Hawaii International Conference on System Sciences, pp. 675–684, 1992.

    Google Scholar 

  2. D. Angluin. Finding patterns common to a set of strings. In Proceedings of the 11th Annual Symposium on Theory of Computing, pp. 130–141, 1979.

    Google Scholar 

  3. H. Arimura, T. Shinohara, and S. Otsuki. Polynomial time inference of unions of two tree pattern languages. IEICE Trans. Inf. & Syst., E75-D, pp. 426–434, 1992.

    Google Scholar 

  4. H. Arimura, T. Shinohara, and S. Otsuki. A polynomial time algorithm for finding finite unions of tree pattern languages. In Proceedings of the Second International Workshop on Nonmonotonic and Inductive Logic, pp. 118–131. LNAI 659, Springer, 1993.

    Google Scholar 

  5. S. Arikawa, T. Shinohara, and A. Yamamoto. Learning elementary formal systems. Theoretical Computer Science, Vol. 95, pp. 97–113, 1992.

    Google Scholar 

  6. E.M. Gold. Languages identification in the limit. Information and Control, Vol. 10, pp. 447–474, 1967.

    Google Scholar 

  7. P. D. Laird. Learning from good and bad data. Kluwer Academic, 1988.

    Google Scholar 

  8. S. Lange and R. Wiehagen. Polynomial-time inference of arbitrary pattern languages. New Generation Computing, Vol. 8, No. 4, pp. 361–370, 1991.

    Google Scholar 

  9. S. Miyano, A. Shinohara, and T. Shinohara. Which classes of elementary formal systems are polynomial-time learnable? In S. Arikawa, A. Maruoka, and T. Sato, editors, Proceedings of the Second Workshop on Algorithmic Learning Theory, pp. 139–150, 1991.

    Google Scholar 

  10. Y. Mukouchi. Characterization of pattern languages. IEICE Trans. Inf. and Syst., Vol. E75-D, No. 7, 1992.

    Google Scholar 

  11. R. Nix. Editing by example. Technical Report 280, Department of Computer Science, Yale University, 1983.

    Google Scholar 

  12. G. Plotkin. A note on inductive generalization. In B. Meltzer and D. Mitchie, editors, Machine Intelligence, volume 5, pp. 153–163. Edinburgh Univ. Press, 1970.

    Google Scholar 

  13. E. Y. Shapiro. Inductive inference of theories from facts. Technical Report 192, Yale University, Department of Computer Science, 1981.

    Google Scholar 

  14. T. Shinohara. Polynomial time inference of extended regular pattern languages. In RIMS Symposia on Software Science and Engineering, pp. 115–127. LNCS 147, Springer, 1982.

    Google Scholar 

  15. T. Shinohara. Polynomial time inference of pattern languages and its applications. In Proceedings of the 7th IBM Symposium on Mathematical Foundations of Computer Science, pp. 191–209, 1982.

    Google Scholar 

  16. T. Shinohara. Inferring unions of two pattern languages. Bulletin of Informatics and Cybernetics, Vol. 20, pp. 83–88, 1983.

    Google Scholar 

  17. R. M. Smullyan. Theory of Formal Systems. Princeton Univ. Press, 1961.

    Google Scholar 

  18. A. Salomaa, K. Salomaa, and Sheng Y. Inclusion is undecidable for pattern languages. In Proc. 20th ICALP, 1993.

    Google Scholar 

  19. N Tanida and T. Yokomori. Polynomial-time identification of strictly regular languages in the limit. IEICE Trans. Inf. and Syst., Vol. E75-D, No. 1, pp. 125–132, 1992.

    Google Scholar 

  20. L. G. Valiant. A theory of the learnable. Comm. ACM, Vol. 27, No. 11, pp. 1134–1142, 1984.

    Google Scholar 

  21. K. Wright. Inductive Inference of Pattern Languages. PhD thesis, University of Pittsburgh, 1989.

    Google Scholar 

  22. K. Wright. Identification of unions of languages drawn from an identifiable class. In Proceedings of the 2nd Annual Workshop on Computational Learning Theory, pp. 328–333, 1989.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Patrice Enjalbert Ernst W. Mayr Klaus W. Wagner

Rights and permissions

Reprints and permissions

Copyright information

© 1994 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Arimura, H., Shinohara, T., Otsuki, S. (1994). Finding minimal generalizations for unions of pattern languages and its application to inductive inference from positive data. In: Enjalbert, P., Mayr, E.W., Wagner, K.W. (eds) STACS 94. STACS 1994. Lecture Notes in Computer Science, vol 775. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-57785-8_178

Download citation

  • DOI: https://doi.org/10.1007/3-540-57785-8_178

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-57785-0

  • Online ISBN: 978-3-540-48332-8

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics