Abstract
A pattern is a string of constant symbols and variables. The language defined by a pattern p is the set of constant strings obtained from p by substituting nonempty constant strings for variables in p. In this paper we are concerning with polynomial time inference from positive data of the class of unions of a bounded number of pattern languages. We introduce a syntactic notion of minimal multiple generalizations (mmg for short) to study the inferability of classes of unions. If a pattern p is obtained from another pattern q by substituting nonempty patterns for variables in q, q is said to be more general than p. A set of patterns defines a union of their languages. A set Q of patterns is said to be more general than a set P of patterns if for any pattern p in P there exists a more general pattern q in Q than p. Clearly more general set of patterns defines larger unions. A k-minimal multiple generalization (k-mmg) of a set S of strings is a minimally general set of at most k patterns that defines a union containing S. The syntactic notion of minimality enables us to efficiently compute a candidate for a semantically minimal concept. We present a general methodology for designing an efficient algorithm to find a k-mmg. Under some conditions an mmg can be used as an appropriate hypothesis for inductive inference from positive data. As results several classes of unions of pattern languages are shown to be polynomial time inferable from positive data.
Preview
Unable to display preview. Download preview PDF.
References
S. Arikawa, S. Kuhara, S. Miyano, A. Shinohara, and T. Shinohara. A learning algorithm for elementary formal systems and its experiments on identification of transmembrane domains. In Proc. of the 25th Hawaii International Conference on System Sciences, pp. 675–684, 1992.
D. Angluin. Finding patterns common to a set of strings. In Proceedings of the 11th Annual Symposium on Theory of Computing, pp. 130–141, 1979.
H. Arimura, T. Shinohara, and S. Otsuki. Polynomial time inference of unions of two tree pattern languages. IEICE Trans. Inf. & Syst., E75-D, pp. 426–434, 1992.
H. Arimura, T. Shinohara, and S. Otsuki. A polynomial time algorithm for finding finite unions of tree pattern languages. In Proceedings of the Second International Workshop on Nonmonotonic and Inductive Logic, pp. 118–131. LNAI 659, Springer, 1993.
S. Arikawa, T. Shinohara, and A. Yamamoto. Learning elementary formal systems. Theoretical Computer Science, Vol. 95, pp. 97–113, 1992.
E.M. Gold. Languages identification in the limit. Information and Control, Vol. 10, pp. 447–474, 1967.
P. D. Laird. Learning from good and bad data. Kluwer Academic, 1988.
S. Lange and R. Wiehagen. Polynomial-time inference of arbitrary pattern languages. New Generation Computing, Vol. 8, No. 4, pp. 361–370, 1991.
S. Miyano, A. Shinohara, and T. Shinohara. Which classes of elementary formal systems are polynomial-time learnable? In S. Arikawa, A. Maruoka, and T. Sato, editors, Proceedings of the Second Workshop on Algorithmic Learning Theory, pp. 139–150, 1991.
Y. Mukouchi. Characterization of pattern languages. IEICE Trans. Inf. and Syst., Vol. E75-D, No. 7, 1992.
R. Nix. Editing by example. Technical Report 280, Department of Computer Science, Yale University, 1983.
G. Plotkin. A note on inductive generalization. In B. Meltzer and D. Mitchie, editors, Machine Intelligence, volume 5, pp. 153–163. Edinburgh Univ. Press, 1970.
E. Y. Shapiro. Inductive inference of theories from facts. Technical Report 192, Yale University, Department of Computer Science, 1981.
T. Shinohara. Polynomial time inference of extended regular pattern languages. In RIMS Symposia on Software Science and Engineering, pp. 115–127. LNCS 147, Springer, 1982.
T. Shinohara. Polynomial time inference of pattern languages and its applications. In Proceedings of the 7th IBM Symposium on Mathematical Foundations of Computer Science, pp. 191–209, 1982.
T. Shinohara. Inferring unions of two pattern languages. Bulletin of Informatics and Cybernetics, Vol. 20, pp. 83–88, 1983.
R. M. Smullyan. Theory of Formal Systems. Princeton Univ. Press, 1961.
A. Salomaa, K. Salomaa, and Sheng Y. Inclusion is undecidable for pattern languages. In Proc. 20th ICALP, 1993.
N Tanida and T. Yokomori. Polynomial-time identification of strictly regular languages in the limit. IEICE Trans. Inf. and Syst., Vol. E75-D, No. 1, pp. 125–132, 1992.
L. G. Valiant. A theory of the learnable. Comm. ACM, Vol. 27, No. 11, pp. 1134–1142, 1984.
K. Wright. Inductive Inference of Pattern Languages. PhD thesis, University of Pittsburgh, 1989.
K. Wright. Identification of unions of languages drawn from an identifiable class. In Proceedings of the 2nd Annual Workshop on Computational Learning Theory, pp. 328–333, 1989.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1994 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Arimura, H., Shinohara, T., Otsuki, S. (1994). Finding minimal generalizations for unions of pattern languages and its application to inductive inference from positive data. In: Enjalbert, P., Mayr, E.W., Wagner, K.W. (eds) STACS 94. STACS 1994. Lecture Notes in Computer Science, vol 775. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-57785-8_178
Download citation
DOI: https://doi.org/10.1007/3-540-57785-8_178
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-57785-0
Online ISBN: 978-3-540-48332-8
eBook Packages: Springer Book Archive