learning pattern languages using queries

Matsumoto, Satoshi; Shinohara, Ayumi

doi:10.1007/3-540-62685-9_16

Satoshi Matsumoto¹ &
Ayumi Shinohara¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 1208))

Included in the following conference series:

European Conference on Computational Learning Theory

129 Accesses

Abstract

A pattern is a finite string of constant and variable symbols. For k≥1, we denote by kμΠ the set of all patterns in which each variable symbol occurs at most k times. In particular, we abbreviate μΠ for k=1. The language L(π) of a pattern π is the set of all strings obtained by substituting any non-null constant string for each variable symbol in π. In this paper, we show that any pattern π ∈ kμΠ is exactly identifiable in O(¦ω¦^k+2) time from one positive example w ∈ L(π) using ¦ω¦^k+1+¦π¦^k membership queries. Moreover, we introduce the notion of critical pattern, and show that the number of membership queries can be reduced to ¦ω¦+¦π¦ if the target pattern π∈μΠ is not critical. For instance, any pattern π∈μΠ whose constant parts are of length at most 3 is not critical. Finally, we show a nontrivial subclass of μΠ that is identified using membership queries only, without any initial positive example.

This author is a Research Fellow of the Japan Society for the Promotion of Science (JSPS). The author's research is partly supported by Grants-in-Aid for JSPS research fellows from the Ministry of Education, Science and Culture, Japan.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

The Teaching Complexity of Erasing Pattern Languages with Bounded Variable Frequency

Positive Characteristic Sets for Relational Pattern Languages

Learning Pattern Languages over Groups

References

D. Angluin. Finding patterns common to a set of strings. Journal of Computer and System Science, 21:46–62, 1980.
Google Scholar
D. Angluin. Inductive inference of formal languages from positive data. Information and Control, 45:117–135, 1980.
Google Scholar
D. Angluin. Learning regular sets from queries and counterexamples. Information and Computation, 75:87–106, 1987.
Google Scholar
D. Angluin. Queries and concept learning. Machine Learning, 2:319–342, 1988.
Google Scholar
S. Arikawa, S. Kuhara, S. Miyano, A. Shinohara and T. Shinohara. A learning algorithm for elementary formal systems and its experiments on identification of transmembrane domains. In Proceedings 25th Hawaii International Conference on System Sciences, Vol. I, pages 675–684, 1992.
Google Scholar
H. Arimura, H. Ishizaka and T. Shinohara. Learning unions of tree patterns using queries. In Proceedings of 6th Workshop on Algorithmic Learning Theory, Lecture Notes in Artificial Intelligence 997, pages 66–79, 1995.
Google Scholar
A. Bairoch. PROSITE: A dictionary of sites and patterns in proteins. Nucleic Acids Research, 19:2241–2245, 1991.
Google Scholar
H. Ishizaka, H. Arimura and T. Shinohara. Finding tree patterns consistent with positive and negative examples using queries. In Proceedings of 5th Workshop on Algorithmic Learning Theory, Lecture Notes in Artificial Intelligence 872, pages 317–332, 1994.
Google Scholar
M. Kearns and L. Pitt. A polynomial-time algorithm for learning k-variable pattern languages from examples. In Proceedings of the 2nd Annual Conference on Computational Learning Theory, pages 57–71, 1989.
Google Scholar
S. Lange, J. Nessel and R. Wiehagen. Language learning from good examples. In Proceedings of 5th International Workshop on Algorithmic Learning Theory, Lecture Notes in Artificial Intelligence 872, pages 423–437, 1994.
Google Scholar
S. Lange and R. Wiehagen. Polynomial-time inference of arbitrary pattern languages. New Generation Computing, 8(4):361–370, 1991.
Google Scholar
A. Marron. Learning pattern languages from a single initial example and from queries. In Proceedings of the first Annual Conference on Computational Learning Theory, pages 311–325, 1988.
Google Scholar
S. Matsumoto and A. Shinohara. Learning subsequence languages. In 6th European-Japanese Seminar on Information Modelling and Knowledge Bases, 1996.
Google Scholar
S. Miyano, A. Shinohara and T. Shinohara. Which classes of elementary formal systems are polynomial-time learnable? In Proceedings of 2nd Workshop on Algorithmic Learning Theory, pages 139–150, 1991.
Google Scholar
H. Sakamoto. Language learning from membership queries and characteristic examples. In Proceedings of 6th International Workshop on Algorithmic Learning Theory, Lecture Notes in Artificial Intelligence 997, pages 55–65. Springer-Verlag, 1995.
Google Scholar
S. Shimozono, A. Shinohara, T. Shinohara, S. Miyano, S. Kuhara and S. Arikawa. Knowledge acquisition from amino acid sequences by machine learning system BONSAI. Transactions of Information Processing Society of Japan, 35(10):2009–2018, 1994.
Google Scholar
A. Shinohara. Teachability in computational learning. New Generation Computing, 8(4):337–347, 1990.
Google Scholar
T. Shinohara. Polynomial time inference of extended regular pattern languages. In RIMS Symposia on Software Science and Engineering (Lecture Notes in Computer Science 147), pages 115–127, 1982.
Google Scholar
T. Shinohara. Polynomial time inference of pattern languages and its applications. In Proceedings 7th IBM Symp. Math. Found. Comp. Sci., pages 191–209, 1982.
Google Scholar
T. Shinohara. Inductive inference from positive data is powerful. In Proceedings of the 3rd Annual Conference on Computational Learning Theory, pages 97–110, 1990.
Google Scholar
E. Tateishi, O. Maruyama and S. Miyano. Extracting motifs from positive and negative sequence data. In Proceeding 13th Symposium on Theoretical Aspects of Computer Science, Lecture Notes in Computer Science 1046, pages 219–230, 1996.
Google Scholar
E. Tateishi and S. Miyano. A greedy strategy for finding motifs from positive and negative examples. In Proceeding First Pacific Symposium on Biocomputing, pages 599–613. World Scientific Press, 1996.
Google Scholar
L. G. Valiant. A theory of the learnable. Communications of the ACM, 27:1134–1142, 1984.
Google Scholar
T. Zeugmann. Average case analysis of pattern language learning algorithm. In Proceedings of 5th Workshop on Algorithmic Learning Theory, Lecture Notes in Artificial Intelligence 872, pages 8–9, 1994.
Google Scholar
T. Zeugmann. Lange and Wiehagen's pattern language learning algorithm: an average-case analysis with respect to its total learing time. Technical Report RIFIS-TR-CS-111, Aplil 20, Kyushu University, 1995. (to appear in Annals of Mathematics and Artificial Intelligence).
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Informatics, Kyushu University 33, 812-81, Fukuoka, Japan
Satoshi Matsumoto & Ayumi Shinohara

Authors

Satoshi Matsumoto
View author publications
You can also search for this author in PubMed Google Scholar
Ayumi Shinohara
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Shai Ben-David

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Matsumoto, S., Shinohara, A. (1997). learning pattern languages using queries. In: Ben-David, S. (eds) Computational Learning Theory. EuroCOLT 1997. Lecture Notes in Computer Science, vol 1208. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-62685-9_16

Download citation

DOI: https://doi.org/10.1007/3-540-62685-9_16
Published: 03 June 2005
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-62685-5
Online ISBN: 978-3-540-68431-2
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics