Skip to main content

Average case analysis of pattern language learning algorithms

  • Invited Talks
  • Conference paper
  • First Online:
Algorithmic Learning Theory (AII 1994, ALT 1994)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 872))

  • 131 Accesses

Abstract

The present paper deals with the comparison of two pattern language learning algorithms with respect to their average case behavior. Pattern languages have been introduced by Angluin (1980) and are defined as follows:

Let Σ={a,b,} be any non-empty finite alphabet containing at least two elements. Furthermore, let X={x i } i ε In be an infinite set of variables such that Σ ∩ X=\(\not 0\). Patterns are non-empty strings from Σ ∪ X, e.g., ab, ax1ccc, bx1x1cx2x2 are patterns. L(p), the language generated by pattern p is the set of strings which can be obtained by substituting non-null strings from Σ * for the variables of the pattern p. Thus aabbb is generable from pattern ax1x2b, while aabba is not. Pat and PAT denote the set of all patterns and of all pattern languages over Σ, respectively. In order to deal with the learnability of pattern languages we have to specify from what information the inference algorithms are supposed to identify the target language. We consider both learning from text and informant. Intuitively, a text for L generates the language L without any information concerning the complement of L, whereas an informant of L decides L by informing the strategy whether or not any word from Σ * belongs to L. Note that we allow a text and an informant to be non-effective.

The learning algorithms we are going to analyze learn the class of all pattern languages in the limit. The first one has been established by Angluin (1980). It mainly uses a subprocedure that finds, for any given set of positive and negative examples, a pattern that is descriptive for the input sample. In particular, this learning algorithm exclusively outputs consistent hypotheses. However, its update time is not polynomially bounded in the length of the actual input unless P=NP.

Recently, Lange and Wiehagen (1991) described a learning algorithm that might behave inconsistently. Nevertheless, its update time is polynomially bounded in the length of the actual input. On the other hand, the latter algorithm mainly exploits the fact that every pattern language is uniquely characterized by its corresponding set of all strings that have minimal length.

Hence, it is only natural to compare these algorithms with respect to their average case behavior. This might be also interesting with respect to potential applications pattern language learning algorithms have (cf. Nix (1983)). Finally, we compare the results obtained with the fact that PAT is not PAC-learnable unless P=NP (cf. Schapire (1990)).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

References

  • Angluin, D. (1980), Finding patterns common to a set of strings, Journal of Computer and System Sciences21, 46–62.

    Google Scholar 

  • Lange, S., and Wiehagen, R. (1991), Polynomial-time inference of arbitrary pattern languages, New Generation Computing8, 361–370.

    Google Scholar 

  • Nix, R.P. (1983), Editing by examples, Yale University, Dept. Computer Science, Technical Report 280.

    Google Scholar 

  • Schapire, R.E. (1990), Pattern languages are not learnable, in “Proceedings 3rd Annual Workshop on Computational Learning Theory”, (M.A. Fulk and J. Case, Eds.), pp. 122–129, Morgan Kaufmann Publishers, Inc., San Mateo.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Setsuo Arikawa Klaus P. Jantke

Rights and permissions

Reprints and permissions

Copyright information

© 1994 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Zeugmann, T. (1994). Average case analysis of pattern language learning algorithms. In: Arikawa, S., Jantke, K.P. (eds) Algorithmic Learning Theory. AII ALT 1994 1994. Lecture Notes in Computer Science, vol 872. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-58520-6_49

Download citation

  • DOI: https://doi.org/10.1007/3-540-58520-6_49

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-58520-6

  • Online ISBN: 978-3-540-49030-2

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics