Abstract
We consider the problem of learning the commutative subclass of regular languages in the on-line model of predicting {0,1∼-valued functions from examples and reinforcements due to Littlestone [7,4]. We show that the entire class of commutative deterministic finite state automata (CDFAs) of an arbitrary alphabet sizek is predictable inO(s k) time with the worst case number of mistakes bounded above byO(s kk logs), wheres is the number of states in the target DFA. As a corollary, this result implies that the class of CDFAs is also PAC-learnable from random labeled examples in timeO(s k) with sample complexity\(O\left( {\tfrac{1}{ \in }\left( {\log \tfrac{1}{\delta } + s^k k\log s} \right)} \right)\), using a different class of representations. The mistake bound of our algorithm is within a polynomial, for a fixed alphabet size, of the lower boundO(s+k) we obtain by calculating the VC-dimension of the class. Our result also implies the predictability of the class of finite sets of commutative DFAs representing the finite unions of the languages accepted by the respective DFAs.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Abe, N., Polynomial Learnability of Semilinear Sets,Proceedings of the 1989 Workshop on Computational Learning Theory, August, 1989.
Angluin, D., Queries for Concept Learning,Machine Learning, 2:319–342, 1987.
Blumer, A., Ehrenfeucht, A., Haussler, D., and Warumuth, M.K. Learnability and the Vapnik-Chervonenkis Dimension,Journal of the ACM, 36(4):929–965, October, 1989. e
haussler, D., Littlestone, N., and Warumuth, M.K. Predicting {0,1} Functions on Randomly Drawn Points,Proceedings of 1988 IEEE Symposium on the Foundations of Computer Science, 1988.
Helmbold, D., Sloan, R., and Warumuth, M.K. Learning Integer Lattices,Proceedings of the 1990 Workshop on Computational Learning Theory, August, 1990.
Kearns, M. and Valiant, L.G. Cryptographic limitations on learning Boolean formulae and finite automata,Proceedings of the 21st Annual ACM Symposium on Theory of Computing, pages 433–444, 1989.
Littlestone, N. Learning quickly when irrelevant attributes abound: a new linear-threshold learning algorithm,Machine Learning, 2:285–318, 1988.
Littlestone, N. From On-line to Batch Learning,Proceedings of the Second Annual Workshop on Computational Learning Theory, pages 269–284, 1989.
Parikh, R.J. Language generating devices,M.I.T. Res. Lab. Electron. Quart. Prog. Rep., 60:199–212, 1961.
Pitt L. and Warumuth, M.K., Prediction Preserving Reducibility,Journal of Computer and System Sciences, 41(3):430–467, 1990.
Pitt L. and Warumuth, M.K., The minimum consistent DFA problem cannot be approximated within any polynomial.Proceedings of 19th ACM Symp. on Theory of Computation, 1989. To appear in Journal of the A.C.M.
Valiant, L.G. A Theory of the Learnable,Communications of A.C.M., 27:1134–1142, 1984.
Vapnik, V.G. and Chervonenkis, A.Ya. On the uniform convergence of relative frequencies of events to their probabilities,Theory of Probability and its Application, 16(2):264–280, 1971.
Author information
Authors and Affiliations
Additional information
Part of this work was supported by the Office of Naval Research under contract number N00014-87-K-0401 while the author was at the Department of Computer and Information Science, University of Pennsylvania, and N0014-86-K-0454 while at the Department of Computer and Information Sciences, U.C. Santa Cruz. The author’s email address is abe@IBL.CL.nec.co.jp
About this article
Cite this article
Abe, N. Learning commutative deterministic finite state automata in polynomial time. NGCO 8, 319–335 (1991). https://doi.org/10.1007/BF03037090
Received:
Issue Date:
DOI: https://doi.org/10.1007/BF03037090