Abstract
Linearity and determinism seem to be two essential conditions for polynomial learning of grammars to be possible. We propose a general condition valid for certain subclasses of the linear grammars given which these classes can be polynomially identified in the limit from given data. This enables us to give new proofs of the identification of well known classes of grammars, and to propose a new (and larger) class of linear grammars for which polynomial identification is thus possible.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
D. Angluin. Learning regular sets from queries and counterexamples. Information and Control, 39:337–350, 1987.
D. Angluin. Queries revisited. In Proceedings of ALT 2001, pages 12–31. Springer-Verlag, 2001.
A. Brazma, I. Jonassen, J. Vilo, and E. Ukkonen. Pattern discovery in biosequences. In V. Honavar and G. Slutski, editors, Grammatical Inference, ICGI’ 98, number 1433 in LNCS, pages 257–270, Berlin, Heidelberg, 1998. Springer-Verlag.
C. de la Higuera. Characteristic sets for polynomial grammatical inference. Machine Learning, 27:125–138, 1997.
C. de la Higuera and J. Oncina. Learning deterministic linear languages. In R. H. Sloan J. Kivinen, editor, Proceedings of COLT 2002, volume 2375 of LNCS, pages 185–200. Springer-Verlag, 2002.
H. Fernau. Identification of function distinguishable languages. In S. Jain H. Arimura and A. Sharma, editors, Proceedings of the 11th International Conference on Algorithmic Learning Theory (ALT 2000), volume 1968, pages 116–130, Berlin, Heidelberg, 2000. Springer-Verlag.
H. Fernau. Learning XML grammars. In P. Perner, editor, Machine Learning and Data Mining in Pattern Recognition MLDM’01, number 2123, pages 73–87. Springer-Verlag, 2001.
M. Gold. Complexity of automaton identification from given data. Information and Control, 37:302–320, 1978.
Lisa Hellerstein, Krishnan Pillaipakkamnatt, Vijay Raghavan, and Dawn Wilkins. How many queries are needed to learn? Journal of the ACM, 43(5):840–862, 1996.
M. Kearns and L. Valiant. Cryptographic limitations on learning boolean formulae and finite automata. In 21st ACM Symposium on Theory of Computing, pages 433–444, 1989.
S. Lee. Learning of context-free languages: A survey of the literature. Technical Report TR-12-96, Center for Research in Computing Technology, Harvard University, Cambridge, Massachusetts, 1996.
K. J. Lang, B. A. Pearlmutter, and R. A. Price. Results of the Abbadingo one DFA learning competition and a new evidence-driven state merging algorithm. In Grammatical Inference, number 1433 in LNCS, pages 1–12. Springer-Verlag, 1998.
J. Oncina and P. García. Identifying regular languages in polynomial time. In H. Bunke, editor, Advances in Structural and Syntactic Pattern Recognition, volume 5 of Series in Machine Perception and Artificial Intelligence, pages 99–108. World Scientific, 1992.
L. Pitt. Inductive inference, DFA’s, and computational complexity. In Analogical and Inductive Inference, number 397 in LNCS, pages 18–44. Springer-Verlag, Berlin, 1989.
L. Pitt and M. Warmuth. The minimum consistent DFA problem cannot be approximated within any polynomial. Journal of the Association for Computing Machinery, 40(1):95–142, 1993.
Y. Sakakibara. Learning context-free grammars from structural data in polynomial time. Theoretical Computer Science, 76:223–242, 1990.
Y. Sakakibara. Recent advances of grammatical inference. Theoretical Computer Science, 185:15–45, 1997.
J. M. Sempere and P. García. A characterisation of even linear languages and its application to the learning problem. In R. C. Carrasco and J. Oncina, editors, Grammatical Inference and Applications, ICGI-94, number 862 in LNCS, pages 38–44, Berlin, Heidelberg, 1994. Springer
A. Stolcke. An efficient probablistic context-free parsing algorithm that computes prefix probabilities. Linguistics, 21(2):165–201, 1995.
Y. Takada. Grammatical inference for even linear languages based on control sets. Information Processing Letters, 28(4):193–199, 1988.
G. Valiant. A theory of the learnable. Communications of the Association for Computing Machinery, 27(11):1134–1142, 1984.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
de la Higuera, C., Oncina, J. (2002). On Sufficient Conditions to Identify in the Limit Classes of Grammars from Polynomial Time and Data. In: Adriaans, P., Fernau, H., van Zaanen, M. (eds) Grammatical Inference: Algorithms and Applications. ICGI 2002. Lecture Notes in Computer Science(), vol 2484. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45790-9_11
Download citation
DOI: https://doi.org/10.1007/3-540-45790-9_11
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44239-4
Online ISBN: 978-3-540-45790-9
eBook Packages: Springer Book Archive