Abstract
In this paper an algorithm for the induction of a context-free grammar is proposed, and its application in obtaining a generating function for the number of certain combinatorial objects is demonstrated. In particular, two problems classified in The On-Line Encyclopedia of Integer Sequences (http://oeis.org/) under entries A000073 and A000108, as well as a problem from the domain of chemoinformatics, are solved as an illustration of our method.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
Our translation can be further re-formulated as an integer linear program, but the number of variables increases so much that this is not profitable.
- 2.
To expand this fraction in a power series, one can use Maclaurin’s formula:
$$\begin{aligned} f(z) = f(0) + \frac{z}{1!} f'(0) + \frac{z^2}{2!} f''(0) + \cdots + \frac{z^n}{n!} f^{(n)}(0) + \cdots \ . \end{aligned}$$ - 3.
Recall that we hope to obtain grammars that are likely to be unambiguous.
- 4.
We are aware of this imprecision. The number of words and their lengths should allow of executing a program in a reasonable amount of time. This, in turn, depends on many circumstances.
- 5.
To model and solve our non-linear program we make use of the Optimization Modeling Language (OML) and Microsoft Solver Foundation 3.1 development tools.
References
Alonso, L., Schott, R.: Random Generation of Trees: Random Generators in Computer Science. Springer, New York (1995)
Angluin, D.: An application of the theory of computational complexity to the study of inductive inference. Ph.D. thesis, University of California (1976)
Angluin, D.: Negative results for equivalence queries. Mach. Learn. 5, 121–150 (1990)
Basten, H.J.S.: The usability of ambiguity detection methods for context-free grammars. Electron. Notes Theor. Comput. Sci. 238(5), 35–46 (2009)
Bender, E.A., Williamson, S.G.: Foundations of Combinatorics with Applications. Dover Books on Mathematics Series. Dover Publications, Dover (2006)
Book, R., Otto, F.: String-Rewriting Systems. Springer, New York (1993)
Du D.Z., Ko, K.: Problem Solving in Automata, Languages, and Complexity. Wiley, New York (2001)
Delest, M.: Algebraic languages: a bridge between combinatorics and computer science. DIMACS: Ser. Discrete Math. Theor. Comput. Sci. 24, 71–87 (1994)
Eyraud, R., de la Higuera, C., Janodet, J.C.: Lars: a learning algorithm for rewriting systems. Mach. Learn. 66, 7–31 (2007)
Gold, E.M.: Language identification in the limit. Inf. Control 10, 447–474 (1967)
Gold, E.M.: Complexity of automaton identification from given data. Inf. Control 37, 302–320 (1978)
Graham, R., Knuth, D., Patashnik, O.: Concrete Mathematics: A Foundation for Computer Science. Addison-Wesley, New York (1994)
de la Higuera, C.: A bibliographical study of grammatical inference. Pattern Recogn. 38, 1332–1348 (2005)
de la Higuera, C.: Grammatical Inference: Learning Automata and Grammars. Cambridge University Press, Cambridge (2010)
Hopcroft, J.E., Motwani, R., Ullman, J.D.: Introduction to Automata Theory, Languages, and Computation, 2nd edn. Addison-Wesley, New York (2001)
Imada, K., Nakamura, K.: Learning context free grammars by using sat solvers. In: ICMLA 2009, pp. 267–272. IEEE Computer Society (2009)
Kuich, K., Salomaa, A.: Semirings, Automata, Languages. Springer, Berlin (1985)
Lothaire, M.: Algebraic Combinatorics on Words, Encyclopedia of Mathematics and its Applications, vol. 90. Cambridge University Press, Cambridge (2002)
Waterman, M.S.: Secondary structure of single-stranded nucleic acids. In: Studies on Foundations and Combinatorics. Advances in Mathematics Supplementary Studies, vol. 1, pp. 167–212. Academic Press, New York (1978)
Wieczorek, W., Unold, O.: Induction of directed acyclic word graph in a bioinformatics task. JMLR Workshop Conf. Proc. 34, 207–217 (2014)
Wood, D.: A generalised normal form theorem for context-free grammars. Comput. J. 13(3), 272–277 (1970)
Acknowledgments
This research was supported in part by PL-Grid Infrastructure, and by Grant No. DEC-2011/03/B/ST6/01588 from National Science Center of Poland.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Wieczorek, W., Nowakowski, A. (2016). Grammatical Inference in the Discovery of Generating Functions. In: Gruca, A., Brachman, A., Kozielski, S., Czachórski, T. (eds) Man–Machine Interactions 4. Advances in Intelligent Systems and Computing, vol 391. Springer, Cham. https://doi.org/10.1007/978-3-319-23437-3_54
Download citation
DOI: https://doi.org/10.1007/978-3-319-23437-3_54
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-23436-6
Online ISBN: 978-3-319-23437-3
eBook Packages: EngineeringEngineering (R0)