Skip to main content

Grammatical Inference in the Discovery of Generating Functions

  • Conference paper
  • First Online:
Man–Machine Interactions 4

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 391))

Abstract

In this paper an algorithm for the induction of a context-free grammar is proposed, and its application in obtaining a generating function for the number of certain combinatorial objects is demonstrated. In particular, two problems classified in The On-Line Encyclopedia of Integer Sequences (http://oeis.org/) under entries A000073 and A000108, as well as a problem from the domain of chemoinformatics, are solved as an illustration of our method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    Our translation can be further re-formulated as an integer linear program, but the number of variables increases so much that this is not profitable.

  2. 2.

    To expand this fraction in a power series, one can use Maclaurin’s formula:

    $$\begin{aligned} f(z) = f(0) + \frac{z}{1!} f'(0) + \frac{z^2}{2!} f''(0) + \cdots + \frac{z^n}{n!} f^{(n)}(0) + \cdots \ . \end{aligned}$$
  3. 3.

    Recall that we hope to obtain grammars that are likely to be unambiguous.

  4. 4.

    We are aware of this imprecision. The number of words and their lengths should allow of executing a program in a reasonable amount of time. This, in turn, depends on many circumstances.

  5. 5.

    To model and solve our non-linear program we make use of the Optimization Modeling Language (OML) and Microsoft Solver Foundation 3.1 development tools.

References

  1. Alonso, L., Schott, R.: Random Generation of Trees: Random Generators in Computer Science. Springer, New York (1995)

    Book  Google Scholar 

  2. Angluin, D.: An application of the theory of computational complexity to the study of inductive inference. Ph.D. thesis, University of California (1976)

    Google Scholar 

  3. Angluin, D.: Negative results for equivalence queries. Mach. Learn. 5, 121–150 (1990)

    Google Scholar 

  4. Basten, H.J.S.: The usability of ambiguity detection methods for context-free grammars. Electron. Notes Theor. Comput. Sci. 238(5), 35–46 (2009)

    Article  Google Scholar 

  5. Bender, E.A., Williamson, S.G.: Foundations of Combinatorics with Applications. Dover Books on Mathematics Series. Dover Publications, Dover (2006)

    Google Scholar 

  6. Book, R., Otto, F.: String-Rewriting Systems. Springer, New York (1993)

    Book  MATH  Google Scholar 

  7. Du D.Z., Ko, K.: Problem Solving in Automata, Languages, and Complexity. Wiley, New York (2001)

    Google Scholar 

  8. Delest, M.: Algebraic languages: a bridge between combinatorics and computer science. DIMACS: Ser. Discrete Math. Theor. Comput. Sci. 24, 71–87 (1994)

    MathSciNet  Google Scholar 

  9. Eyraud, R., de la Higuera, C., Janodet, J.C.: Lars: a learning algorithm for rewriting systems. Mach. Learn. 66, 7–31 (2007)

    Article  Google Scholar 

  10. Gold, E.M.: Language identification in the limit. Inf. Control 10, 447–474 (1967)

    Article  MATH  Google Scholar 

  11. Gold, E.M.: Complexity of automaton identification from given data. Inf. Control 37, 302–320 (1978)

    Article  MathSciNet  MATH  Google Scholar 

  12. Graham, R., Knuth, D., Patashnik, O.: Concrete Mathematics: A Foundation for Computer Science. Addison-Wesley, New York (1994)

    MATH  Google Scholar 

  13. de la Higuera, C.: A bibliographical study of grammatical inference. Pattern Recogn. 38, 1332–1348 (2005)

    Article  Google Scholar 

  14. de la Higuera, C.: Grammatical Inference: Learning Automata and Grammars. Cambridge University Press, Cambridge (2010)

    Google Scholar 

  15. Hopcroft, J.E., Motwani, R., Ullman, J.D.: Introduction to Automata Theory, Languages, and Computation, 2nd edn. Addison-Wesley, New York (2001)

    MATH  Google Scholar 

  16. Imada, K., Nakamura, K.: Learning context free grammars by using sat solvers. In: ICMLA 2009, pp. 267–272. IEEE Computer Society (2009)

    Google Scholar 

  17. Kuich, K., Salomaa, A.: Semirings, Automata, Languages. Springer, Berlin (1985)

    Google Scholar 

  18. Lothaire, M.: Algebraic Combinatorics on Words, Encyclopedia of Mathematics and its Applications, vol. 90. Cambridge University Press, Cambridge (2002)

    Book  Google Scholar 

  19. Waterman, M.S.: Secondary structure of single-stranded nucleic acids. In: Studies on Foundations and Combinatorics. Advances in Mathematics Supplementary Studies, vol. 1, pp. 167–212. Academic Press, New York (1978)

    Google Scholar 

  20. Wieczorek, W., Unold, O.: Induction of directed acyclic word graph in a bioinformatics task. JMLR Workshop Conf. Proc. 34, 207–217 (2014)

    Google Scholar 

  21. Wood, D.: A generalised normal form theorem for context-free grammars. Comput. J. 13(3), 272–277 (1970)

    Article  MathSciNet  MATH  Google Scholar 

Download references

Acknowledgments

This research was supported in part by PL-Grid Infrastructure, and by Grant No. DEC-2011/03/B/ST6/01588 from National Science Center of Poland.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Wojciech Wieczorek .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Wieczorek, W., Nowakowski, A. (2016). Grammatical Inference in the Discovery of Generating Functions. In: Gruca, A., Brachman, A., Kozielski, S., Czachórski, T. (eds) Man–Machine Interactions 4. Advances in Intelligent Systems and Computing, vol 391. Springer, Cham. https://doi.org/10.1007/978-3-319-23437-3_54

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-23437-3_54

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-23436-6

  • Online ISBN: 978-3-319-23437-3

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics