Skip to main content

Towards a Natural Language Compiler

  • Conference paper
  • First Online:
Advances in Computational Intelligence (MICAI 2018)

Abstract

Being able to create a natural language compiler has been one of the most sought-after goals to reach since the very beginning of artificial intelligence. Since then; however, it has been an elusive and difficult task to achieve to the extent of being considered almost impossible to perform. In this article, we present a promising path by using a grammar formalism which attempts to model natural language; in principle, by using minimalist grammars as one of the last proposed instances of formalism of this type. The main idea consists in creating a parser based on this type of grammars which could recognize and analyze the text (or input program) written in natural language and use this parser as a front-end of a compiler. Then, for the rest of the compilation process, utilize the usual phases of a classic compiler of a programming language. Moreover, we present a prototype of a natural language compiler whose specific language is that of arithmetic expressions, in order to show with evidence that it is indeed possible to implement it, that is to say, to put the proposed compiler design into practice, showing in this manner that it is actually possible to create a natural language compiler following this promising path.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    We believe that it makes sense to start with easy to process and well-defined natural language subsets and subsequently to cover more complex challenging ones.

References

  1. Aho, A.V., Lam, M.S., Sethi, R., Ullman, J.D.: Compilers: Principles, Techniques, and Tools, 2nd edn. Addison-Wesley, Boston (2006)

    MATH  Google Scholar 

  2. Aho, A.V., Ullman, J.D.: The Theory of Parsing, Translation, and Compiling. Prentice-Hall Inc, Upper Saddle River (1972)

    Google Scholar 

  3. Boullier, P.: Proposal for a natural language processing syntactic backbone. Research report RR-3342, INRIA (1998). https://hal.inria.fr/inria-00073347

  4. Boullier, P.: Range Concatenation Grammars, pp. 269–289. Springer, Dordrecht (2005). https://doi.org/10.1007/1-4020-2295-6_13

    Book  MATH  Google Scholar 

  5. Chomsky, N.: The Minimalist Program. Current Studies in Linguistics, MIT (1995)

    Google Scholar 

  6. Cisco Systems, Inc: Chez scheme. https://github.com/cisco/chezscheme

  7. Ghuloum, A.: Ikarus scheme. http://ikarus-scheme.org/

  8. Harkema, H.: A recognizer for minimalist grammars, vol. 2000, pp. 111–122, Trento, Italy (2000)

    Google Scholar 

  9. Iacob, R., Rebedea, T., Trausan-Matu, S.: NLCP: Towards a compiler for natural language. In: 2017 21st International Conference on Control Systems and Computer Science (CSCS), pp. 252–259, May 2017. https://doi.org/10.1109/CSCS.2017.42

  10. Jäger, G., Rogers, J.: Formal language theory: refining the Chomsky hierarchy. Philos. Trans. R. Soc. Lond. B: Biol. Sci. 367(1598), 1956–1970 (2012). https://doi.org/10.1098/rstb.2012.0077

    Article  Google Scholar 

  11. Joshi, A.K.: Tree adjoining grammars: how much context-sensitivity is required to provide reasonable structural descriptions?, pp. 206–250. Studies in Natural Language Processing, Cambridge University Press, Cambridge (1985). https://doi.org/10.1017/CBO9780511597855.007

  12. Joshi, A.K., Shanker, K.V., Weir, D.: The convergence of mildly context-sensitive grammar formalisms. Tech. report MS-CIS-09-01, University of Pennsylvania, January 1990. https://repository.upenn.edu/cgi/viewcontent.cgi?article=1571&context=cis_reports

  13. Kallmeyer, L.: Parsing Beyond Context-Free Grammars. Cognitive Technologies. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-14846-0

    Book  MATH  Google Scholar 

  14. Pullum, G.K., Gazdar, G.: Natural languages and context-free languages. Linguist. Philos. 4(4), 471–504 (1982). https://doi.org/10.1007/BF00360802

    Article  Google Scholar 

  15. Seki, H., Matsumura, T., Fujii, M., Kasami, T.: On multiple context-free grammars. Theor. Comput. Sci. 88(2), 191–229 (1991). https://doi.org/10.1016/0304-3975(91)90374-B

    Article  MathSciNet  MATH  Google Scholar 

  16. Sørensen, M.H., Urzyczyin, P.: Lectures on the Curry-Howard Isomorphism. Studies in Logic and the Foundations of Mathematics, vol. 149. Elsevier, Amsterdam (2006). https://www.sciencedirect.com/bookseries/studies-in-logic-and-the-foundations-of-mathematics/vol/149/suppl/C

    MATH  Google Scholar 

  17. Stabler, E.: Derivational minimalism. In: Retoré, C. (ed.) LACL 1996. LNCS, vol. 1328, pp. 68–95. Springer, Heidelberg (1997). https://doi.org/10.1007/BFb0052152

    Chapter  Google Scholar 

  18. Stabler, E.: Top-down recognizers for MCFGs and MGs. In: Proceedings of the 2nd Workshop on Cognitive Modeling and Computational Linguistics. pp. 39–48. Association for Computational Linguistics, Portland, June 2011. http://www.aclweb.org/anthology/W11-0605

  19. Stabler, E.: MG TD beam parser (2012). https://github.com/epstabler/mgtdb

  20. Stabler, E.: Two models of minimalist, incremental syntactic analysis. Top. Cogn. Sci. 5(3), 611–633 (2013). https://doi.org/10.1111/tops.12031

    Article  Google Scholar 

  21. Thomas, J., Antony, P.J., Balapradeep, K.N., Mithun, K.D., Maiya, N.: Natural language compiler for English and Dravidian languages. In: Shetty, N.R., Prasad, N.H., Nalini, N. (eds.) Emerging Research in Computing, Information, Communication and Applications, pp. 313–323. Springer, New Delhi (2015). https://doi.org/10.1007/978-81-322-2550-8_31

    Chapter  Google Scholar 

  22. Zúñiga, A., Sierra, G., Bel-Enguix, G., Galicia-Haro, S.N.: Towards a natural language compiler: the compiler prototype implementation. http://www.pcic.unam.mx/~zuniga.a/nlmgcomp.tgz

Download references

Acknowledgments

We sincerely thank Edward Stabler for allowing us to freely modify his MG top-down beam parser implementation, for getting acquainted with some of the most recent MG parsing advances and for encouraging us to develop a Spanish MG. This work was supported by the Mexican Council of Science and Technology (CONACYT), fund 2016-01-2225, and DGAPA, fund IN403016.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Angel Zúñiga .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Zúñiga, A., Sierra, G., Bel-Enguix, G., Galicia-Haro, S.N. (2018). Towards a Natural Language Compiler. In: Batyrshin, I., Martínez-Villaseñor, M., Ponce Espinosa, H. (eds) Advances in Computational Intelligence. MICAI 2018. Lecture Notes in Computer Science(), vol 11289. Springer, Cham. https://doi.org/10.1007/978-3-030-04497-8_6

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-04497-8_6

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-04496-1

  • Online ISBN: 978-3-030-04497-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics