Abstract
Being able to create a natural language compiler has been one of the most sought-after goals to reach since the very beginning of artificial intelligence. Since then; however, it has been an elusive and difficult task to achieve to the extent of being considered almost impossible to perform. In this article, we present a promising path by using a grammar formalism which attempts to model natural language; in principle, by using minimalist grammars as one of the last proposed instances of formalism of this type. The main idea consists in creating a parser based on this type of grammars which could recognize and analyze the text (or input program) written in natural language and use this parser as a front-end of a compiler. Then, for the rest of the compilation process, utilize the usual phases of a classic compiler of a programming language. Moreover, we present a prototype of a natural language compiler whose specific language is that of arithmetic expressions, in order to show with evidence that it is indeed possible to implement it, that is to say, to put the proposed compiler design into practice, showing in this manner that it is actually possible to create a natural language compiler following this promising path.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
We believe that it makes sense to start with easy to process and well-defined natural language subsets and subsequently to cover more complex challenging ones.
References
Aho, A.V., Lam, M.S., Sethi, R., Ullman, J.D.: Compilers: Principles, Techniques, and Tools, 2nd edn. Addison-Wesley, Boston (2006)
Aho, A.V., Ullman, J.D.: The Theory of Parsing, Translation, and Compiling. Prentice-Hall Inc, Upper Saddle River (1972)
Boullier, P.: Proposal for a natural language processing syntactic backbone. Research report RR-3342, INRIA (1998). https://hal.inria.fr/inria-00073347
Boullier, P.: Range Concatenation Grammars, pp. 269–289. Springer, Dordrecht (2005). https://doi.org/10.1007/1-4020-2295-6_13
Chomsky, N.: The Minimalist Program. Current Studies in Linguistics, MIT (1995)
Cisco Systems, Inc: Chez scheme. https://github.com/cisco/chezscheme
Ghuloum, A.: Ikarus scheme. http://ikarus-scheme.org/
Harkema, H.: A recognizer for minimalist grammars, vol. 2000, pp. 111–122, Trento, Italy (2000)
Iacob, R., Rebedea, T., Trausan-Matu, S.: NLCP: Towards a compiler for natural language. In: 2017 21st International Conference on Control Systems and Computer Science (CSCS), pp. 252–259, May 2017. https://doi.org/10.1109/CSCS.2017.42
Jäger, G., Rogers, J.: Formal language theory: refining the Chomsky hierarchy. Philos. Trans. R. Soc. Lond. B: Biol. Sci. 367(1598), 1956–1970 (2012). https://doi.org/10.1098/rstb.2012.0077
Joshi, A.K.: Tree adjoining grammars: how much context-sensitivity is required to provide reasonable structural descriptions?, pp. 206–250. Studies in Natural Language Processing, Cambridge University Press, Cambridge (1985). https://doi.org/10.1017/CBO9780511597855.007
Joshi, A.K., Shanker, K.V., Weir, D.: The convergence of mildly context-sensitive grammar formalisms. Tech. report MS-CIS-09-01, University of Pennsylvania, January 1990. https://repository.upenn.edu/cgi/viewcontent.cgi?article=1571&context=cis_reports
Kallmeyer, L.: Parsing Beyond Context-Free Grammars. Cognitive Technologies. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-14846-0
Pullum, G.K., Gazdar, G.: Natural languages and context-free languages. Linguist. Philos. 4(4), 471–504 (1982). https://doi.org/10.1007/BF00360802
Seki, H., Matsumura, T., Fujii, M., Kasami, T.: On multiple context-free grammars. Theor. Comput. Sci. 88(2), 191–229 (1991). https://doi.org/10.1016/0304-3975(91)90374-B
Sørensen, M.H., Urzyczyin, P.: Lectures on the Curry-Howard Isomorphism. Studies in Logic and the Foundations of Mathematics, vol. 149. Elsevier, Amsterdam (2006). https://www.sciencedirect.com/bookseries/studies-in-logic-and-the-foundations-of-mathematics/vol/149/suppl/C
Stabler, E.: Derivational minimalism. In: Retoré, C. (ed.) LACL 1996. LNCS, vol. 1328, pp. 68–95. Springer, Heidelberg (1997). https://doi.org/10.1007/BFb0052152
Stabler, E.: Top-down recognizers for MCFGs and MGs. In: Proceedings of the 2nd Workshop on Cognitive Modeling and Computational Linguistics. pp. 39–48. Association for Computational Linguistics, Portland, June 2011. http://www.aclweb.org/anthology/W11-0605
Stabler, E.: MG TD beam parser (2012). https://github.com/epstabler/mgtdb
Stabler, E.: Two models of minimalist, incremental syntactic analysis. Top. Cogn. Sci. 5(3), 611–633 (2013). https://doi.org/10.1111/tops.12031
Thomas, J., Antony, P.J., Balapradeep, K.N., Mithun, K.D., Maiya, N.: Natural language compiler for English and Dravidian languages. In: Shetty, N.R., Prasad, N.H., Nalini, N. (eds.) Emerging Research in Computing, Information, Communication and Applications, pp. 313–323. Springer, New Delhi (2015). https://doi.org/10.1007/978-81-322-2550-8_31
Zúñiga, A., Sierra, G., Bel-Enguix, G., Galicia-Haro, S.N.: Towards a natural language compiler: the compiler prototype implementation. http://www.pcic.unam.mx/~zuniga.a/nlmgcomp.tgz
Acknowledgments
We sincerely thank Edward Stabler for allowing us to freely modify his MG top-down beam parser implementation, for getting acquainted with some of the most recent MG parsing advances and for encouraging us to develop a Spanish MG. This work was supported by the Mexican Council of Science and Technology (CONACYT), fund 2016-01-2225, and DGAPA, fund IN403016.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Switzerland AG
About this paper
Cite this paper
Zúñiga, A., Sierra, G., Bel-Enguix, G., Galicia-Haro, S.N. (2018). Towards a Natural Language Compiler. In: Batyrshin, I., Martínez-Villaseñor, M., Ponce Espinosa, H. (eds) Advances in Computational Intelligence. MICAI 2018. Lecture Notes in Computer Science(), vol 11289. Springer, Cham. https://doi.org/10.1007/978-3-030-04497-8_6
Download citation
DOI: https://doi.org/10.1007/978-3-030-04497-8_6
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-04496-1
Online ISBN: 978-3-030-04497-8
eBook Packages: Computer ScienceComputer Science (R0)