Towards Synthesizing Complex Programs from Input-Output Examples

Chen, Xinyun; Liu, Chang; Song, Dawn

Computer Science > Machine Learning

arXiv:1706.01284v3 (cs)

[Submitted on 5 Jun 2017 (v1), revised 11 Feb 2018 (this version, v3), latest version 8 Mar 2018 (v4)]

Title:Towards Synthesizing Complex Programs from Input-Output Examples

Authors:Xinyun Chen, Chang Liu, Dawn Song

View PDF

Abstract:In recent years, deep learning techniques have been developed to improve the performance of program synthesis from input-output examples. Albeit its significant progress, the programs that can be synthesized by state-of-the-art approaches are still simple in terms of their complexity. In this work, we move a significant step forward along this direction by proposing a new class of challenging tasks in the domain of program synthesis from input-output examples: learning a context-free parser from pairs of input programs and their parse trees. We show that this class of tasks are much more challenging than previously studied tasks, and the test accuracy of existing approaches is almost 0%.
We tackle the challenges by developing three novel techniques inspired by three novel observations, which reveal the key ingredients of using deep learning to synthesize a complex program. First, the use of a non-differentiable machine is the key to effectively restrict the search space. Thus our proposed approach learns a neural program operating a domain-specific non-differentiable machine. Second, recursion is the key to achieve generalizability. Thus, we bake-in the notion of recursion in the design of our non-differentiable machine. Third, reinforcement learning is the key to learn how to operate the non-differentiable machine, but it is also hard to train the model effectively with existing reinforcement learning algorithms from a cold boot. We develop a novel two-phase reinforcement learning-based search algorithm to overcome this issue. In our evaluation, we show that using our novel approach, neural parsing programs can be learned to achieve 100% test accuracy on test inputs that are 500x longer than the training samples.

Comments:	Published as a conference paper at ICLR 2018
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Programming Languages (cs.PL)
Cite as:	arXiv:1706.01284 [cs.LG]
	(or arXiv:1706.01284v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1706.01284

Submission history

From: Xinyun Chen [view email]
[v1] Mon, 5 Jun 2017 11:44:35 UTC (1,127 KB)
[v2] Tue, 30 Jan 2018 04:54:32 UTC (1,121 KB)
[v3] Sun, 11 Feb 2018 04:33:30 UTC (1,121 KB)
[v4] Thu, 8 Mar 2018 00:22:59 UTC (1,121 KB)

Computer Science > Machine Learning

Title:Towards Synthesizing Complex Programs from Input-Output Examples

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Towards Synthesizing Complex Programs from Input-Output Examples

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators