Data Generation for Neural Programming by Example

Clymo, Judith; Manukian, Haik; Fijalkow, Nathanaël; Gascón, Adrià; Paige, Brooks

Computer Science > Machine Learning

arXiv:1911.02624 (cs)

[Submitted on 6 Nov 2019]

Title:Data Generation for Neural Programming by Example

Authors:Judith Clymo, Haik Manukian, Nathanaël Fijalkow, Adrià Gascón, Brooks Paige

View PDF

Abstract:Programming by example is the problem of synthesizing a program from a small set of input / output pairs. Recent works applying machine learning methods to this task show promise, but are typically reliant on generating synthetic examples for training. A particular challenge lies in generating meaningful sets of inputs and outputs, which well-characterize a given program and accurately demonstrate its behavior. Where examples used for testing are generated by the same method as training data then the performance of a model may be partly reliant on this similarity. In this paper we introduce a novel approach using an SMT solver to synthesize inputs which cover a diverse set of behaviors for a given program. We carry out a case study comparing this method to existing synthetic data generation procedures in the literature, and find that data generated using our approach improves both the discriminatory power of example sets and the ability of trained machine learning models to generalize to unfamiliar data.

Subjects:	Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Programming Languages (cs.PL); Machine Learning (stat.ML)
Cite as:	arXiv:1911.02624 [cs.LG]
	(or arXiv:1911.02624v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1911.02624

Submission history

From: Nathanaël Fijalkow [view email]
[v1] Wed, 6 Nov 2019 20:57:03 UTC (4,308 KB)

Computer Science > Machine Learning

Title:Data Generation for Neural Programming by Example

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Data Generation for Neural Programming by Example

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators