Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Trinity: an extensible synthesis framework for data science

Published: 01 August 2019 Publication History
  • Get Citation Alerts
  • Abstract

    In this demo paper, we introduce Trinity, a general-purpose framework that can be used to quickly build domain-specific program synthesizers for automating many tedious tasks that arise in data science. We illustrate how Trinity can be used by three different users: First, we show how end-users can use Trinity's built-in synthesizers to automate data wrangling tasks. Second, we show how advanced users can easily extend existing synthesizers to support additional functionalities. Third, we show how synthesis experts can change the underlying search engine in Trinity. Overall, this paper is intended to demonstrate how users can quickly use, modify, and extend the Trinity framework with the goal of automating many tasks that are considered to be the "janitor" work of data science.

    References

    [1]
    M. Balog, A. L. Gaunt, M. Brockschmidt, S. Nowozin, and D. Tarlow. Deepcoder: Learning to write programs. In ICLR, 2017.
    [2]
    Y. Feng, R. Martins, O. Bastani, and I. Dillig. Program synthesis using conflict-driven learning. In PLDI, 2018.
    [3]
    Y. Feng, R. Martins, J. Van Geffen, I. Dillig, and S. Chaudhuri. Component-based synthesis of table consolidation and transformation tasks from examples. In PLDI, 2017.
    [4]
    S. Gulwani. Automating string processing in spreadsheets using input-output examples. In POPL, 2011.
    [5]
    W. R. Harris and S. Gulwani. Spreadsheet table transformations from examples. In PLDI, 2011.
    [6]
    Z. Jin, M. R. Anderson, M. Cafarella, and H. Jagadish. Foofah: Transforming data by example. In SIGMOD, 2017.
    [7]
    O. Polozov and S. Gulwani. Flashmeta: a framework for inductive program synthesis. In OOPSLA, 2015.
    [8]
    R. Singh and S. Gulwani. Synthesizing Number Transformations from Input-Output Examples. In CAV, 2012.
    [9]
    E. Torlak and R. Bodík. A lightweight symbolic virtual machine for solver-aided host languages. In PLDI, 2014.
    [10]
    C. Wang, A. Cheung, and R. Bodik. Synthesizing highly expressive sql queries from input-output examples. In PLDI, 2017.
    [11]
    X. Wang, I. Dillig, and R. Singh. Program synthesis using abstraction refinement. In POPL, 2018.
    [12]
    N. Yaghmazadeh, X. Wang, and I. Dillig. Automated migration of hierarchical data to relational tables using programming-by-example. PVLDB, 11(5):580--593, 2018.
    [13]
    N. Yaghmazadeh, Y. Wang, I. Dillig, and T. Dillig. Sqlizer: query synthesis from natural language. In OOPSLA, 2017.
    [14]
    S. Zhang and Y. Sun. Automatically synthesizing sql queries from input-output examples. In ASE, 2013.

    Cited By

    View all
    • (2024)Towards Efficient Data Wrangling with LLMs using Code GenerationProceedings of the Eighth Workshop on Data Management for End-to-End Machine Learning10.1145/3650203.3663334(62-66)Online publication date: 9-Jun-2024
    • (2024)Towards Reliable SQL Synthesis: Fuzzing-Based Evaluation and DisambiguationFundamental Approaches to Software Engineering10.1007/978-3-031-57259-3_11(232-254)Online publication date: 6-Apr-2024
    • (2023)A SQL Synthesis System with Operator HandlerProceedings of the 2023 7th International Conference on Computer Science and Artificial Intelligence10.1145/3638584.3638654(132-136)Online publication date: 8-Dec-2023
    • Show More Cited By

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image Proceedings of the VLDB Endowment
    Proceedings of the VLDB Endowment  Volume 12, Issue 12
    August 2019
    547 pages

    Publisher

    VLDB Endowment

    Publication History

    Published: 01 August 2019
    Published in PVLDB Volume 12, Issue 12

    Qualifiers

    • Research-article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)21
    • Downloads (Last 6 weeks)3

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Towards Efficient Data Wrangling with LLMs using Code GenerationProceedings of the Eighth Workshop on Data Management for End-to-End Machine Learning10.1145/3650203.3663334(62-66)Online publication date: 9-Jun-2024
    • (2024)Towards Reliable SQL Synthesis: Fuzzing-Based Evaluation and DisambiguationFundamental Approaches to Software Engineering10.1007/978-3-031-57259-3_11(232-254)Online publication date: 6-Apr-2024
    • (2023)A SQL Synthesis System with Operator HandlerProceedings of the 2023 7th International Conference on Computer Science and Artificial Intelligence10.1145/3638584.3638654(132-136)Online publication date: 8-Dec-2023
    • (2023)Programming by Example Made EasyACM Transactions on Software Engineering and Methodology10.1145/360718533:1(1-36)Online publication date: 7-Jul-2023
    • (2023)Automated Translation of Functional Big Data Queries to SQLProceedings of the ACM on Programming Languages10.1145/35860477:OOPSLA1(580-608)Online publication date: 6-Apr-2023
    • (2022)Visualization question answering using introspective program synthesisProceedings of the 43rd ACM SIGPLAN International Conference on Programming Language Design and Implementation10.1145/3519939.3523709(137-151)Online publication date: 9-Jun-2022
    • (2021)PATSQLProceedings of the VLDB Endowment10.14778/3476249.347625314:11(1937-1949)Online publication date: 1-Jul-2021
    • (2021)UDF to SQL translation through compositional lazy inductive synthesisProceedings of the ACM on Programming Languages10.1145/34854895:OOPSLA(1-26)Online publication date: 15-Oct-2021
    • (2021)Synthesizing data structure refinements from integrity constraintsProceedings of the 42nd ACM SIGPLAN International Conference on Programming Language Design and Implementation10.1145/3453483.3454063(574-587)Online publication date: 19-Jun-2021
    • (2021)SOARProceedings of the 43rd International Conference on Software Engineering10.1109/ICSE43902.2021.00023(112-124)Online publication date: 22-May-2021
    • Show More Cited By

    View Options

    Get Access

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media