Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article
Open access

GenSQL: A Probabilistic Programming System for Querying Generative Models of Database Tables

Published: 20 June 2024 Publication History

Abstract

This article presents GenSQL, a probabilistic programming system for querying probabilistic generative models of database tables. By augmenting SQL with only a few key primitives for querying probabilistic models, GenSQL enables complex Bayesian inference workflows to be concisely implemented. GenSQL’s query planner rests on a unified programmatic interface for interacting with probabilistic models of tabular data, which makes it possible to use models written in a variety of probabilistic programming languages that are tailored to specific workflows. Probabilistic models may be automatically learned via probabilistic program synthesis, hand-designed, or a combination of both. GenSQL is formalized using a novel type system and denotational semantics, which together enable us to establish proofs that precisely characterize its soundness guarantees. We evaluate our system on two case real-world studies—an anomaly detection in clinical trials and conditional synthetic data generation for a virtual wet lab—and show that GenSQL more accurately captures the complexity of the data as compared to common baselines. We also show that the declarative syntax in GenSQL is more concise and less error-prone as compared to several alternatives. Finally, GenSQL delivers a 1.7-6.8x speedup compared to its closest competitor on a representative benchmark set and runs in comparable time to hand-written code, in part due to its reusable optimizations and code specialization.

References

[1]
Ryan P. Adams, Hanna Wallach, and Zoubin Ghahramani. 2010. Learning the Structure of Deep Sparse Graphical Models. In Proceedings of the 13th International Conference on Artificial Intelligence and Statistics. PMLR, Norfolk, MA, USA. 1–8.
[2]
Sameer Agarwal, Barzan Mozafari, Aurojit Panda, Henry Milner, Samuel Madden, and Stoica Ion. 2013. BlinkDB: Queries with bounded errors and bounded response times on very large data. In Proceedings of the 8th ACM European Conference on Computer Systems. ACM, New York, NY, USA. 29–42. https://doi.org/10.1145/2465351.2465355
[3]
Vince Bárány, Balder Ten Cate, Benny Kimelfeld, Dan Olteanu, and Zografoula Vagena. 2017. Declarative Probabilistic Programming with Datalog. ACM Transactions on Database Systems, 42, 4 (2017), 1–35. https://doi.org/10.1145/3132700
[4]
Luc Bauwens, Michel Lubrano, and Jean-Francois Richard. 2000. Bayesian Inference in Dynamic Econometric Models. Oxford University Press, Oxford, UK.
[5]
Véronique Benzaken and Evelyne Contejean. 2019. A Coq Mechanised Formal Semantics For Realistic SQL Queries: Formally Reconciling SQL and Bag Relational Algebra. In Proceedings of the 8th ACM SIGPLAN International Conference on Certified Programs and Proofs. ACM, New York, NY, USA. 249–261. https://doi.org/10.1145/3293880.3294107
[6]
James Bergstra, Brent Komer, Chris Eliasmith, Dan Yamins, and David D Cox. 2015. Hyperopt: A Python Library for Model Selection And Hyperparameter Optimization. Computational Science & Discovery, 8, 1 (2015), 014008. https://doi.org/10.1088/1749-4699/8/1/014008
[7]
Patrick Billingsley. 1995. Probability and Measure (3rd ed.). John Wiley & Sons, New York.
[8]
Eli Bingham, Jonathan P Chen, Martin Jankowiak, Fritz Obermeyer, Neeraj Pradhan, Theofanis Karaletsos, Rohit Singh, Paul Szerlip, Paul Horsfall, and Noah D Goodman. 2019. Pyro: Deep Universal Probabilistic Programming. Journal of Machine Learning Research, 20, 1 (2019), 973–978.
[9]
Frederick R. Blattner. 1997. The Complete Genome Sequence of Escherichia Coli K-12. Science, 277, 5331 (1997), 1453–1462. https://doi.org/10.1126/science.277.5331.1453
[10]
Johannes Borgström, Andrew D. Gordon, Long Ouyang, Claudio Russo, Adam Scibior, and Marcin Szymczak. 2016. Fabular: Regression Formulas as Probabilistic Programming. In Proceedings of the 43rd Annual ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages. ACM, New York, NY, USA. 271–283. https://doi.org/10.1145/2837614.2837653
[11]
Joachim Borya. 2023. Formalisation of Relational Algebra and a SQL-like Language with the RISCAL Model Checker. Research Institute for Symbolic Computation (RISC), Johannes Kepler University Linz. https://doi.org/10.35011/risc.23-06
[12]
Zhuhua Cai, Zografoula Vagena, Luis Perez, Subramanian Arumugam, Peter J. Haas, and Christopher Jermaine. 2013. Simulation of Database-Valued Markov Chains Using SimSQL. In Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data. ACM, New York, NY, USA. 637–648. https://doi.org/10.1145/2463676.2465283
[13]
José Cambronero, John K. Feser, Micah J. Smith, and Samuel Madden. 2017. Query Optimization for Dynamic Imputation. Proceedings of the VLDB Endowment, 10, 11 (2017), 1310–1321. https://doi.org/10.14778/3137628.3137641
[14]
Bob Carpenter, Andrew Gelman, Matthew D. Hoffman, Daniel Lee, Ben Goodrich, Michael Betancourt, Marcus A. Brubaker, Jiqiang Guo, Peter Li, and Allen Riddell. 2017. Stan: A Probabilistic Programming Language. Journal of Statistical Software, 76, 1 (2017), 1–32. https://doi.org/10.18637/jss.v076.i01
[15]
CDC. 2022. BMI Percentile Calculator: Body Mass Indexes in the United States. https://dqydj.com/bmi-percentile-calculator-united-states
[16]
Nicholas G. Charchut. 2020. Implementation of a Cross-Platform Automated Bayesian Data Modeling System. Master’s thesis. Massachusetts Institute of Technology. Cambridge, MA.
[17]
Sarah Chasins and Phitchaya M. Phothilimthana. 2017. Data-driven synthesis of full probabilistic programs. In Proceedings of the 29th International Conference on Computer Aided Verification. Springer, Cham. 279–304. https://doi.org/10.1007/978-3-319-63387-9_14
[18]
Nicolas Chopin and Omiros Papaspiliopoulos. 2020. An Introduction to Sequential Monte Carlo. Springer, Cham. https://doi.org/10.1007/978-3-030-47845-2
[19]
Edgar F. Codd. 1970. A Relational Model of Data for Large Shared Data Banks. Commun. ACM, 13, 6 (1970), 377–387. https://doi.org/10.1145/362384.362685
[20]
Marco F. Cusumano-Towner, Feras A. Saad, Alexander K. Lew, and Vikash K. Mansinghka. 2019. Gen: A general-purpose probabilistic programming system with programmable inference. In Proceedings of the 40th ACM SIGPLAN Conference on Programming Language Design and Implementation. ACM, New York, NY, USA. 221–236. https://doi.org/10.1145/3314221.3314642
[21]
Swaraj Dash and Sam Staton. 2021. Monads for Measurable Queries in Probabilistic Databases. arxiv:2112.14048.
[22]
Sebastian Eck and Wolfgang Stephan. 2008. Determining the Relationship of Gene Expression and Global mRNA Stability in Drosophila Melanogaster And Escherichia Coli Using Linear Models. Gene, 424, 1-2 (2008), 102–107. https://doi.org/10.1016/j.gene.2008.07.033
[23]
Nick Erickson, Jonas Mueller, Alexander Shirkov, Hang Zhang, Pedro Larroy, Mu Li, and Alexander Smola. 2020. AutoGluon-Tabular: Robust and Accurate AutoML for Structured Data. arxiv:2003.06505.
[24]
D. H. Fremlin. 2001. Measure Theory. 2, Torres Fremlin, Colchester, England.
[25]
Jonah Gabry, Daniel Simpson, Aki Vehtari, Michael Betancourt, and Andrew Gelman. 2019. Visualization in Bayesian Workflow. Journal of the Royal Statistical Society Series A: Statistics in Society, 182, 2 (2019), 389–402. https://doi.org/10.1111/rssa.12378
[26]
Hong Ge, Kai Xu, and Zoubin Ghahramani. 2018. Turing: A Language for Flexible Probabilistic Inference. In Proceedings of the 21st International Conference on Artificial Intelligence and Statistics. PMLR, Norfolk, MA, USA. 1682–1690.
[27]
Timon Gehr, Sasa Misailovic, and Martin Vechev. 2016. PSI: Exact Symbolic Inference for Probabilistic Programs. In Proceedings of the 28th International Conference on Computer Aided Verification. Springer, Cham. 62–83. https://doi.org/10.1007/978-3-319-41528-4_4
[28]
Andrew Gelman, John B. Carlin, Hal S. Stern, and Donald B. Rubin. 2014. Bayesian Data Analysis (3rd ed.). CRC Press, Boca Raton. https://doi.org/10.1201/9780429258411
[29]
Andrew Gelman, Aki Vehtari, Daniel Simpson, Charles C. Margossian, Bob Carpenter, Yuling Yao, Lauren Kennedy, Jonah Gabry, Paul-Christian Bürkner, and Martin Modrák. 2020. Bayesian Workflow. arxiv:2011.01808.
[30]
Robert Gens and Pedro Domingos. 2013. Learning the Structure of Sum-Product Networks. In Proceedings of the 30th International Conference on Machine Learning. PMLR, Norfolk, MA, USA. 873–880.
[31]
Alan Genz and Frank Bretz. 2009. Computation of Multivariate Normal and t Probabilities (Lecture Notes in Statistics, Vol. 195). Springer, Cham. https://doi.org/10.1007/978-3-642-01689-9
[32]
Michele Giry. 2006. A Categorical Approach To Probability Theory. In Categorical Aspects of Topology and Analysis: Proceedings of an International Conference. Springer, Cham. 68–85. https://doi.org/10.1007/BFb0092872
[33]
Andrew D. Gordon, Thore Graepel, Nicolas Rolland, Claudio Russo, Johannes Borgström, and John Guiver. 2014. Tabular: A Schema-Driven Probabilistic Programming Language. In Proceedings of the 41st ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages. ACM, New York, NY, USA. 321–334. https://doi.org/10.1145/2535838.2535850
[34]
Martin Grohe, Benjamin Lucien Kaminski, Joost-Pieter Katoen, and Peter Lindner. 2022. Generative Datalog with Continuous Distributions. J. ACM, 69, 6 (2022), 1–52. https://doi.org/10.1145/3559102
[35]
Martin Grohe and Peter Lindner. 2022. Infinite Probabilistic Databases. Logical Methods in Computer Science, 18, 1 (2022), Article 34, 43 pages. https://doi.org/10.46298/LMCS-18(1:34)2022
[36]
Roger Grosse, Ruslan Salakhutdinov, William Freeman, and Joshua B. Tenenbaum. 2012. Exploiting compositionality to explore a large space of model structures. In Proceedings of the 28th Annual Conference on Uncertainty in Artificial Intelligence. AUAI Press, Puyallup, WA, USA. 306–315.
[37]
Allen Hatcher. 2002. Algebraic Topology. Cambridge University Press, Cambridge, UK.
[38]
Miguel A. Hernán and James M. Robins. 2006. Estimating Causal Effects from Epidemiological Data. Journal of Epidemiology & Community Health, 60, 7 (2006), 578–586. https://doi.org/10.1136/jech.2004.029496
[39]
Benjamin Hilprecht, Andreas Schmidt, Moritz Kulessa, Alejandro Molina, Kristian Kersting, and Carsten Binnig. 2020. DeepDB: Learn from Data, Not from Queries!. Proc. VLDB Endow., 13, 7 (2020), 992–1005. https://doi.org/10.14778/3384345.3384349
[40]
Steven Holtzen, Guy Van den Broeck, and Todd Millstein. 2020. Scaling Exact Inference for Discrete Probabilistic Programs. Proc. ACM Program. Lang., 4, OOPSLA (2020), Article 140, 31 pages. https://doi.org/10.1145/3133904
[41]
Irvin Hwang, Andreas Stuhlmüller, and Noah D Goodman. 2011. Inducing Probabilistic Programs by Bayesian Program Merging. arxiv:1110.5667.
[42]
Ravi Jampani, Fei Xu, Mingxi Wu, Luis Perez, Chris Jermaine, and Peter J. Haas. 2011. The Monte Carlo Database System: Stochastic Analysis Close to the Data. ACM Transactions on Database Systems, 36, 3 (2011), 1–41. https://doi.org/10.1145/2000824.2000828
[43]
Haifeng Jin, Qingquan Song, and Xia Hu. 2019. Auto-Keras: An Efficient Neural Architecture Search System. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. ACM, New York, NY, USA. 1946–1956. https://doi.org/10.1145/3292500.3330648
[44]
Ingrid M Keseler, Socorro Gama-Castro, Amanda Mackie, Peter E Midford, Alan J Wolfe, Julio Collado-Vides, Ian T Paulsen, and Peter D Karp. 2021. The EcoCyc Database in 2021. Frontiers in Microbiology, 12 (2021), 711077. https://doi.org/10.3389/fmicb.2021.711077
[45]
Erin LeDell and Sebastien Poirier. 2020. H2O AutoML: Scalable Automatic Machine Learning. In Proceedings of the 7th ICML Workshop on AutoML. AutoML-Conf, 16 pages.
[46]
Alexander Lew, Monica Agrawal, David Sontag, and Vikash Mansinghka. 2021. PClean: Bayesian Data Cleaning at Scale With Domain-Specific Probabilistic Programming. In Proceedings of The 24th International Conference on Artificial Intelligence and Statistics. PMLR, Norfolk, MA, USA. 1927–1935.
[47]
Alexander K Lew, Matin Ghavamizadeh, Martin C Rinard, and Vikash K Mansinghka. 2023. Probabilistic Programming with Stochastic Probabilities. Proceedings of the ACM on Programming Languages, 7, PLDI (2023), 1708–1732. https://doi.org/10.1145/3591290
[48]
Brian K. Lohman, Jesse N. Weber, and Daniel I. Bolnick. 2016. Evaluation Of Tagseq, A Reliable Low-Cost Alternative for RNA Seq. Molecular Ecology Resources, 16, 6 (2016), 1315–1321. https://doi.org/10.1111/1755-0998.12529
[49]
David JC MacKay. 1998. Introduction to Gaussian Processes. NATO ASI series F computer and systems sciences, 168 (1998), 133–166.
[50]
Vikash Mansinghka, Patrick Shafto, Eric Jonas, Cap Petschulat, Max Gasner, and Joshua B. Tenenbaum. 2016. CrossCat: A Fully Bayesian Nonparametric Method for Analyzing Heterogeneous, High Dimensional Data. Journal of Machine Learning Research, 17, 138 (2016), 1–49.
[51]
Vikash K. Mansinghka, Ulrich Schaechtle, Shivam Handa, Alexey Radul, Yutian Chen, and Martin Rinard. 2018. Probabilistic programming with programmable inference. In Proceedings of the 39th ACM SIGPLAN Conference on Programming Language Design and Implementation. ACM, New York, NY, USA. 603–616. https://doi.org/10.1145/3192366.3192409
[52]
Vikash K. Mansinghka, Richard Tibbetts, Jay Baxter, Pat Shafto, and Baxter Eaves. 2015. BayesDB: A Probabilistic Programming System for Querying the Probable Implications of Data. arxiv:1512.05006.
[53]
Maysam Mansouri and Martin Fussenegger. 2022. Therapeutic Cell Engineering: Designing Programmable Synthetic Genetic Circuits in Mammalian Cells. Protein & Cell, 13, 7 (2022), 476–489. https://doi.org/10.1007/s13238-021-00876-1
[54]
Brian Milch, Bhaskara Marthi, Stuart Russell, David Sontag, Daniel L. Ong, and Sontag Kolobov. 2005. BLOG: Probabilistic models with unknown objects. In Proceedings of the 19th International Joint Conference on Artificial Intelligence. Morgan Kaufmann, San Francisco, CA, USA. 1352–1359.
[55]
Kevin P. Murphy. 2022. Probabilistic Machine Learning: An Introduction. MIT Press, Cambridge, MA.
[56]
Alec A. K. Nielsen, Bryan S. Der, Jonghyeon Shin, Prashant Vaidyanathan, Vanya Paralanov, Elizabeth A. Strychalski, David Ross, Douglas Densmore, and Christopher A. Voigt. 2016. Genetic circuit design automation. Science, 352, 6281 (2016), aac7341. https://doi.org/10.1126/science.aac7341
[57]
Aditya V. Nori, Sherjil Ozair, Sriram K. Rajamani, and Deepak Vijaykeerthy. 2015. Efficient synthesis of probabilistic programs. In Proceedings of the 36th ACM SIGPLAN Conference on Programming Language Design and Implementation. ACM, New York, NY, USA. 208–217. https://doi.org/10.1145/2737924.2737982
[58]
Randal S. Olson, Ryan J. Urbanowicz, Peter C. Andrews, Nicole A. Lavender, La Creis Kidd, and Jason H. Moore. 2016. Automating Biomedical Data Science Through Tree-Based Pipeline Optimization. In Applications of Evolutionary Computation. Springer, Cham. 123–137. https://doi.org/10.1007/978-3-319-31204-0_9
[59]
Judea Pearl. 1988. Probabilistic Reasoning In Intelligent Systems: Networks Of Plausible Inference. Morgan Kaufmann, San Francisco, CA, USA.
[60]
Fabian Pedregosa. 2011. Scikit-learn: Machine Learning in Python. Journal of Machine Learning research, 12 (2011), 2825–2830.
[61]
MIT Probabilistic Computing Project. 2024. Gen.clj. https://github.com/probcomp/Gen.clj
[62]
Tom Rainforth. 2018. Nesting Probabilistic Programs. In Proceedings of the 34th Annual Conference on Uncertainty in Artificial Intelligence. AUAI Press, Puyallup, WA, USA. 10 pages. arxiv:1803.06328.
[63]
Theodoros Rekatsinas, Xu Chu, Ihab F Ilyas, and Christopher Ré. 2017. HoloClean: Holistic Data Repairs with Probabilistic Inference. Proceedings of the VLDB Endowment, 10, 11 (2017), 1190–1201. https://doi.org/10.14778/3137628.3137631
[64]
Christian P. Robert and George Casella. 2004. Monte Carlo Statistical Methods (2 ed.). Springer, Cham. https://doi.org/10.1007/978-1-4757-4145-2
[65]
Feras Saad, Leonardo Casarsa, and Vikash Mansinghka. 2017. Probabilistic Search for Structured Data via Probabilistic Programming and Nonparametric Bayes. arxiv:1704.01087.
[66]
Feras Saad and Vikash Mansinghka. 2016. Probabilistic Data Analysis with Probabilistic Programming. arxiv:1608.05347.
[67]
Feras Saad and Vikash Mansinghka. 2017. Detecting Dependencies in Sparse, Multivariate Databases using Probabilistic Programming and Non-Parametric Bayes. In Proceedings of the 20th International Conference on Artificial Intelligence and Statistics. PMLR, Norfolk, MA, USA. 632–641.
[68]
Feras Saad and Vikash K. Mansinghka. 2016. A Probabilistic Programming Approach to Probabilistic Data Analysis. In Advances in Neural Information Processing Systems. 29, Curran Associates, Inc., Red Hook, NY, USA. 2011–2019.
[69]
Feras A. Saad, Marco F. Cusumano-Towner, Ulrich Schaechtle, Martin C. Rinard, and Vikash K. Mansinghka. 2019. Bayesian Synthesis of Probabilistic Programs for Automatic Data Modeling. Proceedings of the ACM on Programming Languages, 3, POPL (2019), Article 37, 29 pages. https://doi.org/10.1145/3290350
[70]
Feras A. Saad and Vikash K. Mansinghka. 2018. Temporally-Reweighted Chinese Restaurant Process Mixtures for Clustering, Imputing, and Forecasting Multivariate Time Series. In Proceedings of the 21st International Conference on Artificial Intelligence and Statistics. PMLR, Norfolk, MA, USA. 755–764.
[71]
Feras A. Saad and Vikash K. Mansinghka. 2021. Hierarchical Infinite Relational Model. In Proceedings of the 37th Conference on Uncertainty in Artificial Intelligence. PMLR, Norfolk, MA, USA. 1067–1077.
[72]
Feras A. Saad, Brian J. Patton, Matthew D. Hoffmann, Rif A. Saurous, and Vikash K. Mansinghka. 2023. Sequential Monte Carlo Learning for Time Series Structure Discovery. In Proceedings of the 40th International Conference on Machine Learning. PMLR, Norfolk, MA, USA. Article 1226, 17 pages.
[73]
Feras A. Saad, Martin C. Rinard, and Vikash K. Mansinghka. 2021. SPPL: Probabilistic Programming with Fast Exact Symbolic Inference. In Proceedings of the 42nd ACM SIGPLAN International Conference on Programming Language Design and Implementation. ACM, New York, NY, USA. 804–819. https://doi.org/10.1145/3453483.3454078
[74]
Feras A. K. Saad. 2022. Scalable Structure Learning, Inference, and Analysis with Probabilistic Programs. Ph. D. Dissertation. Massachusetts Institute of Technology.
[75]
John Salvatier, Thomas V. Wiecki, and Christopher Fonnesbeck. 2016. Probabilistic Programming in Python using PyMC3. PeerJ Computer Science, 2 (2016), e55. https://doi.org/10.7717/peerj-cs.55
[76]
Ulrich Schaechtle, Cameron Freer, Zane Shelby, Feras Saad, and Vikash Mansinghka. 2022. Bayesian AutoML for Databases via the InferenceQL Probabilistic Programming System. In Proceedings of the 1st Conference on Automated Machine Learning (Late-Breaking Workshop). AutoML-Conf, 8 pages.
[77]
Ulrich Schaechtle and Mathieu Huot. 2024. PLDI artifact evaluation for "PSQL: A Probabilistic Programming System for Automating Inference over Databases and Probabilistic Models. Zenodo. https://doi.org/10.5281/zenodo.10949799
[78]
Chung-chieh Shan and Norman Ramsey. 2017. Exact Bayesian Inference by Symbolic Disintegration. In Proceedings of the 44th ACM SIGPLAN Symposium on Principles of Programming Languages. ACM, New York, NY, USA. 130–144. https://doi.org/10.1145/3009837.3009852
[79]
Dan Suciu, Dan Olteanu, Christopher Ré, and Christoph Koch. 2011. Probabilistic Databases. Springer, Cham. https://doi.org/10.1007/978-3-031-01879-4
[80]
Chris Thornton, Frank Hutter, Holger H. Hoos, and Kevin Leyton-Brown. 2013. Auto-WEKA: Combined Selection and Hyperparameter Optimization Of Classification Algorithms. In Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. ACM, New York, NY, USA. 847–855. https://doi.org/10.1145/2487575.2487629
[81]
Guy Van den Broeck and Dan Suciu. 2017. Query Processing on Probabilistic Data: A Survey. Foundations and Trends in Databases, 7, 3-4 (2017), 197–341. https://doi.org/10.1561/1900000052
[82]
Baojun Wang, Mauricio Barahona, and Martin Buck. 2013. A Modular Cell-Based Biosensor Using Engineered Genetic Logic Circuits to Detect and Integrate Multiple Environmental Signals. Biosensors and Bioelectronics, 40, 1 (2013), 368–376. https://doi.org/10.1016/j.bios.2012.08.011
[83]
WHO. 2023. World Health Organization: Obesity. https://www.who.int/health-topics/obesity
[84]
Darren J. Wilkinson. 2018. Stochastic Modelling for Systems Biology. CRC Press, Boca Raton, FL, USA.
[85]
Frank Wood, Jan Willem Meent, and Vikash Mansinghka. 2014. A New Approach To Probabilistic Programming Inference. In Proceedings of the 17th International Conference on Artificial Intelligence and Statistics. PMLR, Norfolk, MA, USA. 1024–1032.
[86]
Yi Wu, Siddharth Srivastava, Nicholas Hay, Simon Du, and Stuart Russell. 2018. Discrete-Continuous Mixtures in Probabilistic Programming: Generalized Semantics and Inference Algorithms. In Proceedings of the 35th International Conference on Machine Learning. PMLR, Norfolk, MA, USA. 5343–5352.
[87]
xCures. 2019. BEAT19 Behavior, Environment And Treatments for COVID-19. https://classic.clinicaltrials.gov/ct2/show/results/NCT04321811?view = results
[88]
Lei Xu, Maria Skoularidou, Alfredo Cuesta-Infante, and Kalyan Veeramachaneni. 2019. Modeling Tabular Data using Conditional GAN. In Advances in Neural Information Processing Systems. 32, Curran Associates, Inc., Red Hook, NY, USA. 11 pages.
[89]
Yubin Xue, Pei Du, Amal Amin Ibrahim Shendi, and Bo Yu. 2022. Mercury Bioremediation in Aquatic Environment by Genetically Modified Bacteria with Self-Controlled Biosecurity Circuit. Journal of Cleaner Production, 337 (2022), 130524. https://doi.org/10.1016/j.jclepro.2022.130524
[90]
Fabian Zaiser, Andrzej Murawski, and Chih-Hao Luke Ong. 2023. Exact Bayesian Inference on Discrete Models via Probability Generating Functions: A Probabilistic Programming Approach. In Advances in Neural Information Processing Systems. 36, Curran Associates, Inc., Red Hook, NY, USA. 2427–2462.

Cited By

View all
  • (2024)Building machines that learn and think with peopleNature Human Behaviour10.1038/s41562-024-01991-98:10(1851-1863)Online publication date: 22-Oct-2024

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Proceedings of the ACM on Programming Languages
Proceedings of the ACM on Programming Languages  Volume 8, Issue PLDI
June 2024
2198 pages
EISSN:2475-1421
DOI:10.1145/3554317
Issue’s Table of Contents
This work is licensed under a Creative Commons Attribution International 4.0 License.

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 20 June 2024
Published in PACMPL Volume 8, Issue PLDI

Permissions

Request permissions for this article.

Check for updates

Badges

Author Tags

  1. AutoML
  2. Bayesian data analysis
  3. generative modeling
  4. probabilistic programming
  5. query language
  6. semantics and correctness

Qualifiers

  • Research-article

Funding Sources

  • DARPA Synergistic Discovery and Design Program
  • DARPA Machine Common Sense Program
  • DARPA Compositionally Organized Learning To Reason About Novel Experiences grant

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)4,565
  • Downloads (Last 6 weeks)194
Reflects downloads up to 26 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Building machines that learn and think with peopleNature Human Behaviour10.1038/s41562-024-01991-98:10(1851-1863)Online publication date: 22-Oct-2024

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Full Access

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media