Showing 1–47 of 47 results for author: Van de Meent, J

  1. arXiv:2406.04843  [pdf, other]

    cs.LG stat.ML

    Variational Flow Matching for Graph Generation

    Authors: Floor Eijkelboom, Grigory Bartosh, Christian Andersson Naesseth, Max Welling, Jan-Willem van de Meent

    Abstract: We present a formulation of flow matching as variational inference, which we refer to as variational flow matching (VFM). Based on this formulation we develop CatFlow, a flow matching method for categorical data. CatFlow is easy to implement, computationally efficient, and achieves strong results on graph generation tasks. In VFM, the objective is to approximate the posterior probability path, whi…

    Submitted 7 June, 2024; originally announced June 2024.

  2. arXiv:2405.19024  [pdf, other]

    cs.LG cs.AI cs.GT cs.MA

    Inverse Concave-Utility Reinforcement Learning is Inverse Game Theory

    Authors: Mustafa Mert Çelikok, Frans A. Oliehoek, Jan-Willem van de Meent

    Abstract: We consider inverse reinforcement learning problems with concave utilities. Concave Utility Reinforcement Learning (CURL) is a generalisation of the standard RL objective, which employs a concave function of the state occupancy measure, rather than a linear function. CURL has garnered recent attention for its ability to represent instances of many important applications including the standard RL s…

    Submitted 29 May, 2024; originally announced May 2024.

  3. arXiv:2403.09429  [pdf, other]

    stat.ML cs.LG

    VISA: Variational Inference with Sequential Sample-Average Approximations

    Authors: Heiko Zimmermann, Christian A. Naesseth, Jan-Willem van de Meent

    Abstract: We present variational inference with sequential sample-average approximation (VISA), a method for approximate inference in computationally intensive models, such as those based on numerical simulations. VISA extends importance-weighted forward-KL variational inference by employing a sequence of sample-average approximations, which are considered valid inside a trust region. This makes it possible…

    Submitted 15 March, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

  4. arXiv:2402.10109  [pdf, other]

    cs.AI cs.CL cs.LG

    Towards Reducing Diagnostic Errors with Interpretable Risk Prediction

    Authors: Denis Jered McInerney, William Dickinson, Lucy C. Flynn, Andrea C. Young, Geoffrey S. Young, Jan-Willem van de Meent, Byron C. Wallace

    Abstract: Many diagnostic errors occur because clinicians cannot easily access relevant information in patient Electronic Health Records (EHRs). In this work we propose a method to use LLMs to identify pieces of evidence in patient EHR data that indicate increased or decreased risk of specific diagnoses; our ultimate aim is to increase access to evidence and reduce diagnostic errors. In particular, we propo…

    Submitted 19 March, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

  5. arXiv:2312.07529  [pdf, other]

    cs.LG

    Topological Obstructions and How to Avoid Them

    Authors: Babak Esmaeili, Robin Walters, Heiko Zimmermann, Jan-Willem van de Meent

    Abstract: Incorporating geometric inductive biases into models can aid interpretability and generalization, but encoding to a specific geometric structure can be challenging due to the imposed topological constraints. In this paper, we theoretically and empirically characterize obstructions to training encoders with geometric latent spaces. We show that local optima can arise due to singularities (e.g. self…

    Submitted 12 December, 2023; originally announced December 2023.

  6. arXiv:2306.12392  [pdf, other]

    cs.RO cs.LG

    One-shot Imitation Learning via Interaction Warping

    Authors: Ondrej Biza, Skye Thompson, Kishore Reddy Pagidi, Abhinav Kumar, Elise van der Pol, Robin Walters, Thomas Kipf, Jan-Willem van de Meent, Lawson L. S. Wong, Robert Platt

    Abstract: Imitation learning of robot policies from few demonstrations is crucial in open-ended applications. We propose a new method, Interaction Warping, for learning SE(3) robotic manipulation policies from a single demonstration. We infer the 3D mesh of each object in the environment using shape warping, a technique for aligning point clouds across object instances. Then, we represent manipulation actio…

    Submitted 4 November, 2023; v1 submitted 21 June, 2023; originally announced June 2023.

    Comments: CoRL 2023

  7. arXiv:2305.02506  [pdf, ps, other]

    cs.PL cs.LG cs.LO math.CT math.PR

    String Diagrams with Factorized Densities

    Authors: Eli Sennesh, Jan-Willem van de Meent

    Abstract: A growing body of research on probabilistic programs and causal models has highlighted the need to reason compositionally about model classes that extend directed graphical models. Both probabilistic programs and causal models define a joint probability density over a set of random variables, and exhibit sparse structure that can be used to reason about causation and conditional independence. This…

    Submitted 14 December, 2023; v1 submitted 3 May, 2023; originally announced May 2023.

    Comments: In Proceedings ACT 2023, arXiv:2312.08138

    Journal ref: EPTCS 397, 2023, pp. 260-278

  8. arXiv:2302.12343  [pdf, other]

    cs.CL cs.AI cs.LG

    CHiLL: Zero-shot Custom Interpretable Feature Extraction from Clinical Notes with Large Language Models

    Authors: Denis Jered McInerney, Geoffrey Young, Jan-Willem van de Meent, Byron C. Wallace

    Abstract: We propose CHiLL (Crafting High-Level Latents), an approach for natural-language specification of features for linear models. CHiLL prompts LLMs with expert-crafted queries to generate interpretable features from health records. The resulting noisy labels are then used to train a simple linear classifier. Generating features based on queries to an LLM can empower physicians to use their domain exp…

    Submitted 19 October, 2023; v1 submitted 23 February, 2023; originally announced February 2023.

    Comments: To be published at EMNLP Findings 2023

  9. arXiv:2211.09676  [pdf, other]

    cs.PL cs.AI cs.IT

    Verified Reversible Programming for Verified Lossless Compression

    Authors: James Townsend, Jan-Willem van de Meent

    Abstract: Lossless compression implementations typically contain two programs, an encoder and a decoder, which are required to be inverse to one another. We observe that a significant class of compression methods, based on asymmetric numeral systems (ANS), have shared structure between the encoder and decoder -- the decoder program is the 'reverse' of the encoder program -- allowing both to be simultaneousl…

    Submitted 23 November, 2022; v1 submitted 2 November, 2022; originally announced November 2022.

  10. arXiv:2210.07992  [pdf, other]

    stat.ML cs.LG

    A Variational Perspective on Generative Flow Networks

    Authors: Heiko Zimmermann, Fredrik Lindsten, Jan-Willem van de Meent, Christian A. Naesseth

    Abstract: Generative flow networks (GFNs) are a class of models for sequential sampling of composite objects, which approximate a target distribution that is defined in terms of an energy function or a reward. GFNs are typically trained using a flow matching or trajectory balance objective, which matches forward and backward transition models over trajectories. In this work, we define variational objectives…

    Submitted 14 October, 2022; originally announced October 2022.

  11. arXiv:2210.06565  [pdf, other]

    cs.LG cs.AI cs.CV eess.IV

    That's the Wrong Lung! Evaluating and Improving the Interpretability of Unsupervised Multimodal Encoders for Medical Data

    Authors: Denis Jered McInerney, Geoffrey Young, Jan-Willem van de Meent, Byron C. Wallace

    Abstract: Pretraining multimodal models on Electronic Health Records (EHRs) provides a means of learning representations that can transfer to downstream tasks with minimal supervision. Recent multimodal models induce soft local alignments between image regions and sentences. This is of particular interest in the medical domain, where alignments might highlight regions in an image relevant to specific phenom…

    Submitted 22 October, 2022; v1 submitted 12 October, 2022; originally announced October 2022.

    Journal ref: Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing

  12. arXiv:2204.13022  [pdf, other]

    cs.LG

    Binding Actions to Objects in World Models

    Authors: Ondrej Biza, Robert Platt, Jan-Willem van de Meent, Lawson L. S. Wong, Thomas Kipf

    Abstract: We study the problem of binding actions to objects in object-factored world models using action-attention mechanisms. We propose two attention mechanisms for binding actions to objects, soft attention and hard attention, which we evaluate in the context of structured world models for five environments. Our experiments show that hard attention helps contrastively-trained structured world models to…

    Submitted 27 April, 2022; originally announced April 2022.

    Comments: Published at the ICLR 2022 workshop on Objects, Structure and Causality

  13. arXiv:2204.11371  [pdf, other]

    cs.LG

    Learning Symmetric Embeddings for Equivariant World Models

    Authors: Jung Yeon Park, Ondrej Biza, Linfeng Zhao, Jan Willem van de Meent, Robin Walters

    Abstract: Incorporating symmetries can lead to highly data-efficient and generalizable models by defining equivalence classes of data samples related by transformations. However, characterizing how transformations act on input data is often difficult, limiting the applicability of equivariant models. We propose learning symmetric embedding networks (SENs) that encode an input space (e.g. images), where we d…

    Submitted 30 June, 2022; v1 submitted 24 April, 2022; originally announced April 2022.

    Comments: ICML 2022

  14. arXiv:2202.05333  [pdf, other]

    cs.RO cs.LG

    Factored World Models for Zero-Shot Generalization in Robotic Manipulation

    Authors: Ondrej Biza, Thomas Kipf, David Klee, Robert Platt, Jan-Willem van de Meent, Lawson L. S. Wong

    Abstract: World models for environments with many objects face a combinatorial explosion of states: as the number of objects increases, the number of possible arrangements grows exponentially. In this paper, we learn to generalize over robotic pick-and-place tasks using object-factored world models, which combat the combinatorial explosion by ensuring that predictions are equivariant to permutations of obje…

    Submitted 10 February, 2022; originally announced February 2022.

  15. arXiv:2201.05151  [pdf, other]

    cs.CV

    Beyond Simple Meta-Learning: Multi-Purpose Models for Multi-Domain, Active and Continual Few-Shot Learning

    Authors: Peyman Bateni, Jarred Barber, Raghav Goyal, Vaden Masrani, Jan-Willem van de Meent, Leonid Sigal, Frank Wood

    Abstract: Modern deep learning requires large-scale extensively labelled datasets for training. Few-shot learning aims to alleviate this issue by learning effectively from few labelled examples. In previously proposed few-shot visual classifiers, it is assumed that the feature manifold, where classifier decisions are made, has uncorrelated feature dimensions and uniform feature variance. In this work, we fo…

    Submitted 12 December, 2022; v1 submitted 13 January, 2022; originally announced January 2022.

  16. arXiv:2106.13798  [pdf, other]

    cs.LG stat.ML

    Conjugate Energy-Based Models

    Authors: Hao Wu, Babak Esmaeili, Michael Wick, Jean-Baptiste Tristan, Jan-Willem van de Meent

    Abstract: In this paper, we propose conjugate energy-based models (CEBMs), a new class of energy-based models that define a joint density over data and latent variables. The joint density of a CEBM decomposes into an intractable distribution over data and a tractable posterior over latent variables. CEBMs have similar use cases as variational autoencoders, in the sense that they learn an unsupervised mappin…

    Submitted 25 June, 2021; originally announced June 2021.

  17. arXiv:2106.11302  [pdf, other]

    stat.ML cs.LG

    Nested Variational Inference

    Authors: Heiko Zimmermann, Hao Wu, Babak Esmaeili, Jan-Willem van de Meent

    Abstract: We develop nested variational inference (NVI), a family of methods that learn proposals for nested importance samplers by minimizing a forward or reverse KL divergence at each level of nesting. NVI is applicable to many commonly-used importance sampling strategies and provides a mechanism for learning intermediate densities, which can serve as heuristics to guide the sampler. Our experiments appl…

    Submitted 21 June, 2021; originally announced June 2021.

  18. arXiv:2104.07155  [pdf, other]

    cs.CL cs.LG

    Disentangling Representations of Text by Masking Transformers

    Authors: Xiongyi Zhang, Jan-Willem van de Meent, Byron C. Wallace

    Abstract: Representations from large pretrained models such as BERT encode a range of features into monolithic vectors, affording strong predictive accuracy across a multitude of downstream tasks. In this paper we explore whether it is possible to learn disentangled representations by identifying existing subnetworks within pretrained models that encode distinct, complementary aspect representations. Concre…

    Submitted 10 September, 2021; v1 submitted 14 April, 2021; originally announced April 2021.

    Comments: 14 pages, 9 figures

  19. arXiv:2104.06338  [pdf, other]

    cs.CL

    On the Impact of Random Seeds on the Fairness of Clinical Classifiers

    Authors: Silvio Amir, Jan-Willem van de Meent, Byron C. Wallace

    Abstract: Recent work has shown that fine-tuning large networks is surprisingly sensitive to changes in random seed(s). We explore the implications of this phenomenon for model fairness across demographic groups in clinical prediction tasks over electronic health records (EHR) in MIMIC-III -- the standard dataset in clinical NLP research. Apparent subgroup performance varies substantially for seeds that yie…

    Submitted 13 April, 2021; originally announced April 2021.

    Comments: Accepted for publication at NAACL 2021

  20. arXiv:2103.00668  [pdf, other]

    stat.ML cs.LG cs.PL

    Learning Proposals for Probabilistic Programs with Inference Combinators

    Authors: Sam Stites, Heiko Zimmermann, Hao Wu, Eli Sennesh, Jan-Willem van de Meent

    Abstract: We develop operators for construction of proposals in probabilistic programs, which we refer to as inference combinators. Inference combinators define a grammar over importance samplers that compose primitive operations such as application of a transition kernel and importance resampling. Proposals in these samplers can be parameterized using neural networks, which in turn can be trained by optimi…

    Submitted 16 June, 2021; v1 submitted 28 February, 2021; originally announced March 2021.

    Comments: Accepted to UAI 2021

  21. arXiv:2102.11163  [pdf, other]

    cs.CV eess.IV

    Generator Surgery for Compressed Sensing

    Authors: Niklas Smedemark-Margulies, Jung Yeon Park, Max Daniels, Rose Yu, Jan-Willem van de Meent, Paul Hand

    Abstract: Image recovery from compressive measurements requires a signal prior for the images being reconstructed. Recent work has explored the use of deep generative models with low latent dimension as signal priors for such problems. However, their recovery performance is limited by high representation error. We introduce a method for achieving low representation error using generators as signal priors. U…

    Submitted 28 February, 2021; v1 submitted 22 February, 2021; originally announced February 2021.

    Comments: Code available at: https://github.com/nik-sm/generator-surgery

  22. arXiv:2101.04178  [pdf, other]

    cs.RO cs.LG

    Action Priors for Large Action Spaces in Robotics

    Authors: Ondrej Biza, Dian Wang, Robert Platt, Jan-Willem van de Meent, Lawson L. S. Wong

    Abstract: In robotics, it is often not possible to learn useful policies using pure model-free reinforcement learning without significant reward shaping or curriculum learning. As a consequence, many researchers rely on expert demonstrations to guide learning. However, acquiring expert demonstrations can be expensive. This paper proposes an alternative approach where the solutions of previously solved tasks…

    Submitted 15 February, 2021; v1 submitted 11 January, 2021; originally announced January 2021.

    Comments: 13 pages, 9 figures

    Journal ref: Proceedings of the 20th International Conference on Autonomous Agents and MultiAgent Systems (AAMAS '21). 2021. 205 - 213

  23. arXiv:2006.12245  [pdf, other]

    cs.CV cs.LG stat.ML

    Enhancing Few-Shot Image Classification with Unlabelled Examples

    Authors: Peyman Bateni, Jarred Barber, Jan-Willem van de Meent, Frank Wood

    Abstract: We develop a transductive meta-learning method that uses unlabelled instances to improve few-shot image classification performance. Our approach combines a regularized Mahalanobis-distance-based soft k-means clustering procedure with a modified state of the art neural adaptive feature extractor to achieve improved test-time classification accuracy using unlabelled data. We evaluate our method on t…

    Submitted 21 October, 2021; v1 submitted 17 June, 2020; originally announced June 2020.

  24. arXiv:2004.04645  [pdf, other]

    cs.LG stat.ML

    Query-Focused EHR Summarization to Aid Imaging Diagnosis

    Authors: Denis Jered McInerney, Borna Dabiri, Anne-Sophie Touret, Geoffrey Young, Jan-Willem van de Meent, Byron C. Wallace

    Abstract: Electronic Health Records (EHRs) provide vital contextual information to radiologists and other physicians when making a diagnosis. Unfortunately, because a given patient's record may contain hundreds of notes and reports, identifying relevant information within these in the short time typically allotted to a case is very difficult. We propose and evaluate models that extract relevant text snippet…

    Submitted 26 April, 2020; v1 submitted 9 April, 2020; originally announced April 2020.

    Journal ref: Proceedings of Machine Learning Research 126 (2020) 632-659

  25. arXiv:2003.09779  [pdf, other]

    cs.LG stat.ML

    Deep Markov Spatio-Temporal Factorization

    Authors: Amirreza Farnoosh, Behnaz Rezaei, Eli Zachary Sennesh, Zulqarnain Khan, Jennifer Dy, Ajay Satpute, J Benjamin Hutchinson, Jan-Willem van de Meent, Sarah Ostadabbas

    Abstract: We introduce deep Markov spatio-temporal factorization (DMSTF), a generative model for dynamical analysis of spatio-temporal data. Like other factor analysis methods, DMSTF approximates high dimensional data by a product between time dependent weights and spatially dependent factors. These weights and factors are in turn represented in terms of lower dimensional latents inferred using stochastic v…

    Submitted 18 August, 2020; v1 submitted 21 March, 2020; originally announced March 2020.

  26. arXiv:2003.04300  [pdf, other]

    cs.LG stat.ML

    Learning Discrete State Abstractions With Deep Variational Inference

    Authors: Ondrej Biza, Robert Platt, Jan-Willem van de Meent, Lawson L. S. Wong

    Abstract: Abstraction is crucial for effective sequential decision making in domains with large state spaces. In this work, we propose an information bottleneck method for learning approximate bisimulations, a type of state abstraction. We use a deep neural encoder to map states onto continuous embeddings. We map these embeddings onto a discrete representation using an action-conditioned hidden Markov model…

    Submitted 11 January, 2021; v1 submitted 9 March, 2020; originally announced March 2020.

    Comments: 15 pages, 7 figures

  27. arXiv:1911.04594  [pdf, other]

    cs.LG stat.ML

    Rate-Regularization and Generalization in VAEs

    Authors: Alican Bozkurt, Babak Esmaeili, Jean-Baptiste Tristan, Dana H. Brooks, Jennifer G. Dy, Jan-Willem van de Meent

    Abstract: Variational autoencoders optimize an objective that combines a reconstruction loss (the distortion) and a KL term (the rate). The rate is an upper bound on the mutual information, which is often interpreted as a regularizer that controls the degree of compression. We here examine whether inclusion of the rate also acts as an inductive bias that improves generalization. We perform rate-distortion a…

    Submitted 25 March, 2021; v1 submitted 11 November, 2019; originally announced November 2019.

  28. arXiv:1911.01382  [pdf, other]

    stat.ML cs.LG

    Amortized Population Gibbs Samplers with Neural Sufficient Statistics

    Authors: Hao Wu, Heiko Zimmermann, Eli Sennesh, Tuan Anh Le, Jan-Willem van de Meent

    Abstract: We develop amortized population Gibbs (APG) samplers, a class of scalable methods that frames structured variational inference as adaptive importance sampling. APG samplers construct high-dimensional proposals by iterating over updates to lower-dimensional blocks of variables. We train each conditional proposal by minimizing the inclusive KL divergence with respect to the conditional posterior. To…

    Submitted 9 July, 2020; v1 submitted 4 November, 2019; originally announced November 2019.

  29. arXiv:1906.08901  [pdf, other]

    cs.LG eess.IV stat.ML

    Neural Topographic Factor Analysis for fMRI Data

    Authors: Eli Sennesh, Zulqarnain Khan, Yiyu Wang, Jennifer Dy, Ajay B. Satpute, J. Benjamin Hutchinson, Jan-Willem van de Meent

    Abstract: Neuroimaging studies produce gigabytes of spatio-temporal data for a small number of participants and stimuli. Rarely do researchers attempt to model and examine how individual participants vary from each other -- a question that should be addressable even in small samples given the right statistical tools. We propose Neural Topographic Factor Analysis (NTFA), a probabilistic factor analysis model…

    Submitted 20 November, 2020; v1 submitted 20 June, 2019; originally announced June 2019.

    Comments: 15 pages, 9 figures, associated source code available at https://github.com/neu-spiral/HTFATorch

    Journal ref: Advances in Neural Information Processing Systems 34 (2020)

  30. arXiv:1812.09624  [pdf, other]

    cs.LG stat.ML

    Can VAEs Generate Novel Examples?

    Authors: Alican Bozkurt, Babak Esmaeili, Dana H. Brooks, Jennifer G. Dy, Jan-Willem van de Meent

    Abstract: An implicit goal in works on deep generative models is that such models should be able to generate novel examples that were not previously seen in the training data. In this paper, we investigate to what extent this property holds for widely employed variational autoencoder (VAE) architectures. VAEs maximize a lower bound on the log marginal likelihood, which implies that they will in principle ov…

    Submitted 22 December, 2018; originally announced December 2018.

    Comments: Presented at Critiquing and Correcting Trends in Machine Learning Workshop at NeurIPS 2018

  31. arXiv:1812.05035  [pdf, other]

    cs.CL cs.LG

    Structured Neural Topic Models for Reviews

    Authors: Babak Esmaeili, Hongyi Huang, Byron C. Wallace, Jan-Willem van de Meent

    Abstract: We present Variational Aspect-based Latent Topic Allocation (VALTA), a family of autoencoding topic models that learn aspect-based representations of reviews. VALTA defines a user-item encoder that maps bag-of-words vectors for combined reviews associated with each paired user and item onto structured embeddings, which in turn define per-aspect topic weights. We model individual reviews in a struc…

    Submitted 1 January, 2019; v1 submitted 12 December, 2018; originally announced December 2018.

  32. arXiv:1812.01569  [pdf, other]

    cs.AI

    Nested Reasoning About Autonomous Agents Using Probabilistic Programs

    Authors: Iris Rubi Seaman, Jan-Willem van de Meent, David Wingate

    Abstract: As autonomous agents become more ubiquitous, they will eventually have to reason about the plans of other agents, which is known as theory of mind reasoning. We develop a planning-as-inference framework in which agents perform nested simulation to reason about the behavior of other agents in an online manner. As a concrete application of this framework, we use probabilistic programs to model a hig…

    Submitted 4 March, 2020; v1 submitted 4 December, 2018; originally announced December 2018.

  33. arXiv:1811.05965  [pdf, other]

    cs.LG cs.PL stat.CO stat.ML

    Composing Modeling and Inference Operations with Probabilistic Program Combinators

    Authors: Eli Sennesh, Adam Ścibior, Hao Wu, Jan-Willem van de Meent

    Abstract: Probabilistic programs with dynamic computation graphs can define measures over sample spaces with unbounded dimensionality, which constitute programmatic analogues to Bayesian nonparametrics. Owing to the generality of this model class, inference relies on 'black-box' Monte Carlo methods that are often not able to take advantage of conditional independence and exchangeability, which have historic…

    Submitted 28 November, 2018; v1 submitted 14 November, 2018; originally announced November 2018.

    Comments: Published at the NeurIPS workshop "All of Bayesian Nonparametrics (Especially the Useful Bits)" 2018 (https://sites.google.com/view/nipsbnp2018/)

  34. arXiv:1810.13296  [pdf, other]

    stat.ML cs.LG

    On Exploration, Exploitation and Learning in Adaptive Importance Sampling

    Authors: Xiaoyu Lu, Tom Rainforth, Yuan Zhou, Jan-Willem van de Meent, Yee Whye Teh

    Abstract: We study adaptive importance sampling (AIS) as an online learning problem and argue for the importance of the trade-off between exploration and exploitation in this adaptation. Borrowing ideas from the bandits literature, we propose Daisee, a partition-based AIS algorithm. We further introduce a notion of regret for AIS and show that Daisee has $\mathcal{O}(\sqrt{T}(\log T)^{\frac{3}{4}})$ cumulat…

    Submitted 31 October, 2018; originally announced October 2018.

  35. arXiv:1809.10756  [pdf, other]

    stat.ML cs.AI cs.LG cs.PL

    An Introduction to Probabilistic Programming

    Authors: Jan-Willem van de Meent, Brooks Paige, Hongseok Yang, Frank Wood

    Abstract: This book is a graduate-level introduction to probabilistic programming. It not only provides a thorough background for anyone wishing to use a probabilistic programming system, but also introduces the techniques needed to design and build these systems. It is aimed at people who have an undergraduate-level understanding of either or, ideally, both probabilistic machine learning and programming la…

    Submitted 19 October, 2021; v1 submitted 27 September, 2018; originally announced September 2018.

    Comments: Under review at Foundations and Trends in Machine Learning

  36. arXiv:1804.07212  [pdf, other]

    cs.CL

    Learning Disentangled Representations of Texts with Application to Biomedical Abstracts

    Authors: Sarthak Jain, Edward Banner, Jan-Willem van de Meent, Iain J. Marshall, Byron C. Wallace

    Abstract: We propose a method for learning disentangled representations of texts that code for distinct and complementary aspects, with the aim of affording efficient model transfer and interpretability. To induce disentangled embeddings, we propose an adversarial objective based on the (dis)similarity between triplets of documents with respect to specific aspects. Our motivating application is embedding bi…

    Submitted 3 September, 2018; v1 submitted 19 April, 2018; originally announced April 2018.

    Comments: Accepted to EMNLP 2018

  37. arXiv:1804.02086  [pdf, other]

    stat.ML cs.LG

    Structured Disentangled Representations

    Authors: Babak Esmaeili, Hao Wu, Sarthak Jain, Alican Bozkurt, N. Siddharth, Brooks Paige, Dana H. Brooks, Jennifer Dy, Jan-Willem van de Meent

    Abstract: Deep latent-variable models learn representations of high-dimensional data in an unsupervised manner. A number of recent efforts have focused on learning representations that disentangle statistically independent axes of variation by introducing modifications to the standard objective function. These approaches generally assume a simple diagonal Gaussian prior and as a result are not able to relia…

    Submitted 12 December, 2018; v1 submitted 5 April, 2018; originally announced April 2018.

  38. arXiv:1707.04314  [pdf, other]

    stat.ML cs.AI cs.PL stat.CO

    Bayesian Optimization for Probabilistic Programs

    Authors: Tom Rainforth, Tuan Anh Le, Jan-Willem van de Meent, Michael A. Osborne, Frank Wood

    Abstract: We present the first general purpose framework for marginal maximum a posteriori estimation of probabilistic program variables. By using a series of code transformations, the evidence of any probabilistic program, and therefore of any graphical model, can be optimized with respect to an arbitrary subset of its sampled variables. To carry out this optimization, we develop the first Bayesian optimiz…

    Submitted 13 July, 2017; originally announced July 2017.

  39. arXiv:1706.00400  [pdf, other]

    stat.ML cs.AI cs.LG

    Learning Disentangled Representations with Semi-Supervised Deep Generative Models

    Authors: N. Siddharth, Brooks Paige, Jan-Willem van de Meent, Alban Desmaison, Noah D. Goodman, Pushmeet Kohli, Frank Wood, Philip H. S. Torr

    Abstract: Variational autoencoders (VAEs) learn representations of data by jointly training a probabilistic encoder and decoder network. Typically these models encode all features of the data into a single variable. Here we are interested in learning disentangled representations that encode distinct aspects of the data into separate variables. We propose to learn such representations using model architectur…

    Submitted 13 November, 2017; v1 submitted 1 June, 2017; originally announced June 2017.

    Comments: Accepted for publication at NIPS 2017

  40. arXiv:1611.07492  [pdf, other]

    stat.ML cs.CV cs.LG

    Inducing Interpretable Representations with Variational Autoencoders

    Authors: N. Siddharth, Brooks Paige, Alban Desmaison, Jan-Willem Van de Meent, Frank Wood, Noah D. Goodman, Pushmeet Kohli, Philip H. S. Torr

    Abstract: We develop a framework for incorporating structured graphical models in the encoders of variational autoencoders (VAEs) that allows us to induce interpretable representations through approximate variational inference. This allows us to both perform reasoning (e.g. classification) under the structural constraints of a given graphical model, and use deep generative models to deal with messy,…

    Submitted 22 November, 2016; originally announced November 2016.

    Comments: Presented at NIPS 2016 Workshop on Interpretable Machine Learning in Complex Systems

  41. arXiv:1611.06863  [pdf, other]

    stat.ML cs.LG

    Probabilistic structure discovery in time series data

    Authors: David Janz, Brooks Paige, Tom Rainforth, Jan-Willem van de Meent, Frank Wood

    Abstract: Existing methods for structure discovery in time series data construct interpretable, compositional kernels for Gaussian process regression models. While the learned Gaussian process model provides posterior mean and variance estimates, typically the structure is learned via a greedy optimization procedure. This restricts the space of possible solutions and leads to over-confident uncertainty esti…

    Submitted 21 November, 2016; originally announced November 2016.

  42. arXiv:1608.05263  [pdf, other]

    cs.PL

    Design and Implementation of Probabilistic Programming Language Anglican

    Authors: David Tolpin, Jan Willem van de Meent, Hongseok Yang, Frank Wood

    Abstract: Anglican is a probabilistic programming system designed to interoperate with Clojure and other JVM languages. We introduce the programming language Anglican, outline our design choices, and discuss in depth the implementation of the Anglican language and runtime, including macro-based compilation, extended CPS-based evaluation model, and functional representations for probabilistic paradigms, such…

    Submitted 30 November, 2016; v1 submitted 18 August, 2016; originally announced August 2016.

    Comments: IFL 2016 submission, 12 pages, 2 figures

  43. arXiv:1507.04635  [pdf, other]

    stat.ML cs.AI

    Black-Box Policy Search with Probabilistic Programs

    Authors: Jan-Willem van de Meent, Brooks Paige, David Tolpin, Frank Wood

    Abstract: In this work, we explore how probabilistic programs can be used to represent policies in sequential decision problems. In this formulation, a probabilistic program is a black-box stochastic simulator for both the problem domain and the agent. We relate classic policy gradient techniques to recently introduced black-box variational methods which generalize to probabilistic program inference. We pre…

    Submitted 4 August, 2016; v1 submitted 16 July, 2015; originally announced July 2015.

    Journal ref: Proceedings of the 19th International Conference on Artificial Intelligence and Statistics (2016) 1195-1204

  44. arXiv:1507.00996  [pdf, other]

    stat.ML cs.AI cs.PL

    A New Approach to Probabilistic Programming Inference

    Authors: Frank Wood, Jan Willem van de Meent, Vikash Mansinghka

    Abstract: We introduce and demonstrate a new approach to inference in expressive probabilistic programming languages based on particle Markov chain Monte Carlo. Our approach is simple to implement and easy to parallelize. It applies to Turing-complete probabilistic programming languages and supports accurate inference in models that make use of complex control flow, including stochastic recursion. It also i…

    Submitted 9 July, 2015; v1 submitted 3 July, 2015; originally announced July 2015.

    Comments: Updated version of the 2014 AISTATS paper (to reflect changes in new language syntax). 10 pages, 3 figures. Proceedings of the Seventeenth International Conference on Artificial Intelligence and Statistics, JMLR Workshop and Conference Proceedings, Vol 33, 2014

  45. arXiv:1502.07314  [pdf, other]

    cs.AI

    Path Finding under Uncertainty through Probabilistic Inference

    Authors: David Tolpin, Brooks Paige, Jan Willem van de Meent, Frank Wood

    Abstract: We introduce a new approach to solving path-finding problems under uncertainty by representing them as probabilistic models and applying domain-independent inference algorithms to the models. This approach separates problem representation from the inference algorithm and provides a framework for efficient learning of path-finding policies. We evaluate the new approach on the Canadian Traveler Prob…

    Submitted 8 June, 2015; v1 submitted 25 February, 2015; originally announced February 2015.

  46. arXiv:1501.06769  [pdf, other]

    stat.ML cs.AI cs.PL

    Particle Gibbs with Ancestor Sampling for Probabilistic Programs

    Authors: Jan-Willem van de Meent, Hongseok Yang, Vikash Mansinghka, Frank Wood

    Abstract: Particle Markov chain Monte Carlo techniques rank among current state-of-the-art methods for probabilistic program inference. A drawback of these techniques is that they rely on importance resampling, which results in degenerate particle trajectories and a low effective sample size for variables sampled early in a program. We here develop a formalism to adapt ancestor resampling, a technique that…

    Submitted 9 February, 2015; v1 submitted 27 January, 2015; originally announced January 2015.

    Comments: 9 pages, 2 figures

  47. arXiv:1501.05677  [pdf, other]

    cs.AI stat.ML

    Output-Sensitive Adaptive Metropolis-Hastings for Probabilistic Programs

    Authors: David Tolpin, Jan Willem van de Meent, Brooks Paige, Frank Wood

    Abstract: We introduce an adaptive output-sensitive Metropolis-Hastings algorithm for probabilistic models expressed as programs, Adaptive Lightweight Metropolis-Hastings (AdLMH). The algorithm extends Lightweight Metropolis-Hastings (LMH) by adjusting the probabilities of proposing random variables for modification to improve convergence of the program output. We show that AdLMH converges to the correct eq…

    Submitted 5 May, 2015; v1 submitted 22 January, 2015; originally announced January 2015.