Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 97 results for author: Schön, T

Searching in archive stat. Search in all archives.
.
  1. arXiv:2405.13794  [pdf, other

    stat.ML cs.LG stat.CO stat.ME

    Conditioning diffusion models by explicit forward-backward bridging

    Authors: Adrien Corenflos, Zheng Zhao, Simo Särkkä, Jens Sjölund, Thomas B. Schön

    Abstract: Given an unconditional diffusion model $π(x, y)$, using it to perform conditional simulation $π(x \mid y)$ is still largely an open question and is typically achieved by learning conditional drifts to the denoising SDE after the fact. In this work, we express conditional simulation as an inference problem on an augmented space corresponding to a partial SDE bridge. This perspective allows us to im… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: 24 pages, 12 figures

  2. arXiv:2311.12566  [pdf, other

    cs.LG stat.ML

    Variational Elliptical Processes

    Authors: Maria Bånkestad, Jens Sjölund, Jalil Taghia, Thomas B. Schöon

    Abstract: We present elliptical processes, a family of non-parametric probabilistic models that subsume Gaussian processes and Student's t processes. This generalization includes a range of new heavy-tailed behaviors while retaining computational tractability. Elliptical processes are based on a representation of elliptical distributions as a continuous mixture of Gaussian distributions. We parameterize thi… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

    Comments: 14 pages, 15 figures, appendix 9 pages

    Journal ref: Transactions on Machine Learning Research, September 2023

  3. arXiv:2310.19608  [pdf, other

    cs.LG stat.ML

    On Feynman--Kac training of partial Bayesian neural networks

    Authors: Zheng Zhao, Sebastian Mair, Thomas B. Schön, Jens Sjölund

    Abstract: Recently, partial Bayesian neural networks (pBNNs), which only consider a subset of the parameters to be stochastic, were shown to perform competitively with full Bayesian neural networks. However, pBNNs are often multi-modal in the latent variable space and thus challenging to approximate with parametric models. To address this problem, we propose an efficient sampling-based training strategy, wh… ▽ More

    Submitted 27 February, 2024; v1 submitted 30 October, 2023; originally announced October 2023.

    Comments: In AISTATS 2024

  4. arXiv:2310.10807  [pdf, other

    stat.ML cs.CR cs.LG math.OC

    Regularization properties of adversarially-trained linear regression

    Authors: Antônio H. Ribeiro, Dave Zachariah, Francis Bach, Thomas B. Schön

    Abstract: State-of-the-art machine learning models can be vulnerable to very small input perturbations that are adversarially constructed. Adversarial training is an effective approach to defend against it. Formulated as a min-max problem, it searches for the best solution when the training data were corrupted by the worst-case attacks. Linear models are among the simple models where vulnerabilities can be… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: Accepted (spotlight) NeurIPS 2023; A preliminary version of this work titled: "Surprises in adversarially-trained linear regression" was made available under a different identifier: arXiv:2205.12695

  5. arXiv:2309.16335  [pdf, other

    cs.LG cs.AI q-bio.QM stat.AP

    End-to-end Risk Prediction of Atrial Fibrillation from the 12-Lead ECG by Deep Neural Networks

    Authors: Theogene Habineza, Antônio H. Ribeiro, Daniel Gedon, Joachim A. Behar, Antonio Luiz P. Ribeiro, Thomas B. Schön

    Abstract: Background: Atrial fibrillation (AF) is one of the most common cardiac arrhythmias that affects millions of people each year worldwide and it is closely linked to increased risk of cardiovascular diseases such as stroke and heart failure. Machine learning methods have shown promising results in evaluating the risk of developing atrial fibrillation from the electrocardiogram. We aim to develop and… ▽ More

    Submitted 28 September, 2023; originally announced September 2023.

    Comments: 16 pages with 7 figures

    Journal ref: @article{HABINEZA2023193, journal = {Journal of Electrocardiology}, volume = {81}, pages = {193-200}, year = {2023}, issn = {0022-0736}}

  6. arXiv:2210.14684  [pdf, other

    stat.CO stat.AP stat.ME

    Nonlinear System Identification: Learning while respecting physical models using a sequential Monte Carlo method

    Authors: Anna Wigren, Johan Wågberg, Fredrik Lindsten, Adrian Wills, Thomas B. Schön

    Abstract: Identification of nonlinear systems is a challenging problem. Physical knowledge of the system can be used in the identification process to significantly improve the predictive performance by restricting the space of possible mappings from the input to the output. Typically, the physical models contain unknown parameters that must be learned from data. Classical methods often restrict the possible… ▽ More

    Submitted 26 October, 2022; originally announced October 2022.

    Comments: 52 pages, 13 figures

    Journal ref: IEEE Control Systems Magazine, Volume 42, Issue 1, pages 75 - 102, February 2022

  7. arXiv:2205.12695  [pdf, other

    stat.ML cs.CR cs.LG eess.SP math.ST

    Surprises in adversarially-trained linear regression

    Authors: Antônio H. Ribeiro, Dave Zachariah, Thomas B. Schön

    Abstract: State-of-the-art machine learning models can be vulnerable to very small input perturbations that are adversarially constructed. Adversarial training is an effective approach to defend against such examples. It is formulated as a min-max problem, searching for the best solution when the training data was corrupted by the worst-case attacks. For linear regression problems, adversarial training can… ▽ More

    Submitted 20 October, 2022; v1 submitted 25 May, 2022; originally announced May 2022.

  8. arXiv:2205.06306  [pdf, other

    stat.ML eess.SP stat.AP

    Probabilistic Estimation of Instantaneous Frequencies of Chirp Signals

    Authors: Zheng Zhao, Simo Särkkä, Jens Sjölund, Thomas B. Schön

    Abstract: We present a continuous-time probabilistic approach for estimating the chirp signal and its instantaneous frequency function when the true forms of these functions are not accessible. Our model represents these functions by non-linearly cascaded Gaussian processes represented as non-linear stochastic differential equations. The posterior distribution of the functions is then estimated with stochas… ▽ More

    Submitted 13 February, 2023; v1 submitted 12 May, 2022; originally announced May 2022.

    Comments: Accepted for publication in IEEE Transactions on Signal Processing

  9. arXiv:2204.06274  [pdf, other

    stat.ML cs.CR cs.LG eess.SP math.ST

    Overparameterized Linear Regression under Adversarial Attacks

    Authors: Antônio H. Ribeiro, Thomas B. Schön

    Abstract: We study the error of linear regression in the face of adversarial attacks. In this framework, an adversary changes the input to the regression model in order to maximize the prediction error. We provide bounds on the prediction error in the presence of an adversary as a function of the parameter norm and the error in the absence of such an adversary. We show how these bounds make it possible to s… ▽ More

    Submitted 27 January, 2023; v1 submitted 13 April, 2022; originally announced April 2022.

  10. arXiv:2202.01793  [pdf, other

    stat.ML cs.LG

    Incorporating Sum Constraints into Multitask Gaussian Processes

    Authors: Philipp Pilar, Carl Jidling, Thomas B. Schön, Niklas Wahlström

    Abstract: Machine learning models can be improved by adapting them to respect existing background knowledge. In this paper we consider multitask Gaussian processes, with background knowledge in the form of constraints that require a specific sum of the outputs to be constant. This is achieved by conditioning the prior distribution on the constraint fulfillment. The approach allows for both linear and nonlin… ▽ More

    Submitted 1 February, 2023; v1 submitted 3 February, 2022; originally announced February 2022.

    Journal ref: Transactions on Machine Learning Research, 2022

  11. Efficient Learning of the Parameters of Non-Linear Models using Differentiable Resampling in Particle Filters

    Authors: Conor Rosato, Vincent Beraud, Paul Horridge, Thomas B. Schön, Simon Maskell

    Abstract: It has been widely documented that the sampling and resampling steps in particle filters cannot be differentiated. The {\itshape reparameterisation trick} was introduced to allow the sampling step to be reformulated into a differentiable function. We extend the {\itshape reparameterisation trick} to include the stochastic input to resampling therefore limiting the discontinuities in the gradient c… ▽ More

    Submitted 27 April, 2022; v1 submitted 2 November, 2021; originally announced November 2021.

    Comments: 35 pages, 10 figures

  12. arXiv:2110.11948  [pdf, other

    cs.LG cs.CV stat.ML

    Learning Proposals for Practical Energy-Based Regression

    Authors: Fredrik K. Gustafsson, Martin Danelljan, Thomas B. Schön

    Abstract: Energy-based models (EBMs) have experienced a resurgence within machine learning in recent years, including as a promising alternative for probabilistic regression. However, energy-based regression requires a proposal distribution to be manually designed for training, and an initial estimate has to be provided at test-time. We address both of these issues by introducing a conceptually simple metho… ▽ More

    Submitted 7 November, 2023; v1 submitted 22 October, 2021; originally announced October 2021.

    Comments: AISTATS 2022. Code is available at https://github.com/fregu856/ebms_proposals

  13. arXiv:2012.07269  [pdf, ps, other

    stat.ML cs.LG

    Variational State and Parameter Estimation

    Authors: Jarrad Courts, Johannes Hendriks, Adrian Wills, Thomas Schön, Brett Ninness

    Abstract: This paper considers the problem of computing Bayesian estimates of both states and model parameters for nonlinear state-space models. Generally, this problem does not have a tractable solution and approximations must be utilised. In this work, a variational approach is used to provide an assumed density which approximates the desired, intractable, distribution. The approach is deterministic and r… ▽ More

    Submitted 14 December, 2020; originally announced December 2020.

  14. arXiv:2012.06341  [pdf, other

    cs.LG eess.SY stat.ML

    Beyond Occam's Razor in System Identification: Double-Descent when Modeling Dynamics

    Authors: Antônio H. Ribeiro, Johannes N. Hendriks, Adrian G. Wills, Thomas B. Schön

    Abstract: System identification aims to build models of dynamical systems from data. Traditionally, choosing the model requires the designer to balance between two goals of conflicting nature; the model must be rich enough to capture the system dynamics, but not so flexible that it learns spurious random effects from the dataset. It is typically observed that the model validation performance follows a U-sha… ▽ More

    Submitted 6 August, 2021; v1 submitted 11 December, 2020; originally announced December 2020.

    Comments: To appear in the Proceedings of the 19th IFAC Symposium in System Identification (2021)

  15. arXiv:2012.05072  [pdf, ps, other

    stat.ML cs.LG eess.SY stat.ME

    Variational System Identification for Nonlinear State-Space Models

    Authors: Jarrad Courts, Adrian Wills, Thomas Schön, Brett Ninness

    Abstract: This paper considers parameter estimation for nonlinear state-space models, which is an important but challenging problem. We address this challenge by employing a variational inference (VI) approach, which is a principled method that has deep connections to maximum likelihood estimation. This VI approach ultimately provides estimates of the model as solutions to an optimisation problem, which is… ▽ More

    Submitted 14 September, 2022; v1 submitted 8 December, 2020; originally announced December 2020.

  16. arXiv:2012.04634  [pdf, other

    cs.CV cs.LG cs.RO stat.ML

    Accurate 3D Object Detection using Energy-Based Models

    Authors: Fredrik K. Gustafsson, Martin Danelljan, Thomas B. Schön

    Abstract: Accurate 3D object detection (3DOD) is crucial for safe navigation of complex environments by autonomous robots. Regressing accurate 3D bounding boxes in cluttered environments based on sparse LiDAR data is however a highly challenging problem. We address this task by exploring recent advances in conditional energy-based models (EBMs) for probabilistic regression. While methods employing EBMs for… ▽ More

    Submitted 7 November, 2023; v1 submitted 8 December, 2020; originally announced December 2020.

    Comments: CVPR Workshops 2021. Code is available at https://github.com/fregu856/ebms_3dod

  17. arXiv:2005.01698  [pdf, other

    cs.CV cs.LG cs.RO stat.ML

    How to Train Your Energy-Based Model for Regression

    Authors: Fredrik K. Gustafsson, Martin Danelljan, Radu Timofte, Thomas B. Schön

    Abstract: Energy-based models (EBMs) have become increasingly popular within computer vision in recent years. While they are commonly employed for generative image modeling, recent work has applied EBMs also for regression tasks, achieving state-of-the-art performance on object detection and visual tracking. Training EBMs is however known to be challenging. While a variety of different techniques have been… ▽ More

    Submitted 14 August, 2020; v1 submitted 4 May, 2020; originally announced May 2020.

    Comments: BMVC 2020. Code is available at https://github.com/fregu856/ebms_regression

  18. arXiv:2003.14162  [pdf, other

    eess.SY cs.LG stat.ML

    Deep State Space Models for Nonlinear System Identification

    Authors: Daniel Gedon, Niklas Wahlström, Thomas B. Schön, Lennart Ljung

    Abstract: Deep state space models (SSMs) are an actively researched model class for temporal models developed in the deep learning community which have a close connection to classic SSMs. The use of deep SSMs as a black-box identification model can describe a wide range of dynamics due to the flexibility of deep neural networks. Additionally, the probabilistic nature of the model class allows the uncertaint… ▽ More

    Submitted 18 June, 2021; v1 submitted 31 March, 2020; originally announced March 2020.

  19. arXiv:2003.07201  [pdf, ps, other

    stat.ME cs.LG stat.ML

    The Elliptical Processes: a Family of Fat-tailed Stochastic Processes

    Authors: Maria Bånkestad, Jens Sjölund, Jalil Taghia, Thomas Schön

    Abstract: We present the elliptical processes -- a family of non-parametric probabilistic models that subsumes the Gaussian process and the Student-t process. This generalization includes a range of new fat-tailed behaviors yet retains computational tractability. We base the elliptical processes on a representation of elliptical distributions as a continuous mixture of Gaussian distributions and derive clos… ▽ More

    Submitted 2 December, 2020; v1 submitted 13 March, 2020; originally announced March 2020.

  20. Gaussian Variational State Estimation for Nonlinear State-Space Models

    Authors: Jarrad Courts, Adrian Wills, Thomas B. Schön

    Abstract: In this paper, the problem of state estimation, in the context of both filtering and smoothing, for nonlinear state-space models is considered. Due to the nonlinear nature of the models, the state estimation problem is generally intractable as it involves integrals of general nonlinear functions and the filtered and smoothed state distributions lack closed-form solutions. As such, it is common to… ▽ More

    Submitted 1 October, 2021; v1 submitted 6 February, 2020; originally announced February 2020.

  21. arXiv:2002.01600  [pdf, other

    stat.ML cs.LG physics.comp-ph

    Linearly Constrained Neural Networks

    Authors: Johannes Hendriks, Carl Jidling, Adrian Wills, Thomas Schön

    Abstract: We present a novel approach to modelling and learning vector fields from physical systems using neural networks that explicitly satisfy known linear operator constraints. To achieve this, the target function is modelled as a linear transformation of an underlying potential field, which is in turn modelled by a neural network. This transformation is chosen such that any prediction of the target fun… ▽ More

    Submitted 27 April, 2021; v1 submitted 4 February, 2020; originally announced February 2020.

  22. arXiv:1910.09527  [pdf, ps, other

    stat.CO stat.ML

    Particle filter with rejection control and unbiased estimator of the marginal likelihood

    Authors: Jan Kudlicka, Lawrence M. Murray, Thomas B. Schön, Fredrik Lindsten

    Abstract: We consider the combined use of resampling and partial rejection control in sequential Monte Carlo methods, also known as particle filters. While the variance reducing properties of rejection control are known, there has not been (to the best of our knowledge) any work on unbiased estimation of the marginal likelihood (also known as the model evidence or the normalizing constant) in this type of p… ▽ More

    Submitted 4 March, 2020; v1 submitted 21 October, 2019; originally announced October 2019.

  23. arXiv:1909.12297  [pdf, other

    cs.LG cs.CV stat.ML

    Energy-Based Models for Deep Probabilistic Regression

    Authors: Fredrik K. Gustafsson, Martin Danelljan, Goutam Bhat, Thomas B. Schön

    Abstract: While deep learning-based classification is generally tackled using standardized approaches, a wide variety of techniques are employed for regression. In computer vision, one particularly popular such technique is that of confidence-based regression, which entails predicting a confidence value for each input-target pair (x,y). While this approach has demonstrated impressive results, it requires im… ▽ More

    Submitted 19 July, 2020; v1 submitted 26 September, 2019; originally announced September 2019.

    Comments: ECCV 2020. Code is available at https://github.com/fregu856/ebms_regression

  24. arXiv:1909.01844  [pdf, other

    stat.ML cs.LG

    Deep kernel learning for integral measurements

    Authors: Carl Jidling, Johannes Hendriks, Thomas B. Schön, Adrian Wills

    Abstract: Deep kernel learning refers to a Gaussian process that incorporates neural networks to improve the modelling of complex functions. We present a method that makes this approach feasible for problems where the data consists of line integral measurements of the target function. The performance is illustrated on computed tomography reconstruction examples.

    Submitted 4 September, 2019; originally announced September 2019.

  25. arXiv:1909.01730  [pdf, other

    eess.SY cs.LG cs.NE stat.ML

    Deep Convolutional Networks in System Identification

    Authors: Carl Andersson, Antônio H. Ribeiro, Koen Tiels, Niklas Wahlström, Thomas B. Schön

    Abstract: Recent developments within deep learning are relevant for nonlinear system identification problems. In this paper, we establish connections between the deep learning and the system identification communities. It has recently been shown that convolutional architectures are at least as capable as recurrent architectures when it comes to sequence modeling tasks. Inspired by these results we explore t… ▽ More

    Submitted 19 November, 2019; v1 submitted 4 September, 2019; originally announced September 2019.

    Comments: Accepted to Conference on Decision and Control, The first two authors contributed equally

  26. arXiv:1909.01238  [pdf, other

    eess.SY stat.ML

    Stochastic quasi-Newton with line-search regularization

    Authors: Adrian Wills, Thomas Schön

    Abstract: In this paper we present a novel quasi-Newton algorithm for use in stochastic optimisation. Quasi-Newton methods have had an enormous impact on deterministic optimisation problems because they afford rapid convergence and computationally attractive algorithms. In essence, this is achieved by learning the second-order (Hessian) information based on observing first-order gradients. We extend these i… ▽ More

    Submitted 3 September, 2019; originally announced September 2019.

  27. arXiv:1907.04615  [pdf, other

    stat.CO stat.ML

    Probabilistic programming for birth-death models of evolution using an alive particle filter with delayed sampling

    Authors: Jan Kudlicka, Lawrence M. Murray, Fredrik Ronquist, Thomas B. Schön

    Abstract: We consider probabilistic programming for birth-death models of evolution and introduce a new widely-applicable inference method that combines an extension of the alive particle filter (APF) with automatic Rao-Blackwellization via delayed sampling. Birth-death models of evolution are an important family of phylogenetic models of the diversification processes that lead to evolutionary trees. Probab… ▽ More

    Submitted 14 February, 2021; v1 submitted 10 July, 2019; originally announced July 2019.

    Journal ref: Conference on Uncertainty in Artificial Intelligence (UAI) 2019

  28. arXiv:1906.08482  [pdf, other

    cs.LG cs.NE math.DS stat.ML

    Beyond exploding and vanishing gradients: analysing RNN training using attractors and smoothness

    Authors: Antônio H. Ribeiro, Koen Tiels, Luis A. Aguirre, Thomas B. Schön

    Abstract: The exploding and vanishing gradient problem has been the major conceptual principle behind most architecture and training improvements in recurrent neural networks (RNNs) during the last decade. In this paper, we argue that this principle, while powerful, might need some refinement to explain recent developments. We refine the concept of exploding gradients by reformulating the problem in terms o… ▽ More

    Submitted 5 March, 2020; v1 submitted 20 June, 2019; originally announced June 2019.

    Comments: To appear in the Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics (AISTATS), 2020. PMLR: Volume 108. This paper was previously titled "The trade-off between long-term memory and smoothness for recurrent networks". The current version subsumes all previous versions

  29. arXiv:1906.01620  [pdf, other

    cs.LG cs.CV stat.ML

    Evaluating Scalable Bayesian Deep Learning Methods for Robust Computer Vision

    Authors: Fredrik K. Gustafsson, Martin Danelljan, Thomas B. Schön

    Abstract: While deep neural networks have become the go-to approach in computer vision, the vast majority of these models fail to properly capture the uncertainty inherent in their predictions. Estimating this predictive uncertainty can be crucial, for example in automotive applications. In Bayesian deep learning, predictive uncertainty is commonly decomposed into the distinct types of aleatoric and epistem… ▽ More

    Submitted 7 April, 2020; v1 submitted 4 June, 2019; originally announced June 2019.

    Comments: CVPR Workshops 2020. Code is available at https://github.com/fregu856/evaluating_bdl

  30. arXiv:1906.01584  [pdf, other

    math.OC cs.LG stat.ML

    Robust exploration in linear quadratic reinforcement learning

    Authors: Jack Umenberger, Mina Ferizbegovic, Thomas B. Schön, Håkan Hjalmarsson

    Abstract: This paper concerns the problem of learning control policies for an unknown linear dynamical system to minimize a quadratic cost function. We present a method, based on convex optimization, that accomplishes this task robustly: i.e., we minimize the worst-case cost, accounting for system uncertainty given the observed data. The method balances exploitation and exploration, exciting the system in s… ▽ More

    Submitted 4 June, 2019; originally announced June 2019.

  31. arXiv:1905.06854  [pdf, other

    physics.comp-ph physics.data-an stat.ML

    Neutron Transmission Strain Tomography for Non-Constant Stress-Free Lattice Spacing

    Authors: J. N. Hendriks, C. Jidling, T. B. Schön, A. Wills, C. M. Wensrich, E. H. Kisi

    Abstract: Recently, several algorithms for strain tomography from energy-resolved neutron transmission measurements have been proposed. These methods assume that the stress-free lattice spacing $d_0$ is a known constant limiting their application to the study of stresses generated by manufacturing and loading methods that do not alter this parameter. In this paper, we consider the more general problem of jo… ▽ More

    Submitted 18 July, 2019; v1 submitted 15 May, 2019; originally announced May 2019.

    Comments: Journal article, 14 pages, 5 figures

    Journal ref: Nuclear instruments and methods in physics research section B, 456:64-73, 2019

  32. arXiv:1904.01949  [pdf, other

    cs.LG eess.SP stat.ML

    Automatic diagnosis of the 12-lead ECG using a deep neural network

    Authors: Antônio H. Ribeiro, Manoel Horta Ribeiro, Gabriela M. M. Paixão, Derick M. Oliveira, Paulo R. Gomes, Jéssica A. Canazart, Milton P. S. Ferreira, Carl R. Andersson, Peter W. Macfarlane, Wagner Meira Jr., Thomas B. Schön, Antonio Luiz P. Ribeiro

    Abstract: The role of automatic electrocardiogram (ECG) analysis in clinical practice is limited by the accuracy of existing models. Deep Neural Networks (DNNs) are models composed of stacked transformations that learn tasks by examples. This technology has recently achieved striking success in a variety of task and there are great expectations on how it might improve clinical practice. Here we present a DN… ▽ More

    Submitted 14 April, 2020; v1 submitted 1 April, 2019; originally announced April 2019.

    Comments: A preliminary version of this work titled: "Automatic Diagnosis of Short-Duration 12-Lead ECG using a Deep Convolutional Network " was presented in the Machine Learning for Health Workshop at NeurIPS 2018 and was made available under a different identifier: arXiv:1811.12194. The current version subsumes all previous versions

    Journal ref: Nature Communications 11, article number: 1760 (2020)

  33. arXiv:1903.04797  [pdf, other

    stat.ML cs.LG stat.CO

    Elements of Sequential Monte Carlo

    Authors: Christian A. Naesseth, Fredrik Lindsten, Thomas B. Schön

    Abstract: A core problem in statistics and probabilistic machine learning is to compute probability distributions and expectations. This is the fundamental problem of Bayesian statistics and machine learning, which frames all inference as expectations with respect to the posterior distribution. The key challenge is to approximate these intractable expectations. In this tutorial, we review sequential Monte C… ▽ More

    Submitted 4 March, 2022; v1 submitted 12 March, 2019; originally announced March 2019.

    Comments: Foundations and Trends in Machine Learning

  34. arXiv:1903.02250  [pdf, other

    eess.SY math.OC stat.ML

    Nonlinear input design as optimal control of a Hamiltonian system

    Authors: Jack Umenberger, Thomas B. Schön

    Abstract: We propose an input design method for a general class of parametric probabilistic models, including nonlinear dynamical systems with process noise. The goal of the procedure is to select inputs such that the parameter posterior distribution concentrates about the true value of the parameters; however, exact computation of the posterior is intractable. By representing (samples from) the posterior a… ▽ More

    Submitted 6 March, 2019; originally announced March 2019.

  35. arXiv:1902.06977  [pdf

    cs.LG stat.ML

    Evaluating model calibration in classification

    Authors: Juozas Vaicenavicius, David Widmann, Carl Andersson, Fredrik Lindsten, Jacob Roll, Thomas B. Schön

    Abstract: Probabilistic classifiers output a probability distribution on target classes rather than just a class prediction. Besides providing a clear separation of prediction and decision making, the main advantage of probabilistic models is their ability to represent uncertainty about predictions. In safety-critical applications, it is pivotal for a model to possess an adequate sense of uncertainty, which… ▽ More

    Submitted 19 February, 2019; originally announced February 2019.

  36. arXiv:1902.04272  [pdf, other

    cs.LG cs.AI cs.CV cs.RO stat.ML

    Towards Self-Supervised High Level Sensor Fusion

    Authors: Qadeer Khan, Torsten Schön, Patrick Wenzel

    Abstract: In this paper, we present a framework to control a self-driving car by fusing raw information from RGB images and depth maps. A deep neural network architecture is used for mapping the vision and depth information, respectively, to steering commands. This fusion of information from two sensor sources allows to provide redundancy and fault tolerance in the presence of sensor failures. Even if one o… ▽ More

    Submitted 12 February, 2019; originally announced February 2019.

  37. arXiv:1902.03777  [pdf, other

    cs.LG cs.AI cs.CV cs.RO stat.ML

    Semantic Label Reduction Techniques for Autonomous Driving

    Authors: Qadeer Khan, Torsten Schön, Patrick Wenzel

    Abstract: Semantic segmentation maps can be used as input to models for maneuvering the controls of a car. However, not all labels may be necessary for making the control decision. One would expect that certain labels such as road lanes or sidewalks would be more critical in comparison with labels for vegetation or buildings which may not have a direct influence on the car's driving decision. In this append… ▽ More

    Submitted 11 February, 2019; originally announced February 2019.

  38. arXiv:1902.03765  [pdf, other

    cs.LG cs.AI cs.CV cs.RO stat.ML

    Latent Space Reinforcement Learning for Steering Angle Prediction

    Authors: Qadeer Khan, Torsten Schön, Patrick Wenzel

    Abstract: Model-free reinforcement learning has recently been shown to successfully learn navigation policies from raw sensor data. In this work, we address the problem of learning driving policies for an autonomous agent in a high-fidelity simulator. Building upon recent research that applies deep reinforcement learning to navigation problems, we present a modular deep reinforcement learning approach to pr… ▽ More

    Submitted 11 February, 2019; originally announced February 2019.

  39. arXiv:1902.01182  [pdf, other

    stat.ML cs.AI cs.LG

    Constructing the Matrix Multilayer Perceptron and its Application to the VAE

    Authors: Jalil Taghia, Maria Bånkestad, Fredrik Lindsten, Thomas B. Schön

    Abstract: Like most learning algorithms, the multilayer perceptrons (MLP) is designed to learn a vector of parameters from data. However, in certain scenarios we are interested in learning structured parameters (predictions) in the form of symmetric positive definite matrices. Here, we introduce a variant of the MLP, referred to as the matrix MLP, that is specialized at learning symmetric positive definite… ▽ More

    Submitted 4 February, 2019; originally announced February 2019.

  40. arXiv:1901.09919  [pdf, other

    stat.ME cs.LG stat.ML

    Inferring Heterogeneous Causal Effects in Presence of Spatial Confounding

    Authors: Muhammad Osama, Dave Zachariah, Thomas B. Schön

    Abstract: We address the problem of inferring the causal effect of an exposure on an outcome across space, using observational data. The data is possibly subject to unmeasured confounding variables which, in a standard approach, must be adjusted for by estimating a nuisance function. Here we develop a method that eliminates the nuisance function, while mitigating the resulting errors-in-variables. The resul… ▽ More

    Submitted 3 June, 2019; v1 submitted 28 January, 2019; originally announced January 2019.

    Comments: 10 pages, 10 figures

  41. arXiv:1812.07319  [pdf, other

    stat.ML cs.LG

    Evaluating the squared-exponential covariance function in Gaussian processes with integral observations

    Authors: J. N. Hendriks, C. Jidling, A. Wills, T. B. Schön

    Abstract: This paper deals with the evaluation of double line integrals of the squared exponential covariance function. We propose a new approach in which the double integral is reduced to a single integral using the error function. This single integral is then computed with efficiently implemented numerical techniques. The performance is compared against existing state of the art methods and the results sh… ▽ More

    Submitted 18 December, 2018; originally announced December 2018.

  42. arXiv:1811.12194  [pdf, other

    eess.SP cs.HC cs.LG stat.ML

    Automatic Diagnosis of Short-Duration 12-Lead ECG using a Deep Convolutional Network

    Authors: Antônio H. Ribeiro, Manoel Horta Ribeiro, Gabriela Paixão, Derick Oliveira, Paulo R. Gomes, Jéssica A. Canazart, Milton Pifano, Wagner Meira Jr., Thomas B. Schön, Antonio Luiz Ribeiro

    Abstract: We present a model for predicting electrocardiogram (ECG) abnormalities in short-duration 12-lead ECG signals which outperformed medical doctors on the 4th year of their cardiology residency. Such exams can provide a full evaluation of heart activity and have not been studied in previous end-to-end machine learning papers. Using the database of a large telehealth network, we built a novel dataset… ▽ More

    Submitted 17 February, 2019; v1 submitted 28 November, 2018; originally announced November 2018.

    Comments: Machine Learning for Health (ML4H) Workshop at NeurIPS 2018 arXiv:1811.07216

    Report number: ML4H/2018/82

  43. arXiv:1811.10947  [pdf, other

    stat.ML cs.LG

    Reliable Semi-Supervised Learning when Labels are Missing at Random

    Authors: Xiuming Liu, Dave Zachariah, Johan Wågberg, Thomas B. Schön

    Abstract: Semi-supervised learning methods are motivated by the availability of large datasets with unlabeled features in addition to labeled data. Unlabeled data is, however, not guaranteed to improve classification performance and has in fact been reported to impair the performance in certain cases. A fundamental source of error arises from restrictive assumptions about the unlabeled features, which resul… ▽ More

    Submitted 24 October, 2019; v1 submitted 27 November, 2018; originally announced November 2018.

  44. arXiv:1810.01539  [pdf, other

    stat.ML cs.LG

    Automated learning with a probabilistic programming language: Birch

    Authors: Lawrence M. Murray, Thomas B. Schön

    Abstract: This work offers a broad perspective on probabilistic modeling and inference in light of recent advances in probabilistic programming, in which models are formally expressed in Turing-complete programming languages. We consider a typical workflow and how probabilistic programming languages can help to automate this workflow, especially in the matching of models with inference methods. We focus on… ▽ More

    Submitted 16 April, 2020; v1 submitted 2 October, 2018; originally announced October 2018.

  45. arXiv:1810.01269  [pdf, other

    math.OC cs.LG stat.ML

    A fast quasi-Newton-type method for large-scale stochastic optimisation

    Authors: Adrian Wills, Carl Jidling, Thomas Schon

    Abstract: During recent years there has been an increased interest in stochastic adaptations of limited memory quasi-Newton methods, which compared to pure gradient-based routines can improve the convergence by incorporating second order information. In this work we propose a direct least-squares approach conceptually similar to the limited memory quasi-Newton methods, but that computes the search direction… ▽ More

    Submitted 29 September, 2018; originally announced October 2018.

    Comments: arXiv admin note: substantial text overlap with arXiv:1802.04310

  46. arXiv:1809.03779  [pdf, other

    cs.CV cs.LG stat.ML

    Probabilistic approach to limited-data computed tomography reconstruction

    Authors: Zenith Purisha, Carl Jidling, Niklas Wahlström, Simo Särkkä, Thomas B. Schön

    Abstract: In this work, we consider the inverse problem of reconstructing the internal structure of an object from limited x-ray projections. We use a Gaussian process prior to model the target function and estimate its (hyper)parameters from measured data. In contrast to other established methods, this comes with the advantage of not requiring any manual parameter tuning, which usually arises in classical… ▽ More

    Submitted 3 July, 2019; v1 submitted 11 September, 2018; originally announced September 2018.

  47. arXiv:1808.05889  [pdf, ps, other

    stat.ME eess.SP stat.CO stat.ML

    Data Consistency Approach to Model Validation

    Authors: Andreas Svensson, Dave Zachariah, Petre Stoica, Thomas B. Schön

    Abstract: In scientific inference problems, the underlying statistical modeling assumptions have a crucial impact on the end results. There exist, however, only a few automatic means for validating these fundamental modelling assumptions. The contribution in this paper is a general criterion to evaluate the consistency of a set of statistical models with respect to observed data. This is achieved by automat… ▽ More

    Submitted 20 May, 2019; v1 submitted 17 August, 2018; originally announced August 2018.

    Journal ref: IEEE Access, 7(1):59788-59796, 2019

  48. arXiv:1806.00319  [pdf, other

    stat.ML cs.LG math.OC

    Learning convex bounds for linear quadratic control policy synthesis

    Authors: Jack Umenberger, Thomas B. Schön

    Abstract: Learning to make decisions from observed data in dynamic environments remains a problem of fundamental importance in a number of fields, from artificial intelligence and robotics, to medicine and finance. This paper concerns the problem of learning control policies for unknown linear dynamical systems so as to maximize a quadratic reward function. We present a method to optimize the expected value… ▽ More

    Submitted 1 June, 2018; originally announced June 2018.

  49. arXiv:1802.09086  [pdf, other

    stat.ML

    Conditionally Independent Multiresolution Gaussian Processes

    Authors: Jalil Taghia, Thomas B. Schön

    Abstract: The multiresolution Gaussian process (GP) has gained increasing attention as a viable approach towards improving the quality of approximations in GPs that scale well to large-scale data. Most of the current constructions assume full independence across resolutions. This assumption simplifies the inference, but it underestimates the uncertainties in transitioning from one resolution to another. Thi… ▽ More

    Submitted 24 February, 2019; v1 submitted 25 February, 2018; originally announced February 2018.

  50. arXiv:1802.04310  [pdf, other

    stat.ML cs.LG

    Stochastic quasi-Newton with adaptive step lengths for large-scale problems

    Authors: Adrian Wills, Thomas Schön

    Abstract: We provide a numerically robust and fast method capable of exploiting the local geometry when solving large-scale stochastic optimisation problems. Our key innovation is an auxiliary variable construction coupled with an inverse Hessian approximation computed using a receding history of iterates and gradients. It is the Markov chain nature of the classic stochastic gradient algorithm that enables… ▽ More

    Submitted 12 February, 2018; originally announced February 2018.