
Showing 1–23 of 23 results for author: Oberhauser, H

Searching in archive cs.
  1. arXiv:2404.12219  [pdf, other]

    cs.LG math.NA stat.ML

    A Quadrature Approach for General-Purpose Batch Bayesian Optimization via Probabilistic Lifting

    Authors: Masaki Adachi, Satoshi Hayakawa, Martin Jørgensen, Saad Hamid, Harald Oberhauser, Michael A. Osborne

    Abstract: Parallelisation in Bayesian optimisation is a common strategy but faces several challenges: the need for flexibility in acquisition functions and kernel choices, flexibility dealing with discrete and continuous variables simultaneously, model misspecification, and lastly fast massive parallelisation. To address these challenges, we introduce a versatile and modular framework for batch Bayesian opt…

    Submitted 19 April, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

    Comments: This work is the journal extension of the workshop paper (arXiv:2301.11832) and AISTATS paper (arXiv:2306.05843). 48 pages, 11 figures

    MSC Class: 62C10; 62F15

  2. arXiv:2311.12214  [pdf, other]

    stat.ML cs.LG stat.ME

    Random Fourier Signature Features

    Authors: Csaba Toth, Harald Oberhauser, Zoltán Szabó

    Abstract: Tensor algebras give rise to one of the most powerful measures of similarity for sequences of arbitrary length called the signature kernel accompanied with attractive theoretical guarantees from stochastic analysis. Previous algorithms to compute the signature kernel scale quadratically in terms of the length and the number of the sequences. To mitigate this severe computational bottleneck, we dev…

    Submitted 20 November, 2023; originally announced November 2023.

  3. arXiv:2311.04171  [pdf, other]

    cs.LG cs.AI math.AT math.DG math.ST

    HADES: Fast Singularity Detection with Local Measure Comparison

    Authors: Uzu Lim, Harald Oberhauser, Vidit Nanda

    Abstract: We introduce Hades, an unsupervised algorithm to detect singularities in data. This algorithm employs a kernel goodness-of-fit test, and as a consequence it is much faster and far more scaleable than the existing topology-based alternatives. Using tools from differential geometry and optimal transport theory, we prove that Hades correctly detects singularities with high probability when the data s…

    Submitted 7 November, 2023; originally announced November 2023.

    MSC Class: 55N31; 32S50

  4. arXiv:2306.05843  [pdf, other]

    cs.LG cs.AI math.NA stat.CO stat.ML

    Adaptive Batch Sizes for Active Learning: A Probabilistic Numerics Approach

    Authors: Masaki Adachi, Satoshi Hayakawa, Martin Jørgensen, Xingchen Wan, Vu Nguyen, Harald Oberhauser, Michael A. Osborne

    Abstract: Active learning parallelization is widely used, but typically relies on fixing the batch size throughout experimentation. This fixed approach is inefficient because of a dynamic trade-off between cost and speed -- larger batches are more costly, smaller batches lead to slower wall-clock run-times -- and the trade-off may change over the run (larger batches are often preferable earlier). To address…

    Submitted 21 February, 2024; v1 submitted 9 June, 2023; originally announced June 2023.

    Comments: Accepted at AISTATS 2024. 33 pages, 6 figures

    MSC Class: 62C10; 62F15

  5. arXiv:2305.04625  [pdf, other]

    math.PR cs.LG stat.ML

    The Signature Kernel

    Authors: Darrick Lee, Harald Oberhauser

    Abstract: The signature kernel is a positive definite kernel for sequential data. It inherits theoretical guarantees from stochastic analysis, has efficient algorithms for computation, and shows strong empirical performance. In this short survey paper for a forthcoming Springer handbook, we give an elementary introduction to the signature kernel and highlight these theoretical and computational properties.

    Submitted 8 May, 2023; originally announced May 2023.

    Comments: 31 pages, 2 figures

  6. arXiv:2301.12466  [pdf, other]

    stat.ML cs.IT cs.LG

    Kernelized Cumulants: Beyond Kernel Mean Embeddings

    Authors: Patric Bonnier, Harald Oberhauser, Zoltán Szabó

    Abstract: In $\mathbb R^d$, it is well-known that cumulants provide an alternative to moments that can achieve the same goals with numerous benefits such as lower variance estimators. In this paper we extend cumulants to reproducing kernel Hilbert spaces (RKHS) using tools from tensor algebras and show that they are computationally tractable by a kernel trick. These kernelized cumulants provide a new set of…

    Submitted 29 October, 2023; v1 submitted 29 January, 2023; originally announced January 2023.

    Comments: 19 pages, 8 figures

  7. arXiv:2301.11832  [pdf, other]

    cs.LG math.NA stat.CO stat.ML

    SOBER: Highly Parallel Bayesian Optimization and Bayesian Quadrature over Discrete and Mixed Spaces

    Authors: Masaki Adachi, Satoshi Hayakawa, Saad Hamid, Martin Jørgensen, Harald Oberhauser, Michael A. Osborne

    Abstract: Batch Bayesian optimisation and Bayesian quadrature have been shown to be sample-efficient methods of performing optimisation and quadrature where expensive-to-evaluate objective functions can be queried in parallel. However, current methods do not scale to large batch sizes -- a frequent desideratum in practice (e.g. drug discovery or simulation-based inference). We present a novel algorithm, SOB…

    Submitted 5 July, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

    Comments: 34 pages, 12 figures

    MSC Class: 62C10; 62F15

  8. arXiv:2301.09517  [pdf, other]

    math.NA cs.LG stat.ML

    Sampling-based Nyström Approximation and Kernel Quadrature

    Authors: Satoshi Hayakawa, Harald Oberhauser, Terry Lyons

    Abstract: We analyze the Nyström approximation of a positive definite kernel associated with a probability measure. We first prove an improved error bound for the conventional Nyström approximation with i.i.d. sampling and singular-value decomposition in the continuous regime; the proof techniques are borrowed from statistical learning theory. We further introduce a refined selection of subspaces in Nyström…

    Submitted 22 May, 2023; v1 submitted 23 January, 2023; originally announced January 2023.

    Comments: 22 pages, ICML 2023 camera-ready version. Typos fixed

  9. arXiv:2206.04734  [pdf, other]

    cs.LG math.NA stat.CO stat.ML

    Fast Bayesian Inference with Batch Bayesian Quadrature via Kernel Recombination

    Authors: Masaki Adachi, Satoshi Hayakawa, Martin Jørgensen, Harald Oberhauser, Michael A. Osborne

    Abstract: Calculation of Bayesian posteriors and model evidences typically requires numerical integration. Bayesian quadrature (BQ), a surrogate-model-based approach to numerical integration, is capable of superb sample efficiency, but its lack of parallelisation has hindered its practical applications. In this work, we propose a parallelised (batch) BQ method, employing techniques from kernel quadrature, t…

    Submitted 27 January, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 38 pages, 6 figures

    MSC Class: 62C10; 62F15

    Journal ref: NeurIPS 35, 16533--16547 (2022)

  10. arXiv:2205.14092  [pdf, other]

    cs.LG math.PR

    Capturing Graphs with Hypo-Elliptic Diffusions

    Authors: Csaba Toth, Darrick Lee, Celia Hacker, Harald Oberhauser

    Abstract: Convolutional layers within graph neural networks operate by aggregating information about local neighbourhood structures; one common way to encode such substructures is through random walks. The distribution of these random walks evolves according to a diffusion equation defined using the graph Laplacian. We extend this approach by leveraging classic mathematical results about hypo-elliptic diffu…

    Submitted 27 May, 2022; originally announced May 2022.

    Comments: 30 pages

  11. arXiv:2110.06357  [pdf, other]

    math.ST cs.LG

    Tangent Space and Dimension Estimation with the Wasserstein Distance

    Authors: Uzu Lim, Harald Oberhauser, Vidit Nanda

    Abstract: Consider a set of points sampled independently near a smooth compact submanifold of Euclidean space. We provide mathematically rigorous bounds on the number of sample points required to estimate both the dimension and the tangent spaces of that manifold with high confidence. The algorithm for this estimation is Local PCA, a local version of principal component analysis. Our results accommodate for…

    Submitted 25 September, 2023; v1 submitted 12 October, 2021; originally announced October 2021.

    Comments: Main theorems rewritten. Introduction is written more compactly

  12. arXiv:2107.09597  [pdf, other]

    math.NA cs.LG stat.ML

    Positively Weighted Kernel Quadrature via Subsampling

    Authors: Satoshi Hayakawa, Harald Oberhauser, Terry Lyons

    Abstract: We study kernel quadrature rules with convex weights. Our approach combines the spectral properties of the kernel with recombination results about point measures. This results in effective algorithms that construct convex quadrature rules using only access to i.i.d. samples from the underlying measure and evaluation of the kernel and that result in a small worst-case error. In addition to our theo…

    Submitted 11 October, 2022; v1 submitted 20 July, 2021; originally announced July 2021.

    Comments: 29 pages, NeurIPS 2022 camera-ready version

  13. arXiv:2104.14691  [pdf, other]

    math.PR cs.CE

    Grid-Free Computation of Probabilistic Safety with Malliavin Calculus

    Authors: Francesco Cosentino, Harald Oberhauser, Alessandro Abate

    Abstract: This work concerns continuous-time, continuous-space stochastic dynamical systems described by stochastic differential equations (SDE). It presents a new approach to compute probabilistic safety regions, namely sets of initial conditions of the SDE associated to trajectories that are safe with a probability larger than a given threshold. The approach introduces a functional that is minimised at th…

    Submitted 10 January, 2023; v1 submitted 29 April, 2021; originally announced April 2021.

  14. arXiv:2102.03657  [pdf, other]

    cs.LG

    Neural SDEs as Infinite-Dimensional GANs

    Authors: Patrick Kidger, James Foster, Xuechen Li, Harald Oberhauser, Terry Lyons

    Abstract: Stochastic differential equations (SDEs) are a staple of mathematical modelling of temporal dynamics. However, a fundamental limitation has been that such models have typically been relatively inflexible, which recent work introducing Neural SDEs has sought to solve. Here, we show that the current classical approach to fitting SDEs may be approached as a special case of (Wasserstein) GANs, and in…

    Submitted 11 May, 2021; v1 submitted 6 February, 2021; originally announced February 2021.

    Comments: Published at ICML 2021

  15. arXiv:2102.02876  [pdf, other]

    stat.ML cs.LG math.PR

    Nonlinear Independent Component Analysis for Discrete-Time and Continuous-Time Signals

    Authors: Alexander Schell, Harald Oberhauser

    Abstract: We study the classical problem of recovering a multidimensional source signal from observations of nonlinear mixtures of this signal. We show that this recovery is possible (up to a permutation and monotone scaling of the source's original component signals) if the mixture is due to a sufficiently differentiable and invertible but otherwise arbitrarily nonlinear function and the component signals…

    Submitted 15 January, 2023; v1 submitted 4 February, 2021; originally announced February 2021.

    Comments: 89 pages, 10 figures; thoroughly revised presentation (including newly added Sections 2, 10, A.19). To appear in the Annals of Statistics

    MSC Class: 62H25; 62M99; 62H05; 60L10; 62M45; 62R10

  16. arXiv:2006.07027  [pdf, other]

    cs.LG stat.ML

    Seq2Tens: An Efficient Representation of Sequences by Low-Rank Tensor Projections

    Authors: Csaba Toth, Patric Bonnier, Harald Oberhauser

    Abstract: Sequential data such as time series, video, or text can be challenging to analyse as the ordered structure gives rise to complex dependencies. At the heart of this is non-commutativity, in the sense that reordering the elements of a sequence can completely change its meaning. We use a classical mathematical object -- the tensor algebra -- to capture such dependencies. To address the innate computa…

    Submitted 30 July, 2021; v1 submitted 12 June, 2020; originally announced June 2020.

    Comments: 37 pages, 6 figures, 8 tables

  17. arXiv:2006.01819  [pdf, other]

    cs.LG math.PR stat.ML

    Carathéodory Sampling for Stochastic Gradient Descent

    Authors: Francesco Cosentino, Harald Oberhauser, Alessandro Abate

    Abstract: Many problems require to optimize empirical risk functions over large data sets. Gradient descent methods that calculate the full gradient in every descent step do not scale to such datasets. Various flavours of Stochastic Gradient Descent (SGD) replace the expensive summation that computes the full gradient by approximating it with a small sum over a randomly selected subsample of the data set th…

    Submitted 25 November, 2020; v1 submitted 2 June, 2020; originally announced June 2020.

  18. arXiv:2006.01757  [pdf, other]

    cs.LG math.PR stat.ML

    A Randomized Algorithm to Reduce the Support of Discrete Measures

    Authors: Francesco Cosentino, Harald Oberhauser, Alessandro Abate

    Abstract: Given a discrete probability measure supported on $N$ atoms and a set of $n$ real-valued functions, there exists a probability measure that is supported on a subset of $n+1$ of the original $N$ atoms and has the same mean when integrated against each of the $n$ functions. If $N \gg n$ this results in a huge reduction of complexity. We give a simple geometric characterization of barycenters via ne…

    Submitted 26 November, 2020; v1 submitted 2 June, 2020; originally announced June 2020.

    Journal ref: 34th Conference on Advances in Neural Information Processing Systems, 2020

  19. arXiv:1906.08215  [pdf, other]

    stat.ML cs.LG math.PR

    Bayesian Learning from Sequential Data using Gaussian Processes with Signature Covariances

    Authors: Csaba Toth, Harald Oberhauser

    Abstract: We develop a Bayesian approach to learning from sequential data by using Gaussian processes (GPs) with so-called signature kernels as covariance functions. This allows to make sequences of different length comparable and to rely on strong theoretical results from stochastic analysis. Signatures capture sequential structure with tensors that can scale unfavourably in sequence length and state space…

    Submitted 6 July, 2020; v1 submitted 19 June, 2019; originally announced June 2019.

    Comments: Near camera ready version for ICML 2020. Previous title: "Variational Gaussian Processes with Signature Covariances"

  20. arXiv:1806.00381  [pdf, other]

    stat.ML cs.LG math.PR math.ST

    Persistence paths and signature features in topological data analysis

    Authors: Ilya Chevyrev, Vidit Nanda, Harald Oberhauser

    Abstract: We introduce a new feature map for barcodes that arise in persistent homology computation. The main idea is to first realize each barcode as a path in a convenient vector space, and to then compute its path signature which takes values in the tensor algebra of that vector space. The composition of these two operations - barcode to path, path to tensor series - results in a feature map that has sev…

    Submitted 12 December, 2018; v1 submitted 1 June, 2018; originally announced June 2018.

    Comments: Additional experiment and further details. To appear in IEEE Transactions on Pattern Analysis and Machine Intelligence

    Journal ref: IEEE TPAMI (2020) Volume: 42, Issue: 1, pp. 192 - 202

  21. arXiv:1801.00753  [pdf, other]

    stat.ML cs.LG math.ST stat.ME

    Probabilistic supervised learning

    Authors: Frithjof Gressmann, Franz J. Király, Bilal Mateen, Harald Oberhauser

    Abstract: Predictive modelling and supervised learning are central to modern data science. With predictions from an ever-expanding number of supervised black-box strategies - e.g., kernel methods, random forests, deep learning aka neural networks - being employed as a basis for decision making processes, it is crucial to understand the statistical uncertainty associated with these predictions. As a genera…

    Submitted 7 May, 2019; v1 submitted 2 January, 2018; originally announced January 2018.

  22. arXiv:1708.09708  [pdf, ps, other]

    stat.ML cs.DS math.ST

    Sketching the order of events

    Authors: Terry Lyons, Harald Oberhauser

    Abstract: We introduce features for massive data streams. These stream features can be thought of as "ordered moments" and generalize stream sketches from "moments of order one" to "ordered moments of arbitrary order". In analogy to classic moments, they have theoretical guarantees such as universality that are important for learning algorithms.

    Submitted 31 August, 2017; originally announced August 2017.

  23. arXiv:1601.08169  [pdf, ps, other]

    stat.ML cs.DM cs.LG math.ST stat.ME

    Kernels for sequentially ordered data

    Authors: Franz J Király, Harald Oberhauser

    Abstract: We present a novel framework for kernel learning with sequential data of any kind, such as time series, sequences of graphs, or strings. Our approach is based on signature features which can be seen as an ordered variant of sample (cross-)moments; it allows to obtain a "sequentialized" version of any static kernel. The sequential kernels are efficiently computable for discrete sequences and are sh…

    Submitted 29 January, 2016; originally announced January 2016.