Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–5 of 5 results for author: Lorenzen, S S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2106.13624  [pdf, other

    cs.LG stat.ML

    Chebyshev-Cantelli PAC-Bayes-Bennett Inequality for the Weighted Majority Vote

    Authors: Yi-Shan Wu, Andrés R. Masegosa, Stephan S. Lorenzen, Christian Igel, Yevgeny Seldin

    Abstract: We present a new second-order oracle bound for the expected risk of a weighted majority vote. The bound is based on a novel parametric form of the Chebyshev- Cantelli inequality (a.k.a. one-sided Chebyshev's), which is amenable to efficient minimization. The new form resolves the optimization challenge faced by prior oracle bounds based on the Chebyshev-Cantelli inequality, the C-bounds [Germain e… ▽ More

    Submitted 17 January, 2023; v1 submitted 25 June, 2021; originally announced June 2021.

    Comments: aligned with the camera-ready version published at NeurIPS 2021

  2. arXiv:2106.12912  [pdf, other

    cs.LG

    Information Bottleneck: Exact Analysis of (Quantized) Neural Networks

    Authors: Stephan Sloth Lorenzen, Christian Igel, Mads Nielsen

    Abstract: The information bottleneck (IB) principle has been suggested as a way to analyze deep neural networks. The learning dynamics are studied by inspecting the mutual information (MI) between the hidden layers and the input and output. Notably, separate fitting and compression phases during training have been reported. This led to some controversy including claims that the observations are not reproduc… ▽ More

    Submitted 14 February, 2022; v1 submitted 24 June, 2021; originally announced June 2021.

  3. arXiv:2007.13532  [pdf, other

    cs.LG stat.ML

    Second Order PAC-Bayesian Bounds for the Weighted Majority Vote

    Authors: Andrés R. Masegosa, Stephan S. Lorenzen, Christian Igel, Yevgeny Seldin

    Abstract: We present a novel analysis of the expected risk of weighted majority vote in multiclass classification. The analysis takes correlation of predictions by ensemble members into account and provides a bound that is amenable to efficient minimization, which yields improved weighting for the majority vote. We also provide a specialized version of our bound for binary classification, which allows to ex… ▽ More

    Submitted 17 December, 2020; v1 submitted 1 July, 2020; originally announced July 2020.

  4. arXiv:1908.08656  [pdf, other

    cs.DB cs.IR

    Revisiting Wedge Sampling for Budgeted Maximum Inner Product Search

    Authors: Stephan S. Lorenzen, Ninh Pham

    Abstract: Top-k maximum inner product search (MIPS) is a central task in many machine learning applications. This paper extends top-k MIPS with a budgeted setting, that asks for the best approximate top-k MIPS given a limit of B computational operations. We investigate recent advanced sampling algorithms, including wedge and diamond sampling to solve it. Though the design of these sampling schemes naturally… ▽ More

    Submitted 12 September, 2020; v1 submitted 23 August, 2019; originally announced August 2019.

    Comments: ECML-PKDD 2020

  5. arXiv:1810.09746  [pdf, ps, other

    cs.LG stat.ML

    On PAC-Bayesian Bounds for Random Forests

    Authors: Stephan Sloth Lorenzen, Christian Igel, Yevgeny Seldin

    Abstract: Existing guarantees in terms of rigorous upper bounds on the generalization error for the original random forest algorithm, one of the most frequently used machine learning methods, are unsatisfying. We discuss and evaluate various PAC-Bayesian approaches to derive such bounds. The bounds do not require additional hold-out data, because the out-of-bag samples from the bagging in the training proce… ▽ More

    Submitted 6 March, 2019; v1 submitted 23 October, 2018; originally announced October 2018.