Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–7 of 7 results for author: Sorrell, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.16739  [pdf, other

    cs.LG cs.AI eess.SY

    Oracle-Efficient Reinforcement Learning for Max Value Ensembles

    Authors: Marcel Hussing, Michael Kearns, Aaron Roth, Sikata Bela Sengupta, Jessica Sorrell

    Abstract: Reinforcement learning (RL) in large or infinite state spaces is notoriously challenging, both theoretically (where worst-case sample and computational complexities must scale with state space cardinality) and experimentally (where function approximation and policy gradient techniques often scale poorly and suffer from instability and high variance). One line of research attempting to address thes… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

  2. arXiv:2305.15284  [pdf, other

    cs.LG

    Replicable Reinforcement Learning

    Authors: Eric Eaton, Marcel Hussing, Michael Kearns, Jessica Sorrell

    Abstract: The replicability crisis in the social, behavioral, and data sciences has led to the formulation of algorithm frameworks for replicability -- i.e., a requirement that an algorithm produce identical outputs (with high probability) when run on two different samples from the same underlying distribution. While still in its infancy, provably replicable algorithms have been developed for many fundament… ▽ More

    Submitted 31 October, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

  3. arXiv:2303.12921  [pdf, ps, other

    cs.LG cs.CR cs.DS

    Stability is Stable: Connections between Replicability, Privacy, and Adaptive Generalization

    Authors: Mark Bun, Marco Gaboardi, Max Hopkins, Russell Impagliazzo, Rex Lei, Toniann Pitassi, Satchit Sivakumar, Jessica Sorrell

    Abstract: The notion of replicable algorithms was introduced in Impagliazzo et al. [STOC '22] to describe randomized algorithms that are stable under the resampling of their inputs. More precisely, a replicable algorithm gives the same output with high probability when its randomness is fixed and it is run on a new i.i.d. sample drawn from the same distribution. Using replicable algorithms for data analysis… ▽ More

    Submitted 24 March, 2023; v1 submitted 22 March, 2023; originally announced March 2023.

    Comments: STOC 2023, minor typos fixed

  4. arXiv:2301.13767  [pdf, other

    cs.LG cs.DS

    Multicalibration as Boosting for Regression

    Authors: Ira Globus-Harris, Declan Harrison, Michael Kearns, Aaron Roth, Jessica Sorrell

    Abstract: We study the connection between multicalibration and boosting for squared error regression. First we prove a useful characterization of multicalibration in terms of a ``swap regret'' like condition on squared error. Using this characterization, we give an exceedingly simple algorithm that can be analyzed both as a boosting algorithm for regression and as a multicalibration algorithm for a class H… ▽ More

    Submitted 31 January, 2023; originally announced January 2023.

    Comments: Code available here: https://github.com/Declancharrison/Level-Set-Boosting

  5. arXiv:2201.08430  [pdf, ps, other

    cs.LG

    Reproducibility in Learning

    Authors: Russell Impagliazzo, Rex Lei, Toniann Pitassi, Jessica Sorrell

    Abstract: We introduce the notion of a reproducible algorithm in the context of learning. A reproducible learning algorithm is resilient to variations in its samples -- with high probability, it returns the exact same output when run on two samples from the same underlying distribution. We begin by unpacking the definition, clarifying how randomness is instrumental in balancing accuracy and reproducibility.… ▽ More

    Submitted 14 April, 2023; v1 submitted 20 January, 2022; originally announced January 2022.

  6. arXiv:2106.07779  [pdf, ps, other

    cs.LG stat.ML

    Boosting in the Presence of Massart Noise

    Authors: Ilias Diakonikolas, Russell Impagliazzo, Daniel Kane, Rex Lei, Jessica Sorrell, Christos Tzamos

    Abstract: We study the problem of boosting the accuracy of a weak learner in the (distribution-independent) PAC model with Massart noise. In the Massart noise model, the label of each example $x$ is independently misclassified with probability $η(x) \leq η$, where $η<1/2$. The Massart model lies between the random classification noise model and the agnostic model. Our main positive result is the first compu… ▽ More

    Submitted 14 June, 2021; originally announced June 2021.

  7. arXiv:2002.01100  [pdf, other

    cs.LG stat.ML

    Efficient, Noise-Tolerant, and Private Learning via Boosting

    Authors: Mark Bun, Marco Leandro Carmosino, Jessica Sorrell

    Abstract: We introduce a simple framework for designing private boosting algorithms. We give natural conditions under which these algorithms are differentially private, efficient, and noise-tolerant PAC learners. To demonstrate our framework, we use it to construct noise-tolerant and private PAC learners for large-margin halfspaces whose sample complexity does not depend on the dimension. We give two samp… ▽ More

    Submitted 3 February, 2020; originally announced February 2020.

    Comments: 33 pages