
Showing 1–14 of 14 results for author: Bertrand, Q

Searching in archive cs.
  1. arXiv:2407.09499  [pdf, other]

    cs.CV cs.AI cs.LG stat.ML

    Self-Consuming Generative Models with Curated Data Provably Optimize Human Preferences

    Authors: Damien Ferbach, Quentin Bertrand, Avishek Joey Bose, Gauthier Gidel

    Abstract: The rapid progress in generative models has resulted in impressive leaps in generation quality, blurring the lines between synthetic and real data. Web-scale datasets are now prone to the inevitable contamination by synthetic data, directly impacting the training of future generated models. Already, some theoretical results on self-consuming generative models (a.k.a., iterative retraining) have em…

    Submitted 12 June, 2024; originally announced July 2024.

    MSC Class: 68T10 ACM Class: I.2.6

  2. arXiv:2312.08484  [pdf, other]

    cs.GT

    Q-learners Can Provably Collude in the Iterated Prisoner's Dilemma

    Authors: Quentin Bertrand, Juan Duque, Emilio Calvano, Gauthier Gidel

    Abstract: The deployment of machine learning systems in the market economy has triggered academic and institutional fears over potential tacit collusion between fully automated agents. Multiple recent economics studies have empirically shown the emergence of collusive strategies from agents guided by machine learning algorithms. In this work, we prove that multi-agent Q-learners playing the iterated prisone…

    Submitted 13 December, 2023; originally announced December 2023.

  3. arXiv:2310.00429  [pdf, other]

    cs.LG stat.ML

    On the Stability of Iterative Retraining of Generative Models on their own Data

    Authors: Quentin Bertrand, Avishek Joey Bose, Alexandre Duplessis, Marco Jiralerspong, Gauthier Gidel

    Abstract: Deep generative models have made tremendous progress in modeling complex data, often exhibiting generation quality that surpasses a typical human's ability to discern the authenticity of samples. Undeniably, a key driver of this success is enabled by the massive amounts of web-scale data consumed by these models. Due to these models' striking performance and ease of availability, the web will inev…

    Submitted 2 April, 2024; v1 submitted 30 September, 2023; originally announced October 2023.

  4. arXiv:2306.07905  [pdf, other]

    cs.LG math.OC stat.ML

    Omega: Optimistic EMA Gradients

    Authors: Juan Ramirez, Rohan Sukumaran, Quentin Bertrand, Gauthier Gidel

    Abstract: Stochastic min-max optimization has gained interest in the machine learning community with the advancements in GANs and adversarial training. Although game optimization is fairly well understood in the deterministic setting, some issues persist in the stochastic regime. Recent work has shown that stochastic gradient descent-ascent methods such as the optimistic gradient are highly sensitive to noi…

    Submitted 25 March, 2024; v1 submitted 13 June, 2023; originally announced June 2023.

    Comments: Oral at the LatinX in AI workshop @ ICML 2023

  5. arXiv:2211.14666  [pdf, other]

    cs.LG stat.ML

    Synergies between Disentanglement and Sparsity: Generalization and Identifiability in Multi-Task Learning

    Authors: Sébastien Lachapelle, Tristan Deleu, Divyat Mahajan, Ioannis Mitliagkas, Yoshua Bengio, Simon Lacoste-Julien, Quentin Bertrand

    Abstract: Although disentangled representations are often said to be beneficial for downstream tasks, current empirical and theoretical understanding is limited. In this work, we provide evidence that disentangled representations coupled with sparse base-predictors improve generalization. In the context of multi-task learning, we prove a new identifiability result that provides conditions under which maxima…

    Submitted 6 June, 2023; v1 submitted 26 November, 2022; originally announced November 2022.

    Comments: Appears in: Fortieth International Conference on Machine Learning (ICML 2023). 36 pages

    ACM Class: I.2.6; I.5.1

  6. arXiv:2206.12301  [pdf, other]

    cs.GT cs.LG stat.ML

    On the Limitations of Elo: Real-World Games, are Transitive, not Additive

    Authors: Quentin Bertrand, Wojciech Marian Czarnecki, Gauthier Gidel

    Abstract: Real-world competitive games, such as chess, go, or StarCraft II, rely on Elo models to measure the strength of their players. Since these games are not fully transitive, using Elo implicitly assumes they have a strong transitive component that can correctly be identified and extracted. In this study, we investigate the challenge of identifying the strength of the transitive component in games. Fi…

    Submitted 6 March, 2023; v1 submitted 21 June, 2022; originally announced June 2022.

  7. arXiv:2204.07826  [pdf, other]

    stat.ML cs.LG

    Beyond L1: Faster and Better Sparse Models with skglm

    Authors: Quentin Bertrand, Quentin Klopfenstein, Pierre-Antoine Bannier, Gauthier Gidel, Mathurin Massias

    Abstract: We propose a new fast algorithm to estimate any sparse generalized linear model with convex or non-convex separable penalties. Our algorithm is able to solve problems with millions of samples and features in seconds, by relying on coordinate descent, working sets and Anderson acceleration. It handles previously unaddressed models, and is extensively shown to improve state-of-art algorithms. We pro…

    Submitted 6 March, 2023; v1 submitted 16 April, 2022; originally announced April 2022.

  8. arXiv:2105.01637  [pdf, other]

    stat.ML cs.LG math.OC

    Implicit differentiation for fast hyperparameter selection in non-smooth convex learning

    Authors: Quentin Bertrand, Quentin Klopfenstein, Mathurin Massias, Mathieu Blondel, Samuel Vaiter, Alexandre Gramfort, Joseph Salmon

    Abstract: Finding the optimal hyperparameters of a model can be cast as a bilevel optimization problem, typically solved using zero-order techniques. In this work we study first-order methods when the inner optimization problem is convex but non-smooth. We show that the forward-mode differentiation of proximal gradient descent and proximal coordinate descent yield sequences of Jacobians converging toward th…

    Submitted 8 August, 2022; v1 submitted 4 May, 2021; originally announced May 2021.

  9. arXiv:2011.10065  [pdf, other]

    stat.ML cs.LG

    Anderson acceleration of coordinate descent

    Authors: Quentin Bertrand, Mathurin Massias

    Abstract: Acceleration of first order methods is mainly obtained via inertial techniques à la Nesterov, or via nonlinear extrapolation. The latter has known a recent surge of interest, with successful applications to gradient and proximal gradient techniques. On multiple Machine Learning problems, coordinate descent achieves performance significantly superior to full-gradient methods. Speeding up coordinate…

    Submitted 28 October, 2021; v1 submitted 19 November, 2020; originally announced November 2020.

  10. arXiv:2010.11825  [pdf, other]

    stat.ML cs.LG math.OC

    Model identification and local linear convergence of coordinate descent

    Authors: Quentin Klopfenstein, Quentin Bertrand, Alexandre Gramfort, Joseph Salmon, Samuel Vaiter

    Abstract: For composite nonsmooth optimization problems, Forward-Backward algorithm achieves model identification (e.g. support identification for the Lasso) after a finite number of iterations, provided the objective function is regular enough. Results concerning coordinate descent are scarcer and model identification has only been shown for specific estimators, the support-vector machine for instance. In…

    Submitted 22 October, 2020; originally announced October 2020.

  11. arXiv:2002.08943  [pdf, other]

    stat.ML cs.LG

    Implicit differentiation of Lasso-type models for hyperparameter optimization

    Authors: Quentin Bertrand, Quentin Klopfenstein, Mathieu Blondel, Samuel Vaiter, Alexandre Gramfort, Joseph Salmon

    Abstract: Setting regularization parameters for Lasso-type estimators is notoriously difficult, though crucial in practice. The most popular hyperparameter optimization approach is grid-search using held-out validation data. Grid-search however requires to choose a predefined grid for each parameter, which scales exponentially in the number of parameters. Another approach is to cast hyperparameter optimizat…

    Submitted 3 September, 2020; v1 submitted 20 February, 2020; originally announced February 2020.

  12. arXiv:2001.05401  [pdf, other]

    stat.ML cs.LG math.OC

    Support recovery and sup-norm convergence rates for sparse pivotal estimation

    Authors: Mathurin Massias, Quentin Bertrand, Alexandre Gramfort, Joseph Salmon

    Abstract: In high dimensional sparse regression, pivotal estimators are estimators for which the optimal regularization parameter is independent of the noise level. The canonical pivotal estimator is the square-root Lasso, formulated along with its derivatives as a "non-smooth + non-smooth" optimization problem. Modern techniques to solve these include smoothing the datafitting term, to benefit from fast ef…

    Submitted 3 September, 2020; v1 submitted 15 January, 2020; originally announced January 2020.

  13. arXiv:1902.02509  [pdf, other]

    stat.ML cs.LG math.OC stat.AP

    Handling correlated and repeated measurements with the smoothed multivariate square-root Lasso

    Authors: Quentin Bertrand, Mathurin Massias, Alexandre Gramfort, Joseph Salmon

    Abstract: Sparsity promoting norms are frequently used in high dimensional regression. A limitation of such Lasso-type estimators is that the optimal regularization parameter depends on the unknown noise level. Estimators such as the concomitant Lasso address this dependence by jointly estimating the noise level and the regression coefficients. Additionally, in many applications, the data is obtained by ave…

    Submitted 3 September, 2020; v1 submitted 7 February, 2019; originally announced February 2019.

  14. arXiv:1707.08704  [pdf, other]

    cs.AI

    Anytime Exact Belief Propagation

    Authors: Gabriel Azevedo Ferreira, Quentin Bertrand, Charles Maussion, Rodrigo de Salvo Braz

    Abstract: Statistical Relational Models and, more recently, Probabilistic Programming, have been making strides towards an integration of logic and probabilistic reasoning. A natural expectation for this project is that a probabilistic logic reasoning algorithm reduces to a logic reasoning algorithm when provided a model that only involves 0-1 probabilities, exhibiting all the advantages of logic reasoning…

    Submitted 27 July, 2017; originally announced July 2017.

    Comments: Submission to StaRAI-17 workshop at UAI-17 conference