Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 388 results for author: Jordan, M I

.
  1. arXiv:2407.14332  [pdf, ps, other

    cs.GT

    Unravelling in Collaborative Learning

    Authors: Aymeric Capitaine, Etienne Boursier, Antoine Scheid, Eric Moulines, Michael I. Jordan, El-Mahdi El-Mhamdi, Alain Durmus

    Abstract: Collaborative learning offers a promising avenue for leveraging decentralized data. However, collaboration in groups of strategic learners is not a given. In this work, we consider strategic agents who wish to train a model together but have sampling distributions of different quality. The collaboration is organized by a benevolent aggregator who gathers samples so as to maximize total welfare, bu… ▽ More

    Submitted 19 July, 2024; originally announced July 2024.

  2. arXiv:2406.19824  [pdf, ps, other

    cs.GT stat.ML

    Learning to Mitigate Externalities: the Coase Theorem with Hindsight Rationality

    Authors: Antoine Scheid, Aymeric Capitaine, Etienne Boursier, Eric Moulines, Michael I Jordan, Alain Durmus

    Abstract: In economic theory, the concept of externality refers to any indirect effect resulting from an interaction between players that affects the social welfare. Most of the models within which externality has been studied assume that agents have perfect knowledge of their environment and preferences. This is a major hindrance to the practical implementation of many proposed solutions. To address this i… ▽ More

    Submitted 3 July, 2024; v1 submitted 28 June, 2024; originally announced June 2024.

  3. arXiv:2406.17819  [pdf, other

    cs.LG cs.AI

    Automatically Adaptive Conformal Risk Control

    Authors: Vincent Blot, Anastasios N Angelopoulos, Michael I Jordan, Nicolas J-B Brunel

    Abstract: Science and technology have a growing need for effective mechanisms that ensure reliable, controlled performance from black-box machine learning algorithms. These performance guarantees should ideally hold conditionally on the input-that is the performance guarantees should hold, at least approximately, no matter what the input. However, beyond stylized discrete groupings such as ethnicity and gen… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  4. arXiv:2406.15898  [pdf, other

    cs.GT cs.LG

    Defection-Free Collaboration between Competitors in a Learning System

    Authors: Mariel Werner, Sai Praneeth Karimireddy, Michael I. Jordan

    Abstract: We study collaborative learning systems in which the participants are competitors who will defect from the system if they lose revenue by collaborating. As such, we frame the system as a duopoly of competitive firms who are each engaged in training machine-learning models and selling their predictions to a market of consumers. We first examine a fully collaborative scheme in which both firms share… ▽ More

    Submitted 22 June, 2024; originally announced June 2024.

  5. arXiv:2406.07029  [pdf, other

    cs.LG

    Fairness-Aware Meta-Learning via Nash Bargaining

    Authors: Yi Zeng, Xuelin Yang, Li Chen, Cristian Canton Ferrer, Ming Jin, Michael I. Jordan, Ruoxi Jia

    Abstract: To address issues of group-level fairness in machine learning, it is natural to adjust model parameters based on specific fairness objectives over a sensitive-attributed validation set. Such an adjustment procedure can be cast within a meta-learning framework. However, naive integration of fairness goals via meta-learning can cause hypergradient conflicts for subgroups, resulting in unstable conve… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  6. arXiv:2406.00147  [pdf, other

    cs.GT cs.LG econ.TH

    Fair Allocation in Dynamic Mechanism Design

    Authors: Alireza Fallah, Michael I. Jordan, Annie Ulichney

    Abstract: We consider a dynamic mechanism design problem where an auctioneer sells an indivisible good to two groups of buyers in every round, for a total of $T$ rounds. The auctioneer aims to maximize their discounted overall revenue while adhering to a fairness constraint that guarantees a minimum average allocation for each group. We begin by studying the static case ($T=1$) and establish that the optima… ▽ More

    Submitted 15 June, 2024; v1 submitted 31 May, 2024; originally announced June 2024.

  7. arXiv:2404.18490  [pdf, other

    cs.LG stat.ML

    Reduced-Rank Multi-objective Policy Learning and Optimization

    Authors: Ezinne Nwankwo, Michael I. Jordan, Angela Zhou

    Abstract: Evaluating the causal impacts of possible interventions is crucial for informing decision-making, especially towards improving access to opportunity. However, if causal effects are heterogeneous and predictable from covariates, personalized treatment decisions can improve individual outcomes and contribute to both efficiency and equity. In practice, however, causal researchers do not have a single… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

  8. arXiv:2404.15746  [pdf, other

    stat.ML cs.CR cs.LG

    Collaborative Heterogeneous Causal Inference Beyond Meta-analysis

    Authors: Tianyu Guo, Sai Praneeth Karimireddy, Michael I. Jordan

    Abstract: Collaboration between different data centers is often challenged by heterogeneity across sites. To account for the heterogeneity, the state-of-the-art method is to re-weight the covariate distributions in each site to match the distribution of the target population. Nevertheless, this method could easily fail when a certain site couldn't cover the entire population. Moreover, it still relies on th… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: submitted to ICML

  9. arXiv:2404.10767  [pdf, other

    cs.GT

    Privacy Can Arise Endogenously in an Economic System with Learning Agents

    Authors: Nivasini Ananthakrishnan, Tiffany Ding, Mariel Werner, Sai Praneeth Karimireddy, Michael I. Jordan

    Abstract: We study price-discrimination games between buyers and a seller where privacy arises endogenously--that is, utility maximization yields equilibrium strategies where privacy occurs naturally. In this game, buyers with a high valuation for a good have an incentive to keep their valuation private, lest the seller charge them a higher price. This yields an equilibrium where some buyers will send a sig… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: To appear in Symposium on Foundations of Responsible Computing (FORC 2024)

  10. arXiv:2403.19605  [pdf, other

    stat.ME cs.LG

    Data-Adaptive Tradeoffs among Multiple Risks in Distribution-Free Prediction

    Authors: Drew T. Nguyen, Reese Pathak, Anastasios N. Angelopoulos, Stephen Bates, Michael I. Jordan

    Abstract: Decision-making pipelines are generally characterized by tradeoffs among various risk functions. It is often desirable to manage such tradeoffs in a data-adaptive manner. As we demonstrate, if this is done naively, state-of-the art uncertainty quantification methods can lead to significant violations of putative risk guarantees. To address this issue, we develop methods that permit valid control… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: 27 pages, 10 figures

  11. arXiv:2403.07008  [pdf, other

    cs.LG cs.AI cs.CL stat.ME

    AutoEval Done Right: Using Synthetic Data for Model Evaluation

    Authors: Pierre Boyeau, Anastasios N. Angelopoulos, Nir Yosef, Jitendra Malik, Michael I. Jordan

    Abstract: The evaluation of machine learning models using human-labeled validation data can be expensive and time-consuming. AI-labeled synthetic data can be used to decrease the number of human annotations required for this purpose in a process called autoevaluation. We suggest efficient and statistically principled algorithms for this purpose that improve sample efficiency while remaining unbiased. These… ▽ More

    Submitted 28 May, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

    Comments: New experiments, fix fig 1

  12. arXiv:2403.03811  [pdf, other

    stat.ML cs.GT cs.LG

    Incentivized Learning in Principal-Agent Bandit Games

    Authors: Antoine Scheid, Daniil Tiapkin, Etienne Boursier, Aymeric Capitaine, El Mahdi El Mhamdi, Eric Moulines, Michael I. Jordan, Alain Durmus

    Abstract: This work considers a repeated principal-agent bandit game, where the principal can only interact with her environment through the agent. The principal and the agent have misaligned objectives and the choice of action is only left to the agent. However, the principal can influence the agent's decisions by offering incentives which add up to his rewards. The principal aims to iteratively learn an i… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

  13. arXiv:2402.14005  [pdf, other

    cs.GT econ.TH

    Information Elicitation in Agency Games

    Authors: Serena Wang, Michael I. Jordan, Katrina Ligett, R. Preston McAfee

    Abstract: Rapid progress in scalable, commoditized tools for data collection and data processing has made it possible for firms and policymakers to employ ever more complex metrics as guides for decision-making. These developments have highlighted a prevailing challenge -- deciding *which* metrics to compute. In particular, a firm's ability to compute a wider range of existing metrics does not address the p… ▽ More

    Submitted 15 April, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

  14. arXiv:2402.09697  [pdf, other

    econ.TH cs.GT

    On Three-Layer Data Markets

    Authors: Alireza Fallah, Michael I. Jordan, Ali Makhdoumi, Azarakhsh Malekian

    Abstract: We study a three-layer data market comprising users (data owners), platforms, and a data buyer. Each user benefits from platform services in exchange for data, incurring privacy loss when their data, albeit noisily, is shared with the buyer. The user chooses platforms to share data with, while platforms decide on data noise levels and pricing before selling to the buyer. The buyer selects platform… ▽ More

    Submitted 20 February, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

  15. arXiv:2402.08223  [pdf, ps, other

    econ.TH cs.GT

    The Limits of Price Discrimination Under Privacy Constraints

    Authors: Alireza Fallah, Michael I. Jordan, Ali Makhdoumi, Azarakhsh Malekian

    Abstract: We study a producer's problem of selling a product to a continuum of privacy-conscious consumers, where the producer can implement third-degree price discrimination, offering different prices to different market segments. We consider a privacy mechanism that provides a degree of protection by probabilistically masking each market segment. We establish that the resultant set of all consumer-produce… ▽ More

    Submitted 16 June, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

  16. arXiv:2401.16335  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    Iterative Data Smoothing: Mitigating Reward Overfitting and Overoptimization in RLHF

    Authors: Banghua Zhu, Michael I. Jordan, Jiantao Jiao

    Abstract: Reinforcement Learning from Human Feedback (RLHF) is a pivotal technique that aligns language models closely with human-centric values. The initial phase of RLHF involves learning human values using a reward model from ranking data. It is observed that the performance of the reward model degrades after one epoch of training, and optimizing too much against the learned reward model eventually hinde… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

  17. arXiv:2312.07930  [pdf, other

    cs.LG cs.CL cs.CR cs.IT stat.ML

    Towards Optimal Statistical Watermarking

    Authors: Baihe Huang, Hanlin Zhu, Banghua Zhu, Kannan Ramchandran, Michael I. Jordan, Jason D. Lee, Jiantao Jiao

    Abstract: We study statistical watermarking by formulating it as a hypothesis testing problem, a general framework which subsumes all previous statistical watermarking methods. Key to our formulation is a coupling of the output tokens and the rejection region, realized by pseudo-random generators in practice, that allows non-trivial trade-offs between the Type I error and Type II error. We characterize the… ▽ More

    Submitted 6 February, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

  18. arXiv:2311.10859  [pdf, other

    quant-ph cs.GT cs.LG math.OC

    A Quadratic Speedup in Finding Nash Equilibria of Quantum Zero-Sum Games

    Authors: Francisca Vasconcelos, Emmanouil-Vasileios Vlatakis-Gkaragkounis, Panayotis Mertikopoulos, Georgios Piliouras, Michael I. Jordan

    Abstract: Recent developments in domains such as non-local games, quantum interactive proofs, and quantum generative adversarial networks have renewed interest in quantum game theory and, specifically, quantum zero-sum games. Central to classical game theory is the efficient algorithmic computation of Nash equilibria, which represent optimal strategies for both players. In 2008, Jain and Watrous proposed th… ▽ More

    Submitted 17 November, 2023; originally announced November 2023.

    Comments: 53 pages, 7 figures, QTML 2023 (Accepted (Long Talk))

    MSC Class: primary 91A05; 81Q93; secondary 68Q32; 91A26; 37N40;

  19. arXiv:2311.02537  [pdf, ps, other

    cs.GT econ.TH

    Contract Design With Safety Inspections

    Authors: Alireza Fallah, Michael I. Jordan

    Abstract: We study the role of regulatory inspections in a contract design problem in which a principal interacts separately with multiple agents. Each agent's hidden action includes a dimension that determines whether they undertake an extra costly step to adhere to safety protocols. The principal's objective is to use payments combined with a limited budget for random inspections to incentivize agents tow… ▽ More

    Submitted 4 November, 2023; originally announced November 2023.

  20. arXiv:2310.14087  [pdf, other

    cs.LG math.OC

    A Specialized Semismooth Newton Method for Kernel-Based Optimal Transport

    Authors: Tianyi Lin, Marco Cuturi, Michael I. Jordan

    Abstract: Kernel-based optimal transport (OT) estimators offer an alternative, functional estimation procedure to address OT problems from samples. Recent works suggest that these estimators are more statistically efficient than plug-in (linear programming-based) OT estimators when comparing probability measures in high-dimensions~\citep{Vacher-2021-Dimension}. Unfortunately, that statistical benefit comes… ▽ More

    Submitted 30 January, 2024; v1 submitted 21 October, 2023; originally announced October 2023.

    Comments: Accepted by AISTATS 2024; Fix some inaccuracy in the definition and proof; 24 pages, 36 figures

  21. arXiv:2310.14085  [pdf, ps, other

    cs.GT cs.LG math.OC

    Adaptive, Doubly Optimal No-Regret Learning in Strongly Monotone and Exp-Concave Games with Gradient Feedback

    Authors: Michael I. Jordan, Tianyi Lin, Zhengyuan Zhou

    Abstract: Online gradient descent (OGD) is well known to be doubly optimal under strong convexity or monotonicity assumptions: (1) in the single-agent setting, it achieves an optimal regret of $Θ(\log T)$ for strongly convex cost functions; and (2) in the multi-agent setting of strongly monotone games, with each agent employing OGD, we obtain last-iterate convergence of the joint action to a unique Nash equ… ▽ More

    Submitted 28 March, 2024; v1 submitted 21 October, 2023; originally announced October 2023.

    Comments: Accepted by Operations Research; 47 pages

  22. arXiv:2310.05921  [pdf, other

    stat.ML cs.LG cs.RO stat.ME

    Conformal Decision Theory: Safe Autonomous Decisions from Imperfect Predictions

    Authors: Jordan Lekeufack, Anastasios N. Angelopoulos, Andrea Bajcsy, Michael I. Jordan, Jitendra Malik

    Abstract: We introduce Conformal Decision Theory, a framework for producing safe autonomous decisions despite imperfect machine learning predictions. Examples of such decisions are ubiquitous, from robot planning algorithms that rely on pedestrian predictions, to calibrating autonomous manufacturing to exhibit high throughput and low error, to the choice of trusting a nominal policy versus switching to a sa… ▽ More

    Submitted 2 May, 2024; v1 submitted 9 October, 2023; originally announced October 2023.

    Comments: 8 pages, 5 figures

  23. arXiv:2309.04877  [pdf, other

    cs.LG stat.ML

    A Gentle Introduction to Gradient-Based Optimization and Variational Inequalities for Machine Learning

    Authors: Neha S. Wadia, Yatin Dandi, Michael I. Jordan

    Abstract: The rapid progress in machine learning in recent years has been based on a highly productive connection to gradient-based optimization. Further progress hinges in part on a shift in focus from pattern recognition to decision-making and multi-agent problems. In these broader settings, new mathematical challenges emerge that involve equilibria and game theory instead of optima. Gradient-based method… ▽ More

    Submitted 26 February, 2024; v1 submitted 9 September, 2023; originally announced September 2023.

    Comments: 36 pages, 7 figures; minor corrections

  24. arXiv:2309.01837  [pdf, other

    cs.LG stat.ML

    Delegating Data Collection in Decentralized Machine Learning

    Authors: Nivasini Ananthakrishnan, Stephen Bates, Michael I. Jordan, Nika Haghtalab

    Abstract: Motivated by the emergence of decentralized machine learning (ML) ecosystems, we study the delegation of data collection. Taking the field of contract theory as our starting point, we design optimal and near-optimal contracts that deal with two fundamental information asymmetries that arise in decentralized ML: uncertainty in the assessment of model quality and uncertainty regarding the optimal pe… ▽ More

    Submitted 2 May, 2024; v1 submitted 4 September, 2023; originally announced September 2023.

  25. arXiv:2307.13381  [pdf, other

    cs.LG cs.DC math.OC stat.ML

    Scaff-PD: Communication Efficient Fair and Robust Federated Learning

    Authors: Yaodong Yu, Sai Praneeth Karimireddy, Yi Ma, Michael I. Jordan

    Abstract: We present Scaff-PD, a fast and communication-efficient algorithm for distributionally robust federated learning. Our approach improves fairness by optimizing a family of distributionally robust objectives tailored to heterogeneous clients. We leverage the special structure of these objectives, and design an accelerated primal dual (APD) algorithm which uses bias corrected local steps (as in Scaff… ▽ More

    Submitted 25 July, 2023; originally announced July 2023.

    MSC Class: 68W40; 68W15; 90C25; 90C06 ACM Class: G.1.6; F.2.1; E.4

  26. arXiv:2307.03748  [pdf, other

    stat.ME cs.GT cs.LG stat.ML

    Incentive-Theoretic Bayesian Inference for Collaborative Science

    Authors: Stephen Bates, Michael I. Jordan, Michael Sklar, Jake A. Soloff

    Abstract: Contemporary scientific research is a distributed, collaborative endeavor, carried out by teams of researchers, regulatory institutions, funding agencies, commercial partners, and scientific bodies, all interacting with each other and facing different incentives. To maintain scientific rigor, statistical methods should acknowledge this state of affairs. To this end, we study hypothesis testing whe… ▽ More

    Submitted 8 February, 2024; v1 submitted 7 July, 2023; originally announced July 2023.

  27. arXiv:2307.00126  [pdf, other

    math.OC cs.LG stat.ML

    Accelerating Inexact HyperGradient Descent for Bilevel Optimization

    Authors: Haikuo Yang, Luo Luo, Chris Junchi Li, Michael I. Jordan

    Abstract: We present a method for solving general nonconvex-strongly-convex bilevel optimization problems. Our method -- the \emph{Restarted Accelerated HyperGradient Descent} (\texttt{RAHGD}) method -- finds an $ε$-first-order stationary point of the objective with $\tilde{\mathcal{O}}(κ^{3.25}ε^{-1.75})$ oracle complexity, where $κ$ is the condition number of the lower-level objective and $ε$ is the desir… ▽ More

    Submitted 30 June, 2023; originally announced July 2023.

  28. arXiv:2306.16617  [pdf, ps, other

    math.OC cs.GT cs.LG

    Curvature-Independent Last-Iterate Convergence for Games on Riemannian Manifolds

    Authors: Yang Cai, Michael I. Jordan, Tianyi Lin, Argyris Oikonomou, Emmanouil-Vasileios Vlatakis-Gkaragkounis

    Abstract: Numerous applications in machine learning and data analytics can be formulated as equilibrium computation over Riemannian manifolds. Despite the extensive investigation of their Euclidean counterparts, the performance of Riemannian gradient-based algorithms remain opaque and poorly understood. We revisit the original scheme of Riemannian gradient descent (RGD) and analyze it under a geodesic monot… ▽ More

    Submitted 28 June, 2023; originally announced June 2023.

  29. arXiv:2306.14670  [pdf, other

    cs.GT cs.CY cs.LG stat.ML

    Improved Bayes Risk Can Yield Reduced Social Welfare Under Competition

    Authors: Meena Jagadeesan, Michael I. Jordan, Jacob Steinhardt, Nika Haghtalab

    Abstract: As the scale of machine learning models increases, trends such as scaling laws anticipate consistent downstream improvements in predictive accuracy. However, these trends take the perspective of a single model-provider in isolation, while in reality providers often compete with each other for users. In this work, we demonstrate that competition can fundamentally alter the behavior of these scaling… ▽ More

    Submitted 6 February, 2024; v1 submitted 26 June, 2023; originally announced June 2023.

    Comments: Appeared at NeurIPS 2023; this is the full version

  30. arXiv:2306.09335  [pdf, other

    stat.ML cs.CV cs.LG stat.ME

    Class-Conditional Conformal Prediction with Many Classes

    Authors: Tiffany Ding, Anastasios N. Angelopoulos, Stephen Bates, Michael I. Jordan, Ryan J. Tibshirani

    Abstract: Standard conformal prediction methods provide a marginal coverage guarantee, which means that for a random test point, the conformal prediction set contains the true label with a user-specified probability. In many classification problems, we would like to obtain a stronger guarantee--that for test points of a specific class, the prediction set contains the true label with the same user-chosen pro… ▽ More

    Submitted 27 October, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

  31. arXiv:2306.07479  [pdf, ps, other

    cs.GT cs.IR cs.LG stat.ML

    Incentivizing High-Quality Content in Online Recommender Systems

    Authors: Xinyan Hu, Meena Jagadeesan, Michael I. Jordan, Jacob Steinhardt

    Abstract: In content recommender systems such as TikTok and YouTube, the platform's recommendation algorithm shapes content producer incentives. Many platforms employ online learning, which generates intertemporal incentives, since content produced today affects recommendations of future content. We study the game between producers and analyze the content created at equilibrium. We show that standard online… ▽ More

    Submitted 21 June, 2024; v1 submitted 12 June, 2023; originally announced June 2023.

    Comments: Updated version with revised and expanded content

  32. arXiv:2306.05592  [pdf, other

    cs.GT cs.CY cs.DC cs.LG econ.TH

    Evaluating and Incentivizing Diverse Data Contributions in Collaborative Learning

    Authors: Baihe Huang, Sai Praneeth Karimireddy, Michael I. Jordan

    Abstract: For a federated learning model to perform well, it is crucial to have a diverse and representative dataset. However, the data contributors may only be concerned with the performance on a specific subset of the population, which may not reflect the diversity of the wider population. This creates a tension between the principal (the FL platform designer) who cares about global performance and the ag… ▽ More

    Submitted 8 June, 2023; originally announced June 2023.

  33. arXiv:2306.02231  [pdf, other

    cs.CL cs.AI cs.LG eess.SY

    Fine-Tuning Language Models with Advantage-Induced Policy Alignment

    Authors: Banghua Zhu, Hiteshi Sharma, Felipe Vieira Frujeri, Shi Dong, Chenguang Zhu, Michael I. Jordan, Jiantao Jiao

    Abstract: Reinforcement learning from human feedback (RLHF) has emerged as a reliable approach to aligning large language models (LLMs) to human preferences. Among the plethora of RLHF techniques, proximal policy optimization (PPO) is of the most widely used methods. Despite its popularity, however, PPO may suffer from mode collapse, instability, and poor sample efficiency. We show that these issues can be… ▽ More

    Submitted 2 November, 2023; v1 submitted 3 June, 2023; originally announced June 2023.

  34. arXiv:2306.02003  [pdf, other

    cs.LG cs.AI cs.PF eess.SY stat.ML

    On Optimal Caching and Model Multiplexing for Large Model Inference

    Authors: Banghua Zhu, Ying Sheng, Lianmin Zheng, Clark Barrett, Michael I. Jordan, Jiantao Jiao

    Abstract: Large Language Models (LLMs) and other large foundation models have achieved noteworthy success, but their size exacerbates existing resource consumption and latency challenges. In particular, the large-scale deployment of these models is hindered by the significant resource requirements during inference. In this paper, we study two approaches for mitigating these challenges: employing a cache to… ▽ More

    Submitted 28 August, 2023; v1 submitted 3 June, 2023; originally announced June 2023.

  35. arXiv:2305.17564  [pdf, other

    cs.LG

    Federated Conformal Predictors for Distributed Uncertainty Quantification

    Authors: Charles Lu, Yaodong Yu, Sai Praneeth Karimireddy, Michael I. Jordan, Ramesh Raskar

    Abstract: Conformal prediction is emerging as a popular paradigm for providing rigorous uncertainty quantification in machine learning since it can be easily applied as a post-processing step to already trained models. In this paper, we extend conformal prediction to the federated learning setting. The main challenge we face is data heterogeneity across the clients - this violates the fundamental tenet of e… ▽ More

    Submitted 1 June, 2023; v1 submitted 27 May, 2023; originally announced May 2023.

    Comments: 23 pages, 18 figures, accepted to International Conference on Machine Learning (ICML 2023)

  36. arXiv:2305.14595  [pdf, other

    cs.LG cs.CY cs.GT

    Operationalizing Counterfactual Metrics: Incentives, Ranking, and Information Asymmetry

    Authors: Serena Wang, Stephen Bates, P. M. Aronow, Michael I. Jordan

    Abstract: From the social sciences to machine learning, it has been well documented that metrics to be optimized are not always aligned with social welfare. In healthcare, Dranove et al. (2003) showed that publishing surgery mortality metrics actually harmed the welfare of sicker patients by increasing provider selection behavior. We analyze the incentive misalignments that arise from such average treated o… ▽ More

    Submitted 29 November, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

  37. arXiv:2305.11381  [pdf, ps, other

    cs.GT cs.CY cs.IR cs.LG econ.TH

    Online Learning in a Creator Economy

    Authors: Banghua Zhu, Sai Praneeth Karimireddy, Jiantao Jiao, Michael I. Jordan

    Abstract: The creator economy has revolutionized the way individuals can profit through online platforms. In this paper, we initiate the study of online learning in the creator economy by modeling the creator economy as a three-party game between the users, platform, and content creators, with the platform interacting with the content creator under a principal-agent model through contracts to encourage bett… ▽ More

    Submitted 18 May, 2023; originally announced May 2023.

  38. arXiv:2303.06317  [pdf, ps, other

    stat.ME

    Evaluating Sensitivity to the Stick-Breaking Prior in Bayesian Nonparametrics (Rejoinder)

    Authors: Ryan Giordano, Runjing Liu, Michael I. Jordan, Tamara Broderick

    Abstract: One can typically form a local robustness metric for a particular problem quite directly, for Markov chain Monte Carlo applications as well as optimization problems such as variational Bayes. However, we argue that simply forming a local robustness metric is not enough: the hard work is showing that it is useful. Computability, interpretability, and the ability of a local robustness metric to extr… ▽ More

    Submitted 11 March, 2023; originally announced March 2023.

    Comments: Rejoinder for the discussion article "Evaluating Sensitivity to the Stick-Breaking Prior in Bayesian Nonparametrics'' in Bayesian Analysis

  39. arXiv:2303.04833  [pdf, other

    econ.GN cs.LG

    Finding Regularized Competitive Equilibria of Heterogeneous Agent Macroeconomic Models with Reinforcement Learning

    Authors: Ruitu Xu, Yifei Min, Tianhao Wang, Zhaoran Wang, Michael I. Jordan, Zhuoran Yang

    Abstract: We study a heterogeneous agent macroeconomic model with an infinite number of households and firms competing in a labor market. Each household earns income and engages in consumption at each time step while aiming to maximize a concave utility subject to the underlying market conditions. The households aim to find the optimal saving strategy that maximizes their discounted cumulative utility given… ▽ More

    Submitted 24 February, 2023; originally announced March 2023.

    Comments: 44 pages

  40. arXiv:2302.10863  [pdf, other

    cs.LG

    A Unifying Perspective on Multi-Calibration: Game Dynamics for Multi-Objective Learning

    Authors: Nika Haghtalab, Michael I. Jordan, Eric Zhao

    Abstract: We provide a unifying framework for the design and analysis of multicalibrated predictors. By placing the multicalibration problem in the general setting of multi-objective learning -- where learning guarantees must hold simultaneously over a set of distributions and loss functions -- we exploit connections to game dynamics to achieve state-of-the-art guarantees for a diverse set of multicalibrati… ▽ More

    Submitted 19 September, 2023; v1 submitted 21 February, 2023; originally announced February 2023.

    Comments: 45 pages. Authors are ordered alphabetically

  41. arXiv:2302.08300  [pdf, ps, other

    cs.LG math.OC

    Deterministic Nonsmooth Nonconvex Optimization

    Authors: Michael I. Jordan, Guy Kornowski, Tianyi Lin, Ohad Shamir, Manolis Zampetakis

    Abstract: We study the complexity of optimizing nonsmooth nonconvex Lipschitz functions by producing $(δ,ε)$-stationary points. Several recent works have presented randomized algorithms that produce such points using $\tilde O(δ^{-1}ε^{-3})$ first-order oracle calls, independent of the dimension $d$. It has been an open problem as to whether a similar result can be obtained via a deterministic algorithm. We… ▽ More

    Submitted 16 February, 2023; originally announced February 2023.

    Comments: This work supersedes arxiv:2209.12463 and arxiv:2209.10346[Section 3], with major additional results

  42. arXiv:2302.00316  [pdf, other

    math.OC cs.LG eess.SP stat.ML

    Accelerated First-Order Optimization under Nonlinear Constraints

    Authors: Michael Muehlebach, Michael I. Jordan

    Abstract: We exploit analogies between first-order algorithms for constrained optimization and non-smooth dynamical systems to design a new class of accelerated first-order algorithms for constrained optimization. Unlike Frank-Wolfe or projected gradients, these algorithms avoid optimization over the entire feasible set at each iteration. We prove convergence to stationary points even in a nonconvex setting… ▽ More

    Submitted 2 January, 2024; v1 submitted 1 February, 2023; originally announced February 2023.

    Comments: 44 pages, 6 figures

  43. arXiv:2301.11518  [pdf, ps, other

    cs.LG

    Online Learning in Stackelberg Games with an Omniscient Follower

    Authors: Geng Zhao, Banghua Zhu, Jiantao Jiao, Michael I. Jordan

    Abstract: We study the problem of online learning in a two-player decentralized cooperative Stackelberg game. In each round, the leader first takes an action, followed by the follower who takes their action after observing the leader's move. The goal of the leader is to learn to minimize the cumulative regret based on the history of interactions. Differing from the traditional formulation of repeated Stacke… ▽ More

    Submitted 11 April, 2023; v1 submitted 26 January, 2023; originally announced January 2023.

  44. arXiv:2301.11270  [pdf, other

    cs.LG cs.AI cs.HC math.ST stat.ML

    Principled Reinforcement Learning with Human Feedback from Pairwise or $K$-wise Comparisons

    Authors: Banghua Zhu, Jiantao Jiao, Michael I. Jordan

    Abstract: We provide a theoretical framework for Reinforcement Learning with Human Feedback (RLHF). Our analysis shows that when the true reward function is linear, the widely used maximum likelihood estimator (MLE) converges under both the Bradley-Terry-Luce (BTL) model and the Plackett-Luce (PL) model. However, we show that when training a policy based on the learned reward model, MLE fails while a pessim… ▽ More

    Submitted 7 February, 2024; v1 submitted 26 January, 2023; originally announced January 2023.

  45. arXiv:2301.09633  [pdf, other

    stat.ML cs.AI cs.LG q-bio.QM stat.ME

    Prediction-Powered Inference

    Authors: Anastasios N. Angelopoulos, Stephen Bates, Clara Fannjiang, Michael I. Jordan, Tijana Zrnic

    Abstract: Prediction-powered inference is a framework for performing valid statistical inference when an experimental dataset is supplemented with predictions from a machine-learning system. The framework yields simple algorithms for computing provably valid confidence intervals for quantities such as means, quantiles, and linear and logistic regression coefficients, without making any assumptions on the ma… ▽ More

    Submitted 9 November, 2023; v1 submitted 23 January, 2023; originally announced January 2023.

    Comments: Code is available at https://github.com/aangelopoulos/ppi_py

  46. arXiv:2211.15381  [pdf, other

    cs.IR cs.LG stat.ML

    Incentive-Aware Recommender Systems in Two-Sided Markets

    Authors: Xiaowu Dai, Wenlu Xu, Yuan Qi, Michael I. Jordan

    Abstract: Online platforms in the Internet Economy commonly incorporate recommender systems that recommend products (or "arms") to users (or "agents"). A key challenge in this domain arises from myopic agents who are naturally incentivized to exploit by choosing the optimal arm based on current information, rather than exploring various alternatives to gather information that benefits the collective. We pro… ▽ More

    Submitted 18 June, 2024; v1 submitted 23 November, 2022; originally announced November 2022.

  47. arXiv:2211.05732  [pdf, other

    cs.GT cs.AI cs.LG econ.TH

    The Sample Complexity of Online Contract Design

    Authors: Banghua Zhu, Stephen Bates, Zhuoran Yang, Yixin Wang, Jiantao Jiao, Michael I. Jordan

    Abstract: We study the hidden-action principal-agent problem in an online setting. In each round, the principal posts a contract that specifies the payment to the agent based on each outcome. The agent then makes a strategic choice of action that maximizes her own utility, but the action is not directly observable by the principal. The principal observes the outcome and receives utility from the agent's cho… ▽ More

    Submitted 19 May, 2023; v1 submitted 10 November, 2022; originally announced November 2022.

  48. arXiv:2210.17550  [pdf, other

    math.OC cs.GT cs.LG stat.ML

    Nesterov Meets Optimism: Rate-Optimal Separable Minimax Optimization

    Authors: Chris Junchi Li, Angela Yuan, Gauthier Gidel, Quanquan Gu, Michael I. Jordan

    Abstract: We propose a new first-order optimization algorithm -- AcceleratedGradient-OptimisticGradient (AG-OG) Descent Ascent -- for separable convex-concave minimax optimization. The main idea of our algorithm is to carefully leverage the structure of the minimax problem, performing Nesterov acceleration on the individual component and optimistic gradient on the coupling component. Equipped with proper re… ▽ More

    Submitted 14 August, 2023; v1 submitted 31 October, 2022; originally announced October 2022.

    Comments: 44 pages. This version matches the camera-ready that appeared at ICML 2023 under the same title

  49. arXiv:2210.15659  [pdf, other

    stat.ML cs.LG

    A Primal-dual Approach for Solving Variational Inequalities with General-form Constraints

    Authors: Tatjana Chavdarova, Matteo Pagliardini, Tong Yang, Michael I. Jordan

    Abstract: Yang et al. (2023) recently addressed the open problem of solving Variational Inequalities (VIs) with equality and inequality constraints through a first-order gradient method. However, the proposed primal-dual method called ACVI is applicable when we can compute analytic solutions of its subproblems; thus, the general case remains an open problem. In this paper, we adopt a warm-starting technique… ▽ More

    Submitted 29 March, 2023; v1 submitted 27 October, 2022; originally announced October 2022.

    Comments: arXiv admin note: text overlap with arXiv:2206.10575

  50. arXiv:2210.12860  [pdf, ps, other

    math.OC cs.CC cs.LG

    Explicit Second-Order Min-Max Optimization Methods with Optimal Convergence Guarantee

    Authors: Tianyi Lin, Panayotis Mertikopoulos, Michael I. Jordan

    Abstract: We propose and analyze several inexact regularized Newton-type methods for finding a global saddle point of \emph{convex-concave} unconstrained min-max optimization problems. Compared to first-order methods, our understanding of second-order methods for min-max optimization is relatively limited, as obtaining global rates of convergence with second-order information is much more involved. In this… ▽ More

    Submitted 23 April, 2024; v1 submitted 23 October, 2022; originally announced October 2022.

    Comments: Provide a simple subroutine with a detailed complexity analysis; 30 pages, 9 figures