Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 71 results for author: Farina, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.13116  [pdf, ps, other

    cs.GT

    A Lower Bound on Swap Regret in Extensive-Form Games

    Authors: Constantinos Daskalakis, Gabriele Farina, Noah Golowich, Tuomas Sandholm, Brian Hu Zhang

    Abstract: Recent simultaneous works by Peng and Rubinstein [2024] and Dagan et al. [2024] have demonstrated the existence of a no-swap-regret learning algorithm that can reach $ε$ average swap regret against an adversary in any extensive-form game within $m^{\tilde{\mathcal O}(1/ε)}$ rounds, where $m$ is the number of nodes in the game tree. However, the question of whether a $\mathrm{poly}(m, 1/ε)$-round a… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  2. arXiv:2406.10631  [pdf, other

    cs.GT cs.LG math.OC

    Fast Last-Iterate Convergence of Learning in Games Requires Forgetful Algorithms

    Authors: Yang Cai, Gabriele Farina, Julien Grand-Clément, Christian Kroer, Chung-Wei Lee, Haipeng Luo, Weiqiang Zheng

    Abstract: Self-play via online learning is one of the premier ways to solve large-scale two-player zero-sum games, both in theory and practice. Particularly popular algorithms include optimistic multiplicative weights update (OMWU) and optimistic gradient-descent-ascent (OGDA). While both algorithms enjoy $O(1/T)$ ergodic convergence to Nash equilibrium in two-player zero-sum games, OMWU offers several adva… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

    Comments: 27 pages, 4 figures

  3. arXiv:2402.16316  [pdf, ps, other

    cs.GT

    Polynomial-Time Computation of Exact $Φ$-Equilibria in Polyhedral Games

    Authors: Gabriele Farina, Charilaos Pipis

    Abstract: It is a well-known fact that correlated equilibria can be computed in polynomial time in a large class of concisely represented games using the celebrated Ellipsoid Against Hope algorithm (Papadimitriou and Roughgarden, 2008; Jiang and Leyton-Brown, 2015). However, the landscape of efficiently computable equilibria in sequential (extensive-form) games remains unknown. The Ellipsoid Against Hope do… ▽ More

    Submitted 8 May, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

  4. arXiv:2402.09670  [pdf, ps, other

    cs.GT

    Efficient $Φ$-Regret Minimization with Low-Degree Swap Deviations in Extensive-Form Games

    Authors: Brian Hu Zhang, Ioannis Anagnostides, Gabriele Farina, Tuomas Sandholm

    Abstract: Recent breakthrough results by Dagan, Daskalakis, Fishelson and Golowich [2023] and Peng and Rubinstein [2023] established an efficient algorithm attaining at most $ε$ swap regret over extensive-form strategy spaces of dimension $N$ in $N^{\tilde O(1/ε)}$ rounds. On the other extreme, Farina and Pipis [2023] developed an efficient algorithm for minimizing the weaker notion of linear-swap regret in… ▽ More

    Submitted 17 February, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

  5. arXiv:2312.12067  [pdf, other

    cs.GT cs.LG

    Optimistic Policy Gradient in Multi-Player Markov Games with a Single Controller: Convergence Beyond the Minty Property

    Authors: Ioannis Anagnostides, Ioannis Panageas, Gabriele Farina, Tuomas Sandholm

    Abstract: Policy gradient methods enjoy strong practical performance in numerous tasks in reinforcement learning. Their theoretical understanding in multiagent settings, however, remains limited, especially beyond two-player competitive and potential Markov games. In this paper, we develop a new framework to characterize optimistic policy gradient methods in multi-player Markov games with a single controlle… ▽ More

    Submitted 21 December, 2023; v1 submitted 19 December, 2023; originally announced December 2023.

    Comments: To appear at AAAI 2024

  6. arXiv:2312.03696  [pdf, other

    cs.GT cs.MA math.OC

    Efficient Learning in Polyhedral Games via Best Response Oracles

    Authors: Darshan Chakrabarti, Gabriele Farina, Christian Kroer

    Abstract: We study online learning and equilibrium computation in games with polyhedral decision sets, a property shared by both normal-form games and extensive-form games (EFGs), when the learning agent is restricted to using a best-response oracle. We show how to achieve constant regret in zero-sum games and $O(T^{1/4})$ regret in general-sum games while using only $O(\log t)$ best-response queries at a g… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

  7. arXiv:2311.09712  [pdf, other

    cs.CL

    Regularized Conventions: Equilibrium Computation as a Model of Pragmatic Reasoning

    Authors: Athul Paul Jacob, Gabriele Farina, Jacob Andreas

    Abstract: We present a model of pragmatic language understanding, where utterances are produced and understood by searching for regularized equilibria of signaling games. In this model (which we call ReCo, for Regularized Conventions), speakers and listeners search for contextually appropriate utterance--meaning mappings that are both close to game-theoretically optimal conventions and close to a shared, ''… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

  8. arXiv:2311.05918  [pdf, other

    cs.DC

    Reliable Broadcast despite Mobile Byzantine Faults

    Authors: Silvia Bonomi, Giovanni Farina, Sébastien Tixeuil

    Abstract: We investigate the solvability of the Byzantine Reliable Broadcast and Byzantine Broadcast Channel problems in distributed systems affected by Mobile Byzantine Faults. We show that both problems are not solvable even in one of the most constrained system models for mobile Byzantine faults defined so far. By endowing processes with an additional local failure oracle, we provide a solution to the By… ▽ More

    Submitted 10 November, 2023; originally announced November 2023.

  9. arXiv:2311.00676  [pdf, other

    cs.GT cs.LG

    Last-Iterate Convergence Properties of Regret-Matching Algorithms in Games

    Authors: Yang Cai, Gabriele Farina, Julien Grand-Clément, Christian Kroer, Chung-Wei Lee, Haipeng Luo, Weiqiang Zheng

    Abstract: Algorithms based on regret matching, specifically regret matching$^+$ (RM$^+$), and its variants are the most popular approaches for solving large-scale two-player zero-sum games in practice. Unlike algorithms such as optimistic gradient descent ascent, which have strong last-iterate and ergodic convergence properties for zero-sum games, virtually nothing is known about the last-iterate properties… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

  10. arXiv:2310.15935  [pdf, other

    cs.GT

    Mediator Interpretation and Faster Learning Algorithms for Linear Correlated Equilibria in General Extensive-Form Games

    Authors: Brian Hu Zhang, Gabriele Farina, Tuomas Sandholm

    Abstract: A recent paper by Farina & Pipis (2023) established the existence of uncoupled no-linear-swap regret dynamics with polynomial-time iterations in extensive-form games. The equilibrium points reached by these dynamics, known as linear correlated equilibria, are currently the tightest known relaxation of correlated equilibrium that can be learned in polynomial time in any finite extensive-form game.… ▽ More

    Submitted 15 March, 2024; v1 submitted 24 October, 2023; originally announced October 2023.

  11. arXiv:2310.09139  [pdf, other

    cs.GT cs.AI cs.CL cs.LG

    The Consensus Game: Language Model Generation via Equilibrium Search

    Authors: Athul Paul Jacob, Yikang Shen, Gabriele Farina, Jacob Andreas

    Abstract: When applied to question answering and other text generation tasks, language models (LMs) may be queried generatively (by sampling answers from their output distribution) or discriminatively (by using them to score or rank a set of candidate outputs). These procedures sometimes yield very different predictions. How do we reconcile mutually incompatible scoring procedures to obtain coherent LM pred… ▽ More

    Submitted 13 October, 2023; originally announced October 2023.

  12. arXiv:2308.16017  [pdf, ps, other

    cs.GT

    Hidden-Role Games: Equilibrium Concepts and Computation

    Authors: Luca Carminati, Brian Hu Zhang, Gabriele Farina, Nicola Gatti, Tuomas Sandholm

    Abstract: In this paper, we study the class of games known as hidden-role games in which players are assigned privately to teams and are faced with the challenge of recognizing and cooperating with teammates. This model includes both popular recreational games such as the Mafia/Werewolf family and The Resistance (Avalon) and many real-world settings, such as distributed systems where nodes need to work toge… ▽ More

    Submitted 17 February, 2024; v1 submitted 30 August, 2023; originally announced August 2023.

  13. arXiv:2307.05448  [pdf, other

    cs.GT

    Polynomial-Time Linear-Swap Regret Minimization in Imperfect-Information Sequential Games

    Authors: Gabriele Farina, Charilaos Pipis

    Abstract: No-regret learners seek to minimize the difference between the loss they cumulated through the actions they played, and the loss they would have cumulated in hindsight had they consistently modified their behavior according to some strategy transformation function. The size of the set of transformations considered by the learner determines a natural notion of rationality. As the set of transformat… ▽ More

    Submitted 7 November, 2023; v1 submitted 11 July, 2023; originally announced July 2023.

    Comments: Accepted for publication at NeurIPS 2023

  14. arXiv:2306.05221  [pdf, other

    cs.GT

    Steering No-Regret Learners to a Desired Equilibrium

    Authors: Brian Hu Zhang, Gabriele Farina, Ioannis Anagnostides, Federico Cacciamani, Stephen Marcus McAleer, Andreas Alexander Haupt, Andrea Celli, Nicola Gatti, Vincent Conitzer, Tuomas Sandholm

    Abstract: A mediator observes no-regret learners playing an extensive-form game repeatedly across $T$ rounds. The mediator attempts to steer players toward some desirable predetermined equilibrium by giving (nonnegative) payments to players. We call this the steering problem. The steering problem captures problems several problems of interest, among them equilibrium selection and information design (persuas… ▽ More

    Submitted 17 February, 2024; v1 submitted 8 June, 2023; originally announced June 2023.

  15. arXiv:2306.05216  [pdf, ps, other

    cs.GT

    Computing Optimal Equilibria and Mechanisms via Learning in Zero-Sum Extensive-Form Games

    Authors: Brian Hu Zhang, Gabriele Farina, Ioannis Anagnostides, Federico Cacciamani, Stephen Marcus McAleer, Andreas Alexander Haupt, Andrea Celli, Nicola Gatti, Vincent Conitzer, Tuomas Sandholm

    Abstract: We introduce a new approach for computing optimal equilibria via learning in games. It applies to extensive-form settings with any number of players, including mechanism design, information design, and solution concepts such as correlated, communication, and certification equilibria. We observe that optimal equilibria are minimax equilibrium strategies of a player in an extensive-form zero-sum gam… ▽ More

    Submitted 23 May, 2024; v1 submitted 8 June, 2023; originally announced June 2023.

  16. arXiv:2305.14709  [pdf, ps, other

    cs.GT cs.LG

    Regret Matching+: (In)Stability and Fast Convergence in Games

    Authors: Gabriele Farina, Julien Grand-Clément, Christian Kroer, Chung-Wei Lee, Haipeng Luo

    Abstract: Regret Matching+ (RM+) and its variants are important algorithms for solving large-scale games. However, a theoretical understanding of their success in practice is still a mystery. Moreover, recent advances on fast convergence in games are limited to no-regret algorithms such as online mirror descent, which satisfy stability. In this paper, we first give counterexamples showing that RM+ and its p… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

  17. arXiv:2304.13138  [pdf, other

    cs.AI cs.LG

    The Update-Equivalence Framework for Decision-Time Planning

    Authors: Samuel Sokota, Gabriele Farina, David J. Wu, Hengyuan Hu, Kevin A. Wang, J. Zico Kolter, Noam Brown

    Abstract: The process of revising (or constructing) a policy at execution time -- known as decision-time planning -- has been key to achieving superhuman performance in perfect-information games like chess and Go. A recent line of work has extended decision-time planning to imperfect-information games, leading to superhuman performance in poker. However, these methods involve solving subgames whose sizes gr… ▽ More

    Submitted 13 May, 2024; v1 submitted 25 April, 2023; originally announced April 2023.

  18. arXiv:2303.12817  [pdf, other

    cs.CR cs.OS

    IRIS: a Record and Replay Framework to Enable Hardware-assisted Virtualization Fuzzing

    Authors: Carmine Cesarano, Marcello Cinque, Domenico Cotroneo, Luigi De Simone, Giorgio Farina

    Abstract: Nowadays, industries are looking into virtualization as an effective means to build safe applications, thanks to the isolation it can provide among virtual machines (VMs) running on the same hardware. In this context, a fundamental issue is understanding to what extent the isolation is guaranteed, despite possible (or induced) problems in the virtualization mechanisms. Uncovering such isolation is… ▽ More

    Submitted 22 March, 2023; originally announced March 2023.

    Comments: 13 pages, Accepted for publication at The 53rd Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN)

  19. arXiv:2301.11241  [pdf, other

    cs.LG cs.GT

    On the Convergence of No-Regret Learning Dynamics in Time-Varying Games

    Authors: Ioannis Anagnostides, Ioannis Panageas, Gabriele Farina, Tuomas Sandholm

    Abstract: Most of the literature on learning in games has focused on the restrictive setting where the underlying repeated game does not change over time. Much less is known about the convergence of no-regret learning algorithms in dynamic multiagent settings. In this paper, we characterize the convergence of optimistic gradient descent (OGD) in time-varying games. Our framework yields sharp convergence bou… ▽ More

    Submitted 18 October, 2023; v1 submitted 26 January, 2023; originally announced January 2023.

    Comments: To appear at NeurIPS 2023; V3 incorporates reviewers' feedback and minor corrections

  20. arXiv:2210.05492  [pdf, other

    cs.GT cs.AI cs.LG cs.MA

    Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning

    Authors: Anton Bakhtin, David J Wu, Adam Lerer, Jonathan Gray, Athul Paul Jacob, Gabriele Farina, Alexander H Miller, Noam Brown

    Abstract: No-press Diplomacy is a complex strategy game involving both cooperation and competition that has served as a benchmark for multi-agent AI research. While self-play reinforcement learning has resulted in numerous successes in purely adversarial games like chess, Go, and poker, self-play alone is insufficient for achieving optimal performance in domains involving cooperation with humans. We address… ▽ More

    Submitted 11 October, 2022; originally announced October 2022.

  21. arXiv:2209.14110  [pdf, other

    cs.GT

    Meta-Learning in Games

    Authors: Keegan Harris, Ioannis Anagnostides, Gabriele Farina, Mikhail Khodak, Zhiwei Steven Wu, Tuomas Sandholm

    Abstract: In the literature on game-theoretic equilibrium finding, focus has mainly been on solving a single game in isolation. In practice, however, strategic interactions -- ranging from routing problems to online advertising auctions -- evolve dynamically, thereby leading to many similar games to be solved. To address this gap, we introduce meta-learning for equilibrium finding and learning to play games… ▽ More

    Submitted 1 March, 2023; v1 submitted 28 September, 2022; originally announced September 2022.

    Comments: In the eleventh Conference on Learning Representations (ICLR 2023)

  22. arXiv:2209.01843  [pdf, other

    cs.DC cs.OS

    RunPHI: Enabling Mixed-criticality Containers via Partitioning Hypervisors in Industry 4.0

    Authors: Marco Barletta, Marcello Cinque, Luigi De Simone, Raffaele Della Corte, Giorgio Farina, Daniele Ottaviano

    Abstract: Orchestration systems are becoming a key component to automatically manage distributed computing resources in many fields with criticality requirements like Industry 4.0 (I4.0). However, they are mainly linked to OS-level virtualization, which is known to suffer from reduced isolation. In this paper, we propose RunPHI with the aim of integrating partitioning hypervisors, as a solution for assuring… ▽ More

    Submitted 5 September, 2022; originally announced September 2022.

    Comments: 2 pages, accepted for publication in Proc. ISSREW, 2022

  23. arXiv:2208.14891  [pdf, ps, other

    cs.GT

    Clairvoyant Regret Minimization: Equivalence with Nemirovski's Conceptual Prox Method and Extension to General Convex Games

    Authors: Gabriele Farina, Christian Kroer, Chung-Wei Lee, Haipeng Luo

    Abstract: A recent paper by Piliouras et al. [2021, 2022] introduces an uncoupled learning algorithm for normal-form games -- called Clairvoyant MWU (CMWU). In this note we show that CMWU is equivalent to the conceptual prox method described by Nemirovski [2004]. This connection immediately shows that it is possible to extend the CMWU algorithm to any convex game, a question left open by Piliouras et al. We… ▽ More

    Submitted 31 August, 2022; originally announced August 2022.

  24. arXiv:2208.09747  [pdf, ps, other

    cs.GT cs.LG

    Near-Optimal $Φ$-Regret Learning in Extensive-Form Games

    Authors: Ioannis Anagnostides, Gabriele Farina, Tuomas Sandholm

    Abstract: In this paper, we establish efficient and uncoupled learning dynamics so that, when employed by all players in multiplayer perfect-recall imperfect-information extensive-form games, the trigger regret of each player grows as $O(\log T)$ after $T$ repetitions of play. This improves exponentially over the prior best known trigger-regret bound of $O(T^{1/4})$, and settles a recent open question by Ba… ▽ More

    Submitted 19 September, 2023; v1 submitted 20 August, 2022; originally announced August 2022.

    Comments: Appearing at ICML 2023. V3 corrects a statement

  25. arXiv:2206.14637  [pdf, other

    cs.DC cs.AR

    Assessing Intel's Memory Bandwidth Allocation for resource limitation in real-time systems

    Authors: Giorgio Farina, Gautam Gala, Marcello Cinque, Gerhard Fohler

    Abstract: Industries are recently considering the adoption of cloud computing for hosting safety critical applications. However, the use of multicore processors usually adopted in the cloud introduces temporal anomalies due to contention for shared resources, such as the memory subsystem. In this paper we explore the potential of Intel's Memory Bandwidth Allocation (MBA) technology, available on Xeon Scalab… ▽ More

    Submitted 29 June, 2022; originally announced June 2022.

    Comments: 8 pages, to appear in proceedings of "The 25th International Symposium On Real-Time Distributed Computing ISORC"

  26. arXiv:2206.08742  [pdf, other

    cs.GT cs.LG

    Near-Optimal No-Regret Learning Dynamics for General Convex Games

    Authors: Gabriele Farina, Ioannis Anagnostides, Haipeng Luo, Chung-Wei Lee, Christian Kroer, Tuomas Sandholm

    Abstract: A recent line of work has established uncoupled learning dynamics such that, when employed by all players in a game, each player's \emph{regret} after $T$ repetitions grows polylogarithmically in $T$, an exponential improvement over the traditional guarantees within the no-regret framework. However, so far these results have only been limited to certain classes of games with structured strategy sp… ▽ More

    Submitted 16 October, 2022; v1 submitted 17 June, 2022; originally announced June 2022.

    Comments: To appear at NeurIPS 2022. V2 incorporates reviewers' feedback

  27. arXiv:2206.04122  [pdf, other

    cs.GT cs.AI cs.LG stat.ML

    ESCHER: Eschewing Importance Sampling in Games by Computing a History Value Function to Estimate Regret

    Authors: Stephen McAleer, Gabriele Farina, Marc Lanctot, Tuomas Sandholm

    Abstract: Recent techniques for approximating Nash equilibria in very large games leverage neural networks to learn approximately optimal policies (strategies). One promising line of research uses neural networks to approximate counterfactual regret minimization (CFR) or its modern variants. DREAM, the only current CFR-based neural method that is model free and therefore scalable to very large games, trains… ▽ More

    Submitted 11 October, 2022; v1 submitted 8 June, 2022; originally announced June 2022.

  28. arXiv:2204.11417  [pdf, other

    cs.GT cs.LG

    Uncoupled Learning Dynamics with $O(\log T)$ Swap Regret in Multiplayer Games

    Authors: Ioannis Anagnostides, Gabriele Farina, Christian Kroer, Chung-Wei Lee, Haipeng Luo, Tuomas Sandholm

    Abstract: In this paper we establish efficient and \emph{uncoupled} learning dynamics so that, when employed by all players in a general-sum multiplayer game, the \emph{swap regret} of each player after $T$ repetitions of the game is bounded by $O(\log T)$, improving over the prior best bounds of $O(\log^4 (T))$. At the same time, we guarantee optimal $O(\sqrt{T})$ swap regret in the adversarial regime as w… ▽ More

    Submitted 5 October, 2022; v1 submitted 24 April, 2022; originally announced April 2022.

    Comments: To appear at NeurIPS 2022. V2 incorporates reviewers' feedback and minor corrections

  29. arXiv:2203.12074  [pdf, other

    cs.GT

    Optimistic Mirror Descent Either Converges to Nash or to Strong Coarse Correlated Equilibria in Bimatrix Games

    Authors: Ioannis Anagnostides, Gabriele Farina, Ioannis Panageas, Tuomas Sandholm

    Abstract: We show that, for any sufficiently small fixed $ε> 0$, when both players in a general-sum two-player (bimatrix) game employ optimistic mirror descent (OMD) with smooth regularization, learning rate $η= O(ε^2)$ and $T = Ω(\text{poly}(1/ε))$ repetitions, either the dynamics reach an $ε$-approximate Nash equilibrium (NE), or the average correlated distribution of play is an $Ω(\text{poly}(ε))$-strong… ▽ More

    Submitted 6 October, 2022; v1 submitted 22 March, 2022; originally announced March 2022.

    Comments: To appear at NeurIPS 2022. V2 incorporates reviewers' feedback

  30. arXiv:2203.12056  [pdf, other

    cs.GT

    On Last-Iterate Convergence Beyond Zero-Sum Games

    Authors: Ioannis Anagnostides, Ioannis Panageas, Gabriele Farina, Tuomas Sandholm

    Abstract: Most existing results about \emph{last-iterate convergence} of learning dynamics are limited to two-player zero-sum games, and only apply under rigid assumptions about what dynamics the players follow. In this paper we provide new results and techniques that apply to broader families of games and learning dynamics. First, we use a regret-based analysis to show that in a class of games that include… ▽ More

    Submitted 22 March, 2022; originally announced March 2022.

  31. arXiv:2203.07181  [pdf, other

    cs.GT cs.AI cs.LG

    Optimal Correlated Equilibria in General-Sum Extensive-Form Games: Fixed-Parameter Algorithms, Hardness, and Two-Sided Column-Generation

    Authors: Brian Zhang, Gabriele Farina, Andrea Celli, Tuomas Sandholm

    Abstract: We study the problem of finding optimal correlated equilibria of various sorts: normal-form coarse correlated equilibrium (NFCCE), extensive-form coarse correlated equilibrium (EFCCE), and extensive-form correlated equilibrium (EFCE). This is NP-hard in the general case and has been studied in special cases, most notably triangle-free games, which include all two-player games with public chance mo… ▽ More

    Submitted 14 March, 2022; originally announced March 2022.

  32. arXiv:2202.05446  [pdf, other

    cs.GT

    Faster No-Regret Learning Dynamics for Extensive-Form Correlated and Coarse Correlated Equilibria

    Authors: Ioannis Anagnostides, Gabriele Farina, Christian Kroer, Andrea Celli, Tuomas Sandholm

    Abstract: A recent emerging trend in the literature on learning in games has been concerned with providing faster learning dynamics for correlated and coarse correlated equilibria in normal-form games. Much less is known about the significantly more challenging setting of extensive-form games, which can capture both sequential and simultaneous moves, as well as imperfect information. In this paper we establ… ▽ More

    Submitted 10 February, 2022; originally announced February 2022.

    Comments: Preliminary parts of this paper will appear at the AAAI-22 Workshop on Reinforcement Learning in Games. This version also contains results from an earlier preprint published by a subset of the authors (arXiv:2109.08138)

  33. arXiv:2202.00789  [pdf, other

    cs.GT

    Team Belief DAG: Generalizing the Sequence Form to Team Games for Fast Computation of Correlated Team Max-Min Equilibria via Regret Minimization

    Authors: Brian Hu Zhang, Gabriele Farina, Tuomas Sandholm

    Abstract: A classic result in the theory of extensive-form games asserts that the set of strategies available to any perfect-recall player is strategically equivalent to a low-dimensional convex polytope, called the sequence-form polytope. Online convex optimization tools operating on this polytope are the current state-of-the-art for computing several notions of equilibria in games, and have been crucial i… ▽ More

    Submitted 17 February, 2024; v1 submitted 1 February, 2022; originally announced February 2022.

  34. arXiv:2202.00237  [pdf, other

    cs.GT cs.LG

    Kernelized Multiplicative Weights for 0/1-Polyhedral Games: Bridging the Gap Between Learning in Extensive-Form and Normal-Form Games

    Authors: Gabriele Farina, Chung-Wei Lee, Haipeng Luo, Christian Kroer

    Abstract: While extensive-form games (EFGs) can be converted into normal-form games (NFGs), doing so comes at the cost of an exponential blowup of the strategy space. So, progress on NFGs and EFGs has historically followed separate tracks, with the EFG community often having to catch up with advances (e.g., last-iterate convergence and predictive regret bounds) from the larger NFG community. In this paper w… ▽ More

    Submitted 1 February, 2022; originally announced February 2022.

  35. arXiv:2112.07544  [pdf, other

    cs.MA cs.AI cs.GT cs.LG

    Modeling Strong and Human-Like Gameplay with KL-Regularized Search

    Authors: Athul Paul Jacob, David J. Wu, Gabriele Farina, Adam Lerer, Hengyuan Hu, Anton Bakhtin, Jacob Andreas, Noam Brown

    Abstract: We consider the task of building strong but human-like policies in multi-agent decision-making problems, given examples of human behavior. Imitation learning is effective at predicting human actions but may not match the strength of expert humans, while self-play learning and search techniques (e.g. AlphaZero) lead to strong performance but may produce policies that are difficult for humans to und… ▽ More

    Submitted 16 February, 2022; v1 submitted 14 December, 2021; originally announced December 2021.

  36. arXiv:2112.03804  [pdf, other

    cs.GT

    Fast Payoff Matrix Sparsification Techniques for Structured Extensive-Form Games

    Authors: Gabriele Farina, Tuomas Sandholm

    Abstract: The practical scalability of many optimization algorithms for large extensive-form games is often limited by the games' huge payoff matrices. To ameliorate the issue, Zhang and Sandholm (2020) recently proposed a sparsification technique that factorizes the payoff matrix $\mathbf{A}$ into a sparser object $\mathbf{A} = \hat{\mathbf{A}} + \mathbf{U}\mathbf{V}^\top$, where the total combined number… ▽ More

    Submitted 7 December, 2021; originally announced December 2021.

    Comments: To appear at AAAI'22

  37. Near-Optimal No-Regret Learning for Correlated Equilibria in Multi-Player General-Sum Games

    Authors: Ioannis Anagnostides, Constantinos Daskalakis, Gabriele Farina, Maxwell Fishelson, Noah Golowich, Tuomas Sandholm

    Abstract: Recently, Daskalakis, Fishelson, and Golowich (DFG) (NeurIPS`21) showed that if all agents in a multi-player general-sum normal-form game employ Optimistic Multiplicative Weights Update (OMWU), the external regret of every player is $O(\textrm{polylog}(T))$ after $T$ repetitions of the game. We extend their result from external regret to internal regret and swap regret, thereby establishing uncoup… ▽ More

    Submitted 24 January, 2023; v1 submitted 10 November, 2021; originally announced November 2021.

    Comments: Appeared at STOC 2022

  38. arXiv:2109.08138  [pdf, other

    cs.GT

    Efficient Decentralized Learning Dynamics for Extensive-Form Coarse Correlated Equilibrium: No Expensive Computation of Stationary Distributions Required

    Authors: Gabriele Farina, Andrea Celli, Tuomas Sandholm

    Abstract: While in two-player zero-sum games the Nash equilibrium is a well-established prescriptive notion of optimal play, its applicability as a prescriptive tool beyond that setting is limited. Consequently, the study of decentralized learning dynamics that guarantee convergence to correlated solution concepts in multiplayer, general-sum extensive-form (i.e., tree-form) games has become an important top… ▽ More

    Submitted 16 September, 2021; originally announced September 2021.

  39. arXiv:2105.12954  [pdf, other

    cs.GT cs.AI

    Better Regularization for Sequential Decision Spaces: Fast Convergence Rates for Nash, Correlated, and Team Equilibria

    Authors: Gabriele Farina, Christian Kroer, Tuomas Sandholm

    Abstract: We study the application of iterative first-order methods to the problem of computing equilibria of large-scale two-player extensive-form games. First-order methods must typically be instantiated with a regularizer that serves as a distance-generating function for the decision sets of the players. For the case of two-player zero-sum games, the state-of-the-art theoretical convergence rate for Nash… ▽ More

    Submitted 12 October, 2021; v1 submitted 27 May, 2021; originally announced May 2021.

    Comments: Extended version of the EC21 conference version

  40. arXiv:2104.03673  [pdf, other

    cs.DC cs.DS cs.NI

    Practical Byzantine Reliable Broadcast on Partially Connected Networks (Extended version)

    Authors: Silvia Bonomi, Jérémie Decouchant, Giovanni Farina, Vincent Rahli, Sébastien Tixeuil

    Abstract: In this paper, we consider the Byzantine reliable broadcast problem on authenticated and partially connected networks. The state-of-the-art method to solve this problem consists in combining two algorithms from the literature. Handling asynchrony and faulty senders is typically done thanks to Gabriel Bracha's authenticated double-echo broadcast protocol, which assumes an asynchronous fully connect… ▽ More

    Submitted 26 February, 2024; v1 submitted 8 April, 2021; originally announced April 2021.

    Comments: This is an extended version of a paper that appeared at the IEEE ICDCS 2021 conference

  41. arXiv:2104.01520  [pdf, ps, other

    cs.GT cs.LG cs.MA

    Simple Uncoupled No-Regret Learning Dynamics for Extensive-Form Correlated Equilibrium

    Authors: Gabriele Farina, Andrea Celli, Alberto Marchesi, Nicola Gatti

    Abstract: The existence of simple uncoupled no-regret learning dynamics that converge to correlated equilibria in normal-form games is a celebrated result in the theory of multi-agent systems. Specifically, it has been known for more than 20 years that when all players seek to minimize their internal regret in a repeated normal-form game, the empirical frequency of play converges to a normal-form correlated… ▽ More

    Submitted 27 May, 2021; v1 submitted 3 April, 2021; originally announced April 2021.

    Comments: Extended version of our NeurIPS 2020 paper. Compared to the conference version, this preprint gives finer, in-high-probability regret bounds. We also better connected our work to the phi-regret minimization framework

  42. arXiv:2103.04546  [pdf, other

    cs.GT cs.LG

    Bandit Linear Optimization for Sequential Decision Making and Extensive-Form Games

    Authors: Gabriele Farina, Robin Schmucker, Tuomas Sandholm

    Abstract: Tree-form sequential decision making (TFSDM) extends classical one-shot decision making by modeling tree-form interactions between an agent and a potentially adversarial environment. It captures the online decision-making problems that each player faces in an extensive-form game, as well as Markov decision processes and partially-observable Markov decision processes where the agent conditions on o… ▽ More

    Submitted 8 March, 2021; originally announced March 2021.

    Comments: Full version. The body of the paper appeared in the proceedings of the AAAI 2021 conference

  43. arXiv:2103.04539  [pdf, other

    cs.GT cs.LG

    Model-Free Online Learning in Unknown Sequential Decision Making Problems and Games

    Authors: Gabriele Farina, Tuomas Sandholm

    Abstract: Regret minimization has proved to be a versatile tool for tree-form sequential decision making and extensive-form games. In large two-player zero-sum imperfect-information games, modern extensions of counterfactual regret minimization (CFR) are currently the practical state of the art for computing a Nash equilibrium. Most regret-minimization algorithms for tree-form sequential decision making, in… ▽ More

    Submitted 7 March, 2021; originally announced March 2021.

    Comments: Full version. The body of the paper appeared in the proceedings of the AAAI 2021 conference

  44. arXiv:2009.10061  [pdf, ps, other

    cs.GT cs.AI cs.LG cs.MA

    Faster Algorithms for Optimal Ex-Ante Coordinated Collusive Strategies in Extensive-Form Zero-Sum Games

    Authors: Gabriele Farina, Andrea Celli, Nicola Gatti, Tuomas Sandholm

    Abstract: We focus on the problem of finding an optimal strategy for a team of two players that faces an opponent in an imperfect-information zero-sum extensive-form game. Team members are not allowed to communicate during play but can coordinate before the game. In that setting, it is known that the best the team can do is sample a profile of potentially randomized strategies (one per player) from a joint… ▽ More

    Submitted 21 September, 2020; originally announced September 2020.

  45. arXiv:2009.04336  [pdf, other

    cs.GT cs.AI

    Polynomial-Time Computation of Optimal Correlated Equilibria in Two-Player Extensive-Form Games with Public Chance Moves and Beyond

    Authors: Gabriele Farina, Tuomas Sandholm

    Abstract: Unlike normal-form games, where correlated equilibria have been studied for more than 45 years, extensive-form correlation is still generally not well understood. Part of the reason for this gap is that the sequential nature of extensive-form games allows for a richness of behaviors and incentives that are not possible in normal-form settings. This richness translates to a significantly different… ▽ More

    Submitted 9 September, 2020; originally announced September 2020.

  46. arXiv:2007.14358  [pdf, other

    cs.GT cs.AI cs.LG cs.MA math.OC

    Faster Game Solving via Predictive Blackwell Approachability: Connecting Regret Matching and Mirror Descent

    Authors: Gabriele Farina, Christian Kroer, Tuomas Sandholm

    Abstract: Blackwell approachability is a framework for reasoning about repeated games with vector-valued payoffs. We introduce predictive Blackwell approachability, where an estimate of the next payoff vector is given, and the decision maker tries to achieve better performance based on the accuracy of that estimator. In order to derive algorithms that achieve predictive Blackwell approachability, we start b… ▽ More

    Submitted 7 March, 2021; v1 submitted 28 July, 2020; originally announced July 2020.

    Comments: Full version. The body of the paper appeared in the proceedings of the AAAI 2021 conference

  47. arXiv:2007.12987  [pdf, other

    cs.PL cs.LO

    Coupled Relational Symbolic Execution for Differential Privacy

    Authors: Gian Pietro Farina, Stephen Chong, Marco Gaboardi

    Abstract: Differential privacy is a de facto standard in data privacy with applications in the private and public sectors. Most of the techniques that achieve differential privacy are based on a judicious use of randomness. However, reasoning about randomized programs is difficult and error prone. For this reason, several techniques have been recently proposed to support designer in proving programs differe… ▽ More

    Submitted 25 July, 2020; originally announced July 2020.

  48. arXiv:2004.00603  [pdf, other

    cs.GT cs.AI cs.LG cs.MA

    No-Regret Learning Dynamics for Extensive-Form Correlated Equilibrium

    Authors: Andrea Celli, Alberto Marchesi, Gabriele Farina, Nicola Gatti

    Abstract: The existence of simple, uncoupled no-regret dynamics that converge to correlated equilibria in normal-form games is a celebrated result in the theory of multi-agent systems. Specifically, it has been known for more than 20 years that when all players seek to minimize their internal regret in a repeated normal-form game, the empirical frequency of play converges to a normal-form correlated equilib… ▽ More

    Submitted 2 September, 2022; v1 submitted 1 April, 2020; originally announced April 2020.

  49. arXiv:2002.08493  [pdf, other

    cs.GT cs.AI cs.LG

    Stochastic Regret Minimization in Extensive-Form Games

    Authors: Gabriele Farina, Christian Kroer, Tuomas Sandholm

    Abstract: Monte-Carlo counterfactual regret minimization (MCCFR) is the state-of-the-art algorithm for solving sequential games that are too large for full tree traversals. It works by using gradient estimates that can be computed via sampling. However, stochastic methods for sequential games have not been investigated extensively beyond MCCFR. In this paper we develop a new framework for developing stochas… ▽ More

    Submitted 19 February, 2020; originally announced February 2020.

  50. arXiv:1910.12450  [pdf, other

    cs.GT cs.AI cs.LG math.OC

    Efficient Regret Minimization Algorithm for Extensive-Form Correlated Equilibrium

    Authors: Gabriele Farina, Chun Kai Ling, Fei Fang, Tuomas Sandholm

    Abstract: Self-play methods based on regret minimization have become the state of the art for computing Nash equilibria in large two-players zero-sum extensive-form games. These methods fundamentally rely on the hierarchical structure of the players' sequential strategy spaces to construct a regret minimizer that recursively minimizes regret at each decision point in the game tree. In this paper, we introdu… ▽ More

    Submitted 28 October, 2019; originally announced October 2019.

    Comments: Full version of NeurIPS 2019 paper