Author: Lykouris, Thodoris : Search

research-article

Quantifying the cost of learning in queueing systems

NIPS '23: Proceedings of the 37th International Conference on Neural Information Processing SystemsArticle No.: 285, Pages 6532–6544

Queueing systems are widely applicable stochastic models with use cases in communication networks, healthcare, service systems, etc. Although their optimal control has been extensively studied, most existing approaches assume perfect knowledge of the ...

research-article

Static Pricing for Multi-unit Prophet Inequalities

Operations Research (OPRH), Volume 72, Issue 4Pages 1388–1399https://doi.org/10.1287/opre.2023.0031

Characterizing the Efficiency of Static Pricing Schemes as a Function of the Supply

The problem of selling a supply of k units to a stream of customers constitutes one of the cornerstones in revenue management. Static pricing schemes (that output the same ...

We study a pricing problem where a seller has k identical copies of a product, buyers arrive sequentially, and the seller prices the items aiming to maximize social welfare. When k = 1, this is the so-called prophet inequality problem for which there is a ...

research-article

Efficient Decentralized Multi-agent Learning in Asymmetric Bipartite Queueing Systems

Operations Research (OPRH), Volume 72, Issue 3Pages 1049–1070https://doi.org/10.1287/opre.2022.0291

New Algorithm Enables Efficient Decentralized Learning in Bipartite Queueing Systems

Bipartite queueing systems, where agents with individual job queues request service from a pool of heterogeneous servers, are standard models for service applications like ...

We study decentralized multiagent learning in bipartite queueing systems, a standard model for service systems. In particular, N agents request service from K servers in a fully decentralized way, that is, by running the same algorithm without ...

extended-abstract

Public Access

Group fairness in dynamic refugee assignment

EC '23: Proceedings of the 24th ACM Conference on Economics and ComputationPage 701https://doi.org/10.1145/3580507.3597758

Ensuring that refugees and asylum seekers thrive (e.g., find employment) in their host countries is a profound humanitarian goal, and a primary driver of employment is the geographic location to which the refugee or asylum seeker is assigned. In the ...

research-article

Contextual Search in the Presence of Adversarial Corruptions

Operations Research (OPRH), Volume 71, Issue 4Pages 1120–1135https://doi.org/10.1287/opre.2022.2365

Contextual search is a generalization of binary search, which captures settings such as feature-based dynamic pricing. In this paradigm, a decision maker repeatedly interacts with a set of agents; in the pricing example, the decision maker first observes ...

We study contextual search, a generalization of binary search in higher dimensions, which captures settings such as feature-based dynamic pricing. Standard formulations of this problem assume that agents act in accordance with a specific homogeneous ...

research-article

Small-Loss Bounds for Online Learning with Partial Information

Mathematics of Operations Research (MOOR), Volume 47, Issue 3Pages 2186–2218https://doi.org/10.1287/moor.2021.1204

We consider the problem of adversarial (nonstochastic) online learning with partial-information feedback, in which, at each round, a decision maker selects an action from a finite set of alternatives. We develop a black-box approach for such problems in ...

extended-abstract

Public Access

Learning in Stackelberg Games with Non-myopic Agents

EC '22: Proceedings of the 23rd ACM Conference on Economics and ComputationPages 917–918https://doi.org/10.1145/3490486.3538308

Stackelberg games are a canonical model for strategic principal-agent interactions. Consider, for instance, a defense system that distributes its security resources across high-risk targets prior to attacks being executed; or a tax policymaker who sets ...

research-article

Pricing and Optimization in Shared Vehicle Systems: An Approximation Framework

Operations Research (OPRH), Volume 70, Issue 3Pages 1783–1805https://doi.org/10.1287/opre.2021.2165

The optimal management of shared vehicle systems, such as bike-, scooter-, car-, or ride-sharing, is more challenging compared with traditional resource allocation settings because of the presence of spatial externalities—changes in the demand/supply at ...

Optimizing shared vehicle systems (bike-/scooter-/car-/ride-sharing) are more challenging compared with traditional resource allocation settings because of the presence of complex network externalities—changes in the demand/supply at any location affect ...

research-article

Bayesian decision-making under misspecifed priors with applications to meta-learning

NIPS '21: Proceedings of the 35th International Conference on Neural Information Processing SystemsArticle No.: 2020, Pages 26382–26394

Thompson sampling and other Bayesian sequential decision-making algorithms are among the most popular approaches to tackle explore/exploit trade-offs in (contextual) bandits. The choice of prior in these algorithms offers fexibility to encode domain ...

research-article

Open Access

Competitive Caching with Machine Learned Advice

Journal of the ACM (JACM), Volume 68, Issue 4Article No.: 24, Pages 1–25https://doi.org/10.1145/3447579

Traditional online algorithms encapsulate decision making under uncertainty, and give ways to hedge against all possible future events, while guaranteeing a nearly optimal solution, as compared to an offline optimum. On the other hand, machine learning ...

research-article

Open Access

Contextual search in the presence of irrational agents

STOC 2021: Proceedings of the 53rd Annual ACM SIGACT Symposium on Theory of ComputingPages 910–918https://doi.org/10.1145/3406325.3451120

We study contextual search, a generalization of binary search in higher dimensions, which captures settings such as feature-based dynamic pricing. Standard game-theoretic formulations of this problem assume that agents act in accordance with a specific ...

research-article

Free

Constrained episodic reinforcement learning in concave-convex and knapsack settings

NIPS '20: Proceedings of the 34th International Conference on Neural Information Processing SystemsArticle No.: 1369, Pages 16315–16326

We propose an algorithm for tabular episodic reinforcement learning (RL) with constraints. We provide a modular analysis with strong theoretical guarantees for two general settings. First is the convex-concave setting: maximization of a concave reward ...

research-article

Free

Bandits with adversarial scaling

ICML'20: Proceedings of the 37th International Conference on Machine LearningArticle No.: 604, Pages 6511–6521

We study adversarial scaling, a multi-armed bandit model where rewards have a stochastic and an adversarial component. Our model captures display advertising where the click-through-rate can be decomposed to a (fixed across time) armquality component and ...

Article

Free

On preserving non-discrimination when combining expert advice

NIPS'18: Proceedings of the 32nd International Conference on Neural Information Processing SystemsPages 8386–8397

We study the interplay between sequential decision making and avoiding discrimination against protected groups, when examples arrive online and do not follow distributional assumptions. We consider the most basic extension of classical online learning: ...

research-article

Public Access

Stochastic bandits robust to adversarial corruptions

STOC 2018: Proceedings of the 50th Annual ACM SIGACT Symposium on Theory of ComputingPages 114–122https://doi.org/10.1145/3188745.3188918

We introduce a new model of stochastic bandits with adversarial corruptions which aims to capture settings where most of the input follows a stochastic pattern but some fraction of it can be adversarially changed to trick the algorithm, e.g., click ...

abstract

Public Access

Pricing and Optimization in Shared Vehicle Systems: An Approximation Framework

EC '17: Proceedings of the 2017 ACM Conference on Economics and ComputationPage 517https://doi.org/10.1145/3033274.3085099

Optimizing shared vehicle systems (bike-sharing/car-sharing/ride-sharing) is more challenging compared to traditional resource allocation settings due to the presence of complex network externalities. In particular, changes in the demand/supply at any ...

Article

Free

Learning in games: robustness of fast convergence

NIPS'16: Proceedings of the 30th International Conference on Neural Information Processing SystemsPages 4734–4742

We show that learning algorithms satisfying a low approximate regret property experience fast convergence to approximate optimality in a large class of repeated games. Our property, which simply requires that each learner has small regret compared to a (...

research-article

Learning and efficiency in games with dynamic population

SODA '16: Proceedings of the twenty-seventh annual ACM-SIAM symposium on Discrete algorithmsPages 120–129

We study the quality of outcomes in repeated games when the population of players is dynamically changing, and where participants use learning algorithms to adapt to the dynamic environment. Price of anarchy has originally been introduced to study the ...

Applied Filters

People

Names

Institutions

Authors

Publications

Journal/Magazine Names

Proceedings/Book Names

All Publications

Content Type

Supplemental Material Type

Media Formats

Publisher

Conferences

Sponsors

Conference Event

Proceedings Series

Publication Date

Caption

Quantifying the cost of learning in queueing systems

Static Pricing for Multi-unit Prophet Inequalities

Efficient Decentralized Multi-agent Learning in Asymmetric Bipartite Queueing Systems

Group fairness in dynamic refugee assignment

Contextual Search in the Presence of Adversarial Corruptions

Small-Loss Bounds for Online Learning with Partial Information

Learning in Stackelberg Games with Non-myopic Agents

Pricing and Optimization in Shared Vehicle Systems: An Approximation Framework

Bayesian decision-making under misspecifed priors with applications to meta-learning

Competitive Caching with Machine Learned Advice

Contextual search in the presence of irrational agents

Constrained episodic reinforcement learning in concave-convex and knapsack settings

Bandits with adversarial scaling

On preserving non-discrimination when combining expert advice

Stochastic bandits robust to adversarial corruptions

Pricing and Optimization in Shared Vehicle Systems: An Approximation Framework

Learning in games: robustness of fast convergence

Learning and efficiency in games with dynamic population

Applied Filters

People

Names

Institutions

Authors

Publications

Journal/Magazine Names

Proceedings/Book Names

All Publications

Content Type

Supplemental Material Type

Media Formats

Publisher

Conferences

Sponsors

Conference Event

Proceedings Series

Publication Date

Save to Binder