Bandits with Knapsacks and predictions
Recommendations
Adversarial Bandits with Knapsacks
We consider Bandits with Knapsacks (henceforth, BwK), a general model for multi-armed bandits under supply/budget constraints. In particular, a bandit algorithm needs to solve a well-known knapsack problem: find an optimal packing of items into a limited-...
Bandits with concave rewards and convex knapsacks
EC '14: Proceedings of the fifteenth ACM conference on Economics and computation
In this paper, we consider a very general model for exploration-exploitation tradeoff which allows arbitrary concave rewards and convex constraints on the decisions across time, in addition to the customary limitation on the time horizon. This model ...
Bandits with knapsacks beyond the worst case
NIPS '21: Proceedings of the 35th International Conference on Neural Information Processing Systems
Bandits with Knapsacks (BwK) is a general model for multi-armed bandits under supply/budget constraints. While worst-case regret bounds for BwK are well-understood, we present three results that go beyond the worst-case perspective. First, we provide ...
Sponsors
- HUAWEI
- DEShaw&Co
- Barcelona School of Economics
- Universitat Pompeu Fabra
Publisher
JMLR.org
Qualifiers
- Research-article
- Research
- Refereed limited