Finite-time frequentist regret bounds of multi-agent thompson sampling on sparse hypergraphs
Abstract
References
Index Terms
- Finite-time frequentist regret bounds of multi-agent thompson sampling on sparse hypergraphs
Recommendations
Prior-free and prior-dependent regret bounds for Thompson Sampling
NIPS'13: Proceedings of the 27th International Conference on Neural Information Processing Systems - Volume 1We consider the stochastic multi-armed bandit problem with a prior distribution on the reward distributions. We are interested in studying prior-free and prior-dependent regret bounds, very much in the same spirit than the usual distribution-free and ...
Near-Optimal Regret Bounds for Thompson Sampling
Thompson Sampling (TS) is one of the oldest heuristics for multiarmed bandit problems. It is a randomized algorithm based on Bayesian ideas and has recently generated significant interest after several studies demonstrated that it has favorable ...
Finite-time regret of thompson sampling algorithms for exponential family multi-armed bandits
NIPS '22: Proceedings of the 36th International Conference on Neural Information Processing SystemsWe study the regret of Thompson sampling (TS) algorithms for exponential family bandits, where the reward distribution is from a one-dimensional exponential family, which covers many common reward distributions including Bernoulli, Gaussian, Gamma, ...
Comments
Information & Contributors
Information
Published In
Sponsors
- Association for the Advancement of Artificial Intelligence
Publisher
AAAI Press
Publication History
Qualifiers
- Research-article
- Research
- Refereed limited
Contributors
Other Metrics
Bibliometrics & Citations
Bibliometrics
Article Metrics
- 0Total Citations
- 0Total Downloads
- Downloads (Last 12 months)0
- Downloads (Last 6 weeks)0