This paper studies the Best-of-K Bandit game: at each time step, the player chooses a subset S from the N-choose-K possible subsets and observes the reward max(X(i) : i in S), where X is a random vector drawn from a joint distribution.
Mar 9, 2016
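The game described in the snippet above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the paper's algorithm: the joint distribution in `sample_x`, the helper names `play` and `estimate_subset_values`, and the choice N = 3, K = 2 are all hypothetical. The sketch plays every size-K subset many times and averages the observed max-rewards, which is a naive brute-force baseline that is feasible only for small N.

```python
import itertools
import random


def sample_x(rng):
    """Draw X from a hypothetical joint distribution over N = 3 binary arms.

    Arms 0 and 1 are perfectly correlated (both pay 1 with probability 0.5);
    arm 2 pays 1 independently with probability 0.4.
    """
    shared = 1 if rng.random() < 0.5 else 0
    independent = 1 if rng.random() < 0.4 else 0
    return [shared, shared, independent]


def play(subset, x):
    """Reward of the Best-of-K game: max(X(i) : i in S)."""
    return max(x[i] for i in subset)


def estimate_subset_values(n_arms, k, rounds_per_subset, rng):
    """Naive baseline (an assumption, not the paper's method):
    play each of the N-choose-K subsets and average the rewards."""
    values = {}
    for subset in itertools.combinations(range(n_arms), k):
        total = sum(play(subset, sample_x(rng)) for _ in range(rounds_per_subset))
        values[subset] = total / rounds_per_subset
    return values


if __name__ == "__main__":
    rng = random.Random(0)
    values = estimate_subset_values(n_arms=3, k=2, rounds_per_subset=5000, rng=rng)
    for subset, value in sorted(values.items(), key=lambda kv: -kv[1]):
        print(subset, round(value, 3))
```

In this toy distribution, arms 0 and 1 have the highest marginal means (0.5 each, against 0.4 for arm 2), yet the subset {0, 1} has expected reward 0.5 while {0, 2} and {1, 2} have expected reward 1 - 0.5 × 0.6 = 0.7. Because the reward is a max over a jointly distributed vector, picking the K arms with the best marginals is not necessarily optimal.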
Best-of-K Bandits ... 29th Annual Conference on Learning Theory, PMLR 49:1440-1489, 2016.
Top-k eXtreme contextual bandits with arm hierarchy. By Rajat Sen, Alexander Rakhlin, Lexing Ying, Rahul Kidambi, Dean Foster, Daniel N. Hill, Inderjit S.
May 28, 2019 · Title: Top-k Combinatorial Bandits with Full-Bandit Feedback ... Abstract: Top-k Combinatorial Bandits generalize multi-armed bandits, where at each ...
On Top-k Selection in Multi-Armed Bandits and Hidden Bipartite Graphs. Wei Cao¹, Jian Li¹, Yufei Tao², Zhize Li¹. ¹Tsinghua University, ²Chinese University of ...
A/B Testing and Best-arm Identification for Linear Bandits with Robustness ... Bandit Settings with Applications in Robot-Assisted Feeding, Ethan K.