Browsing by Subject "UCB algorithm"
Sequential Transfer in Multi-Armed Bandits using Reward Samples
We consider a sequential multi-task problem, where each task is modeled as a stochastic multi-armed bandit with K arms. We study the problem of transfer learning in this setting and propose algorithms based on UCB to ...