• Login
    View Item 
    •   etd@IISc
    • Division of Electrical, Electronics, and Computer Science (EECS)
    • Electrical Communication Engineering (ECE)
    • View Item
    •   etd@IISc
    • Division of Electrical, Electronics, and Computer Science (EECS)
    • Electrical Communication Engineering (ECE)
    • View Item
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    Sequential Transfer in Multi-Armed Bandits using Reward Samples

    View/Open
    Thesis full text (2.589Mb)
    Author
    Rahul, N R
    Metadata
    Show full item record
    Abstract
    We consider a sequential multi-task problem, where each task is modeled as a stochastic multi-armed bandit with K arms. We study the problem of transfer learning in this setting and propose algorithms based on UCB to transfer reward samples from previous tasks to improve the total regret across all tasks. We consider two different notions of similarity among tasks, (i) universal similarity and (ii) adjacent similarity. In universal similarity, all tasks encountered in the sequence are similar. On the other hand, in adjacent similarity, tasks close to one another in the sequence are more similar than the ones that are farther apart. We provide transfer algorithms and their regret upper bounds for both the similarity notions and then highlight the benefit of transfer. Our regret bounds show that the performance improves as the sequential tasks become closer to each other. Finally, we provide empirical results for our algorithms, which show performance improvement over the standard UCB algorithm without transfer.
    URI
    https://etd.iisc.ac.in/handle/2005/6803
    Collections
    • Electrical Communication Engineering (ECE) [402]

    etd@IISc is a joint service of SERC & J R D Tata Memorial (JRDTML) Library || Powered by DSpace software || DuraSpace
    Contact Us | Send Feedback | Thesis Templates
    Theme by 
    Atmire NV
     

     

    Browse

    All of etd@IIScCommunities & CollectionsTitlesAuthorsAdvisorsSubjectsBy Thesis Submission DateThis CollectionTitlesAuthorsAdvisorsSubjectsBy Thesis Submission Date

    My Account

    LoginRegister

    etd@IISc is a joint service of SERC & J R D Tata Memorial (JRDTML) Library || Powered by DSpace software || DuraSpace
    Contact Us | Send Feedback | Thesis Templates
    Theme by 
    Atmire NV