Search Results for author: Hamish Flynn

Found 3 papers, 0 papers with code

Tighter Confidence Bounds for Sequential Kernel Regression

no code implementations • 19 Mar 2024 • Hamish Flynn, David Reeb

In this capacity, they can inform the exploration-exploitation trade-off and form a core component in many sequential learning and decision-making algorithms.

Decision Making regression

Paper
Add Code

PAC-Bayes Bounds for Bandit Problems: A Survey and Experimental Comparison

no code implementations • 29 Nov 2022 • Hamish Flynn, David Reeb, Melih Kandemir, Jan Peters

On the one hand, we found that PAC-Bayes bounds are a useful tool for designing offline bandit algorithms with performance guarantees.

Decision Making

Paper
Add Code

PAC-Bayesian Lifelong Learning For Multi-Armed Bandits

no code implementations • 7 Mar 2022 • Hamish Flynn, David Reeb, Melih Kandemir, Jan Peters

We present a PAC-Bayesian analysis of lifelong learning.

Multi-Armed Bandits

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.