Search Results for author: Costas Tzafestas

Found 1 papers, 1 papers with code

Active exploration in parameterized reinforcement learning

1 code implementation6 Oct 2016 Mehdi Khamassi, Costas Tzafestas

We apply a meta-learning algorithm based on the comparison between variations of short-term and long-term reward running averages to simultaneously tune $\beta$ and the width of the Gaussian distribution from which continuous action parameters are drawn.

Meta-Learning Q-Learning +2

Cannot find the paper you are looking for? You can Submit a new open access paper.