no code implementations • 31 May 2019 • Djallel Bouneffouf, Srinivasan Parthasarathy, Horst Samulowitz, Martin Wistuba
We consider the stochastic multi-armed bandit problem and the contextual bandit problem with historical observations and pre-clustered arms.
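To make the "historical observations" part of this setting concrete, here is a minimal sketch of one common way to use logged data in a stochastic bandit: a standard UCB1 index whose pull counts and reward sums are seeded from history. This is an illustration under stated assumptions, not the authors' algorithm. The function name `warm_start_ucb1`, the `pull` callback, and the data layout are hypothetical, and the pre-clustered structure of the arms is left out for brevity.

```python
import math


def warm_start_ucb1(hist_rewards, pull, horizon):
    """UCB1 seeded with historical observations (illustrative sketch).

    hist_rewards: list of lists; hist_rewards[a] holds logged rewards of arm a
                  (assumed to lie in [0, 1]; the list may be empty).
    pull:         hypothetical callback, pull(arm) -> reward for an online pull.
    horizon:      number of online rounds to play.
    """
    n_arms = len(hist_rewards)
    # Historical samples are treated exactly like earlier online pulls.
    counts = [len(h) for h in hist_rewards]
    sums = [float(sum(h)) for h in hist_rewards]
    chosen = []

    for _ in range(horizon):
        total = sum(counts)
        untried = [a for a in range(n_arms) if counts[a] == 0]
        if untried:
            # An arm with no history and no pulls has no estimate yet.
            arm = untried[0]
        else:
            # Empirical mean plus the usual UCB1 exploration bonus.
            arm = max(
                range(n_arms),
                key=lambda a: sums[a] / counts[a]
                + math.sqrt(2.0 * math.log(total) / counts[a]),
            )
        reward = pull(arm)
        counts[arm] += 1
        sums[arm] += reward
        chosen.append(arm)

    return counts, sums, chosen
```

Arms with plenty of history start with a small exploration bonus, so the online rounds are spent mostly on arms whose value is still uncertain. Cluster information could be layered on top, for example by running a similar index over clusters before picking an arm within the chosen cluster, but that design is an assumption here rather than something stated in the text above.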