no code implementations • 9 Dec 2015 • Jonathan Rosenski, Ohad Shamir, Liran Szlak
We consider a variant of the stochastic multi-armed bandit problem, where multiple players simultaneously choose from the same set of arms and may collide, receiving no reward.