1 code implementation • 2 Aug 2021 • Parth K. Thaker, Mohit Malu, Nikhil Rao, Gautam Dasarathy
In this paper, we consider the pure exploration problem in stochastic multi-armed bandits where the similarities between the arms are captured by a graph and the rewards may be represented as a smooth signal on this graph.
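To make "a smooth signal on this graph" concrete, here is a minimal sketch. It assumes smoothness is measured by the graph Laplacian quadratic form, the sum over edges of w_ij (mu_i - mu_j)^2. This is a common formalization, but the sentence above does not state it. The path graph, the edge weights, the mean rewards, and the name `laplacian_quadratic_form` are all hypothetical and chosen only for illustration.

```python
def laplacian_quadratic_form(edges, mu):
    """Return the sum over edges (i, j, w) of w * (mu[i] - mu[j])**2.

    This equals mu^T L mu for the weighted graph Laplacian L.
    Small values mean neighboring arms have similar mean rewards.
    """
    return sum(w * (mu[i] - mu[j]) ** 2 for i, j, w in edges)


# Hypothetical 4-arm path graph 0-1-2-3 with unit edge weights.
edges = [(0, 1, 1.0), (1, 2, 1.0), (2, 3, 1.0)]

# Two illustrative mean-reward vectors (assumed values, not from the paper).
smooth_means = [0.50, 0.55, 0.60, 0.65]  # neighbors have similar rewards
rough_means = [0.50, 0.90, 0.10, 0.65]   # neighbors differ sharply

smooth_score = laplacian_quadratic_form(edges, smooth_means)
rough_score = laplacian_quadratic_form(edges, rough_means)

# Pure exploration aims to identify the arm with the highest mean reward.
best_arm = max(range(len(smooth_means)), key=lambda i: smooth_means[i])
```

Under this assumption, a lower score means the reward vector varies less across connected arms. That is the kind of structure a graph-aware exploration strategy could use to share information between neighboring arms.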