Paper tables with annotated results for Communication-Efficient Collaborative Regret Minimization in Multi-Armed Bandits

Paper

Communication-Efficient Collaborative Regret Minimization in Multi-Armed Bandits

In this paper, we study the collaborative learning model, which concerns the tradeoff between parallelism and communication overhead in multi-agent multi-armed bandits. For regret minimization in multi-armed bandits, we present the first set of tradeoffs between the number of rounds of communication among the agents and the regret of the collaborative learning process.

PDF Paper record

Results in Papers With Code

(↓ scroll down to see all results)

Communication-Efficient Collaborative Regret Minimization in Multi-Armed Bandits

Reader Guidelines

Editor Guidelines