Search Results for author: Milan Vojnović

Found 3 papers, 1 paper with code

Combinatorial Bandits for Maximum Value Reward Function under Max Value-Index Feedback

1 code implementation • 25 May 2023 • Yiliu Wang, Wei Chen, Milan Vojnović

We propose an algorithm and provide a regret bound for problem instances with stochastic arm outcomes according to arbitrary distributions with finite supports.

Doubly Adversarial Federated Bandits

no code implementations • 22 Jan 2023 • Jialin Yi, Milan Vojnović

For the bandit feedback setting, we propose a near-optimal federated bandit algorithm called FEDEXP3.

Open-Ended Question Answering

On Regret-optimal Cooperative Nonstochastic Multi-armed Bandits

no code implementations • 30 Nov 2022 • Jialin Yi, Milan Vojnović

We show that with suitable regularizers and communication protocols, a collaborative multi-agent follow-the-regularized-leader (FTRL) algorithm has an individual regret upper bound that matches the lower bound up to a constant factor when the number of arms is large enough relative to the degrees of agents in the communication graph.

Multi-Armed Bandits
