no code implementations • 2 May 2024 • Bingshan Hu, Zhiming Huang, Tianyue H. Zhang, Mathias Lécuyer, Nidhi Hegde
We study Thompson Sampling-based algorithms for stochastic bandits with bounded rewards.
no code implementations • 16 Feb 2021 • Bingshan Hu, Zhiming Huang, Nishant A. Mehta, Nidhi Hegde
In this paper, we study differentially private online learning problems in a stochastic environment under both bandit and full information feedback.
no code implementations • 14 May 2020 • Zhiming Huang, Yifan Xu, Bingshan Hu, QiPeng Wang, Jianping Pan
We study the combinatorial sleeping multi-armed semi-bandit problem with long-term fairness constraints (CSMAB-F).