no code implementations • 30 Jul 2021 • Ding Xiang, Becky West, Jiaqi Wang, Xiquan Cui, Jinzhou Huang
Second, we compare the accumulative rewards of the three MAB algorithms with more than 1,000 trials using actual historical A/B test datasets.
Thompson Sampling
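
To make the tagged method concrete, here is a minimal sketch of Beta-Bernoulli Thompson Sampling that tracks accumulative reward over a fixed number of trials. It is an illustration only, not the authors' implementation: the arms are simulated with fixed conversion rates instead of replayed historical A/B test data, and the names `thompson_sampling` and `true_rates` are hypothetical.

```python
import random


def thompson_sampling(true_rates, n_trials, seed=0):
    """Simulate Beta-Bernoulli Thompson Sampling.

    true_rates: assumed per-arm conversion probabilities (simulation only).
    Returns (accumulative_reward, pulls_per_arm).
    """
    rng = random.Random(seed)
    n_arms = len(true_rates)
    successes = [0] * n_arms  # observed rewards of 1 per arm
    failures = [0] * n_arms   # observed rewards of 0 per arm
    pulls = [0] * n_arms
    accumulative_reward = 0

    for _ in range(n_trials):
        # Draw one sample per arm from its Beta(successes + 1, failures + 1) posterior.
        samples = [
            rng.betavariate(successes[a] + 1, failures[a] + 1)
            for a in range(n_arms)
        ]
        # Play the arm whose sampled conversion rate is highest.
        arm = max(range(n_arms), key=lambda a: samples[a])
        pulls[arm] += 1

        # Simulated Bernoulli reward for the chosen arm.
        reward = 1 if rng.random() < true_rates[arm] else 0
        if reward:
            successes[arm] += 1
        else:
            failures[arm] += 1
        accumulative_reward += reward

    return accumulative_reward, pulls


if __name__ == "__main__":
    total, pulls = thompson_sampling([0.04, 0.05, 0.07], n_trials=1000)
    print("accumulative reward:", total)
    print("pulls per arm:", pulls)
```

Running the same loop for other bandit policies with an identical seed and identical rates would give accumulative rewards that can be compared directly, which is the kind of comparison the abstract describes.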