International Conference on Agents and Artificial Intelligence 2021 • Andreas Pentaliotis, Marco Wiering
The empirical results show that Variation-resistant Q-learning can control and utilize estimation bias for better performance in the experimental tasks.