no code implementations • 10 Mar 2021 • K. E. Avrachenkov, V. S. Borkar, H. P. Dolhare, K. Patil
We analyze the DQN reinforcement learning algorithm as a stochastic approximation scheme using the o. d. e.
reinforcement-learning Reinforcement Learning (RL)