no code implementations • 1 Jun 2020 • Tao Bian, Zhong-Ping Jiang
In this paper, a new reinforcement learning (RL) method known as the method of temporal differential is introduced.
no code implementations • 19 May 2020 • Bo Pang, Tao Bian, Zhong-Ping Jiang
This paper studies the robustness of policy iteration in the context of continuous-time infinite-horizon linear quadratic regulation (LQR) problem.
Systems and Control Numerical Analysis Systems and Control Numerical Analysis Optimization and Control