1 code implementation • NeurIPS 2023 • Blake Bordelon, Paul Masset, Henry Kuo, Cengiz Pehlevan
We study how learning dynamics and plateaus depend on feature structure, learning rate, discount factor, and reward function.
no code implementations • 5 Aug 2019 • Yi-Hsiang Chang, Kuan-Yu Chang, Henry Kuo, Chun-Yi Lee
However, by using a proper macro action, defined as a sequence of primitive actions, an agent is able to bypass intermediate states to a farther state and facilitate its learning procedure.