1 code implementation • 6 May 2024 • Stone Tao, Arth Shukla, Tse-kai Chan, Hao Su
A forward curriculum is then used to accelerate the training of the initial policy to perform well on the full initial state distribution of the task and improve demonstration and sample efficiency.