1 code implementation • 9 Oct 2023 • Longxiang He, Li Shen, Linrui Zhang, Junbo Tan, Xueqian Wang
Constrained policy search (CPS) is a fundamental problem in offline reinforcement learning and is generally solved by advantage-weighted regression (AWR).
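
For readers unfamiliar with AWR, the sketch below illustrates the general form of the objective: a log-likelihood of dataset actions weighted by exponentiated advantages. It is a minimal NumPy illustration and not the authors' implementation. The function names, the `temperature` parameter, and the `max_weight` clipping value are assumptions chosen for this example.

```python
import numpy as np


def awr_weights(advantages, temperature=1.0, max_weight=20.0):
    """Return exp(A / temperature), clipped at max_weight for numerical stability.

    `temperature` and `max_weight` are illustrative hyperparameters (assumed here).
    """
    advantages = np.asarray(advantages, dtype=float)
    return np.minimum(np.exp(advantages / temperature), max_weight)


def awr_loss(log_probs, advantages, temperature=1.0, max_weight=20.0):
    """Advantage-weighted negative log-likelihood of the dataset actions.

    log_probs:  log pi(a | s) of each dataset action under the current policy.
    advantages: advantage estimate A(s, a) for the same state-action pairs.
    """
    weights = awr_weights(advantages, temperature, max_weight)
    log_probs = np.asarray(log_probs, dtype=float)
    return -np.mean(weights * log_probs)


if __name__ == "__main__":
    # Toy batch: actions with higher advantage get a larger weight in the loss.
    log_probs = [-1.0, -2.0, -0.5]
    advantages = [0.5, -0.5, 2.0]
    print(awr_weights(advantages))
    print(awr_loss(log_probs, advantages))
```

A smaller `temperature` concentrates the weight on high-advantage actions, while a larger one pushes the objective toward plain behavior cloning.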