Search Results for author: Longxiang He

Found 2 papers, 2 papers with code

AlignIQL: Policy Alignment in Implicit Q-Learning through Constrained Optimization

1 code implementation • 28 May 2024 • Longxiang He, Li Shen, Junbo Tan, Xueqian Wang

IDQL reinterprets IQL as an actor-critic method and derives the weights of the implicit policy; however, these weights hold only for the optimal value function.

D4RL Offline RL +2

DiffCPS: Diffusion Model based Constrained Policy Search for Offline Reinforcement Learning

1 code implementation • 9 Oct 2023 • Longxiang He, Li Shen, Linrui Zhang, Junbo Tan, Xueqian Wang

Constrained policy search (CPS) is a fundamental problem in offline reinforcement learning and is generally solved by advantage weighted regression (AWR).

D4RL Offline RL +1
