Search Results for author: Longxiang He

Found 2 papers, 2 papers with code

AlignIQL: Policy Alignment in Implicit Q-Learning through Constrained Optimization

1 code implementation • 28 May 2024 • Longxiang He, Li Shen, Junbo Tan, Xueqian Wang

IDQL reinterprets IQL as an actor-critic method and gets weights of implicit policy, however, this weight only holds for the optimal value function.

D4RL Offline RL +2

Paper
Code

DiffCPS: Diffusion Model based Constrained Policy Search for Offline Reinforcement Learning

1 code implementation • 9 Oct 2023 • Longxiang He, Li Shen, Linrui Zhang, Junbo Tan, Xueqian Wang

Constrained policy search (CPS) is a fundamental problem in offline reinforcement learning, which is generally solved by advantage weighted regression (AWR).

D4RL Offline RL +1

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.