no code implementations • 20 Nov 2023 • Yizhao Jin, Greg Slabaugh, Simon Lucas
Deep Reinforcement Learning (DRL) agents frequently face challenges in adapting to tasks outside their training distribution, including issues with over-fitting, catastrophic forgetting and sample inefficiency.
1 code implementation • 26 Jan 2023 • Xiulei Song, Yizhao Jin, Greg Slabaugh, Simon Lucas
Estimation of value in policy gradient methods is a fundamental problem.
1 code implementation • 26 Jan 2023 • Xiulei Song, Yizhao Jin, Greg Slabaugh, Simon Lucas
Instead, we calculate the loss separately for each sub-action, which is less prone to clipping during updates and thereby makes better use of samples.
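The contrast in the sentence above can be illustrated with a small sketch. It assumes that "clipping" refers to a PPO-style clipped surrogate objective and that the action is a tuple of independent sub-actions. The excerpt does not confirm either assumption. The function names, the `eps` value, and the choice to sum the per-sub-action terms are hypothetical and are not taken from the paper.

```python
import numpy as np


def joint_clipped_loss(logp_new, logp_old, advantage, eps=0.2):
    """Clipped surrogate with ONE ratio per sample (product over sub-actions).

    logp_new, logp_old: arrays of shape (batch, n_sub) holding the
    log-probability of each sub-action under the new and old policy.
    advantage: array of shape (batch,).
    """
    # Summing log-ratios multiplies the sub-action ratios together.
    ratio = np.exp((logp_new - logp_old).sum(axis=1))
    clipped = np.clip(ratio, 1.0 - eps, 1.0 + eps)
    return -np.mean(np.minimum(ratio * advantage, clipped * advantage))


def per_sub_action_clipped_loss(logp_new, logp_old, advantage, eps=0.2):
    """Clipped surrogate applied to each sub-action ratio separately.

    Summing over sub-actions is an assumption made for this sketch.
    """
    ratio = np.exp(logp_new - logp_old)  # shape (batch, n_sub)
    clipped = np.clip(ratio, 1.0 - eps, 1.0 + eps)
    adv = advantage[:, None]  # broadcast the advantage over sub-actions
    per_sub = np.minimum(ratio * adv, clipped * adv)
    return -np.mean(per_sub.sum(axis=1))


if __name__ == "__main__":
    # One sample, three sub-actions, each with probability ratio 1.1.
    logp_old = np.zeros((1, 3))
    logp_new = np.full((1, 3), np.log(1.1))
    advantage = np.array([1.0])
    print(joint_clipped_loss(logp_new, logp_old, advantage))
    print(per_sub_action_clipped_loss(logp_new, logp_old, advantage))
```

In this toy case each sub-action ratio is 1.1, which lies inside the clip range [0.8, 1.2]. The joint ratio is 1.1³ = 1.331, which lies outside it, so the joint objective is clipped and passes no gradient for that sample. The per-sub-action objective is not clipped.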