no code implementations • 5 Feb 2024 • Qingyuan Wu, Simon Sinong Zhan, YiXuan Wang, Chung-Wei Lin, Chen Lv, Qi Zhu, Chao Huang
Reinforcement learning is challenging in delayed scenarios, a common real-world situation where observations and interactions occur with delays.
1 code implementation • 3 Nov 2023 • Simon Sinong Zhan, YiXuan Wang, Qingyuan Wu, Ruochen Jiao, Chao Huang, Qi Zhu
In the context of safe exploration, Reinforcement Learning (RL) has long grappled with the challenges of balancing the tradeoff between maximizing rewards and minimizing safety violations, particularly in complex environments with contact-rich or non-smooth dynamics, and when dealing with high-dimensional pixel observations.
no code implementations • 29 Sep 2022 • YiXuan Wang, Simon Sinong Zhan, Ruochen Jiao, Zhilu Wang, Wanxin Jin, Zhuoran Yang, Zhaoran Wang, Chao Huang, Qi Zhu
It is quite challenging to ensure the safety of reinforcement learning (RL) agents in an unknown and stochastic environment under hard constraints that require the system state not to reach certain specified unsafe regions.