no code implementations • 27 Dec 2023 • Chendi Qu, Jianping He, Xiaoming Duan, Jiming Chen
A simplistic model is less likely to contain the real reward function, while a model with high complexity leads to substantial computation cost and risks overfitting.
no code implementations • 27 Dec 2023 • Chendi Qu, Jianping He, Xiaoming Duan
Designing controllers to generate various trajectories has been studied for years, while recently, recovering an optimal controller from trajectories receives increasing attention.
no code implementations • 19 Jun 2022 • Chendi Qu, Jianping He, Jialun Li
This paper aims at the trade-off between the control performance and state unpredictability of mobile agents in long time horizon.