no code implementations • 29 Aug 2023 • Zexin Li, Aritra Samanta, Yufei Li, Andrea Soltoggio, Hyoseung Kim, Cong Liu
These components collaboratively tackle the trade-offs in on-device DRL training, improving timing and algorithm performance while minimizing the risk of out-of-memory (OOM) errors.
no code implementations • 29 Jul 2023 • Shahab Nikkhoo, Zexin Li, Aritra Samanta, Yufei Li, Cong Liu
Our work introduces a new angle for manipulation in recent multi-agent RL social dilemmas that utilize a unique reward function for incentivization.