no code implementations • 13 May 2024 • Chang Huang, Junqiao Zhao, Shatong Zhu, Hongtu Zhou, Chen Ye, Tiantian Feng, Changjun Jiang
Value function factorization methods are commonly used in cooperative multi-agent reinforcement learning, with QMIX receiving significant attention.
Multi-agent Reinforcement Learning reinforcement-learning +2
no code implementations • 4 Feb 2024 • Lanqing Li, Hai Zhang, Xinyu Zhang, Shatong Zhu, Junqiao Zhao, Pheng-Ann Heng
As a marriage between offline RL and meta-RL, the advent of offline meta-reinforcement learning (OMRL) has shown great promise in enabling RL agents to multi-task and quickly adapt while acquiring knowledge safely.