no code implementations • 6 Oct 2023 • Tao Li, Juan Guevara, Xinghong Xie, Quanyan Zhu
In the multi-agent RL (MARL) setting, this distribution shift may arise from the nonstationary opponents (exogenous agents beyond control) in the online testing who display distinct behaviors from those recorded in the offline dataset.