no code implementations • 29 Sep 2021 • Wang Chi Cheung, Zi Yi Ewe
We consider reinforcement learning with vectorial rewards, where the agent receives a vector of $K\geq 2$ different types of rewards at each time step.
Fairness reinforcement-learning +1