no code implementations • NeurIPS 2021 • João P. Abrantes, Arnaldo J. Abrantes, Frans A. Oliehoek
This work proposes Evolution via Evolutionary Reward (EvER) that allows learning to single-handedly drive the search for policies with increasingly evolutionary fitness by ensuring the alignment of the reward function with the fitness function.