1 code implementation • 11 Mar 2021 • Paul Ecoffet, Nicolas Fontbonne, Jean-Baptiste André, Nicolas Bredeche
This paper focuses on a class of reinforcement learning problems where significant events are rare and limited to a single positive reward per episode.