no code implementations • 12 Sep 2022 • Bang You, Jingming Xie, Youping Chen, Jan Peters, Oleg Arenz
Recent work based on state-visitation counts, curiosity, and entropy maximization generates intrinsic reward signals that motivate the agent to explore by visiting novel states.
1 code implementation • 2 Mar 2022 • Bang You, Oleg Arenz, Youping Chen, Jan Peters
Recent methods for reinforcement learning from images use auxiliary tasks to learn image features that are used by the agent's policy or Q-function.