Search Results for author: Luís Alexandre

Found 1 papers, 0 papers with code

DEFENDER: DTW-Based Episode Filtering Using Demonstrations for Enhancing RL Safety

no code implementations8 May 2023 André Correia, Luís Alexandre

We propose a task-agnostic method that leverages small sets of safe and unsafe demonstrations to improve the safety of RL agents during learning.

Cannot find the paper you are looking for? You can Submit a new open access paper.