Search Results for author: Luís Alexandre

DEFENDER: DTW-Based Episode Filtering Using Demonstrations for Enhancing RL Safety

We propose a task-agnostic method that leverages small sets of safe and unsafe demonstrations to improve the safety of RL agents during learning.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.