29 Aug 2017 • Mohammed Alshiekh, Roderick Bloem, Ruediger Ehlers, Bettina Könighofer, Scott Niekum, Ufuk Topcu
In the first one, the shield acts each time the learning agent is about to make a decision and provides a list of safe actions.
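
The idea of a shield that hands the agent a list of safe actions before it decides can be illustrated with a minimal sketch. Everything named here is a hypothetical stand-in for illustration: the functions `safe_actions` and `shielded_greedy_action`, the toy one-dimensional world, and the hand-written predicate `toy_is_safe`. The text above does not say how the shield itself is constructed, so the predicate is only a placeholder for it, and the greedy choice over Q-values is just one possible way for an agent to pick among the allowed actions.

```python
from typing import Callable, Dict, Hashable, List, Sequence, Tuple

State = Hashable
Action = Hashable
SafetyCheck = Callable[[State, Action], bool]


def safe_actions(state: State, actions: Sequence[Action],
                 is_safe: SafetyCheck) -> List[Action]:
    """Shield step: return the actions considered safe in `state`."""
    return [a for a in actions if is_safe(state, a)]


def shielded_greedy_action(state: State, actions: Sequence[Action],
                           q_values: Dict[Tuple[State, Action], float],
                           is_safe: SafetyCheck) -> Action:
    """Pick the highest-valued action among those the shield allows."""
    allowed = safe_actions(state, actions, is_safe)
    if not allowed:
        # Assumption: a usable shield always leaves at least one action.
        raise ValueError(f"no safe action available in state {state!r}")
    # Unseen (state, action) pairs default to a value of 0.0.
    return max(allowed, key=lambda a: q_values.get((state, a), 0.0))


# Toy world (hypothetical): positions 0..3 on a line are safe,
# anything outside that range is not.
ACTIONS = ["left", "stay", "right"]
STEP = {"left": -1, "stay": 0, "right": 1}


def toy_is_safe(state: int, action: str) -> bool:
    """Hand-written stand-in for a shield: forbid leaving positions 0..3."""
    return 0 <= state + STEP[action] < 4


# The learned values prefer "right" in state 3, but the shield removes it.
q = {(3, "right"): 10.0, (3, "stay"): 1.0, (3, "left"): 0.5}
chosen = shielded_greedy_action(3, ACTIONS, q, toy_is_safe)
print(safe_actions(3, ACTIONS, toy_is_safe), chosen)
```

In this sketch the shield runs before every decision, so the agent never even considers an action outside the returned list; the learning rule itself is left untouched.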