no code implementations • 18 Dec 2023 • Megan Kinniment, Lucas Jun Koba Sato, Haoxing Du, Brian Goodrich, Max Hasin, Lawrence Chan, Luke Harold Miles, Tao R. Lin, Hjalmar Wijk, Joel Burget, Aaron Ho, Elizabeth Barnes, Paul Christiano
We find that these language model agents can only complete the easiest tasks from this list, although they make some progress on the more challenging tasks.
1 code implementation • 11 May 2022 • Hjalmar Wijk, Benjie Wang, Marta Kwiatkowska
In many domains, worst-case guarantees on the performance (e. g., prediction accuracy) of a decision function subject to distributional shifts and uncertainty about the environment are crucial.
1 code implementation • 20 Jan 2021 • Mirco Giacobbe, Mohammadhosein Hasanbeig, Daniel Kroening, Hjalmar Wijk
We present the first exact method for analysing and ensuring the safety of DRL agents for Atari games.