no code implementations • 8 May 2024 • Eline M. Bovy, Marnix Suilen, Sebastian Junges, Nils Jansen
Partially observable Markov decision processes (POMDPs) rely on the key assumption that probability distributions are precisely known.
1 code implementation • 13 May 2023 • Patrick Wienhöft, Marnix Suilen, Thiago D. Simão, Clemens Dubslaff, Christel Baier, Nils Jansen
In an offline reinforcement learning setting, the safe policy improvement (SPI) problem aims to improve upon the behavior policy according to which the sample data has been generated.
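The core of SPI is to switch to a candidate policy only when the data supports the switch with high confidence. The toy sketch below illustrates this idea with a Hoeffding-style lower bound on a bandit-like value estimate; the bound, sample counts, and threshold are illustrative assumptions, not the method of the paper.

```python
import math

def hoeffding_lower_bound(mean, n, delta, reward_range=1.0):
    """High-probability lower bound on the true mean of n bounded reward samples."""
    return mean - reward_range * math.sqrt(math.log(1.0 / delta) / (2.0 * n))

def safe_to_switch(candidate_mean, n_candidate, behavior_mean, delta=0.05):
    """Switch policies only if the candidate's lower confidence bound
    still beats the behavior policy's estimated value."""
    return hoeffding_lower_bound(candidate_mean, n_candidate, delta) > behavior_mean

# With few samples the bound is loose, so SPI keeps the behavior policy;
# with many samples the same point estimate licenses the switch.
```
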
no code implementations • 10 Mar 2023 • Thom Badings, Thiago D. Simão, Marnix Suilen, Nils Jansen
In this paper, the focus is on uncertainty that goes beyond this classical interpretation, in particular by drawing a clear distinction between aleatoric and epistemic uncertainty.
no code implementations • 12 Jan 2023 • Thiago D. Simão, Marnix Suilen, Nils Jansen
In our novel approach to the SPI problem for POMDPs, we assume that a finite-state controller (FSC) represents the behavior policy and that finite memory is sufficient to derive optimal policies.
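A finite-state controller is a policy with finite memory: each memory node prescribes an action, and the node is updated on every observation. The two-node controller below is a minimal illustrative sketch of this data structure; the observations and actions are made up and not taken from the paper.

```python
from dataclasses import dataclass

@dataclass
class FSC:
    """A finite-state controller: finite-memory policy for a POMDP."""
    action: dict   # memory node -> action to take in that node
    update: dict   # (memory node, observation) -> next memory node
    node: int = 0  # current memory node

    def step(self, observation):
        """Emit the current node's action, then update memory on the observation."""
        a = self.action[self.node]
        self.node = self.update[(self.node, observation)]
        return a

# Hypothetical two-node controller: keep listening while observing "noise",
# and open the door once the last observation was "quiet".
fsc = FSC(
    action={0: "listen", 1: "open"},
    update={(0, "noise"): 0, (0, "quiet"): 1,
            (1, "noise"): 0, (1, "quiet"): 1},
)
```
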
1 code implementation • 31 May 2022 • Marnix Suilen, Thiago D. Simão, David Parker, Nils Jansen
Markov decision processes (MDPs) are formal models commonly used in sequential decision-making.
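An MDP couples states, actions, probabilistic transitions, and rewards, and optimal values satisfy the Bellman optimality equation. The sketch below runs value iteration on a made-up two-state MDP; the model is purely illustrative and not drawn from the paper.

```python
# Transition model: P[s][a] = [(probability, next_state, reward), ...]
P = {
    0: {"stay": [(1.0, 0, 0.0)], "go": [(0.8, 1, 1.0), (0.2, 0, 0.0)]},
    1: {"stay": [(1.0, 1, 2.0)], "go": [(1.0, 0, 0.0)]},
}
GAMMA = 0.9  # discount factor

def value_iteration(P, gamma, eps=1e-8):
    """Iterate the Bellman optimality operator until values converge."""
    V = {s: 0.0 for s in P}
    while True:
        V_new = {
            s: max(
                sum(p * (r + gamma * V[s2]) for p, s2, r in outcomes)
                for outcomes in P[s].values()
            )
            for s in P
        }
        if max(abs(V_new[s] - V[s]) for s in P) < eps:
            return V_new
        V = V_new

V = value_iteration(P, GAMMA)
```

In this toy model, staying in state 1 forever is optimal there, giving the value 2/(1 - 0.9) = 20.
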
no code implementations • 29 Jan 2021 • Thom S. Badings, Arnd Hartmanns, Nils Jansen, Marnix Suilen
We study a smart grid with wind power and battery storage.
no code implementations • 24 Sep 2020 • Murat Cubuktepe, Nils Jansen, Sebastian Junges, Ahmadreza Marandi, Marnix Suilen, Ufuk Topcu
(3) We linearize this dual problem and (4) solve the resulting finite linear program to obtain locally optimal solutions to the original problem.
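Step (4) amounts to solving a standard finite linear program. The snippet below solves a small made-up LP with `scipy.optimize.linprog` purely to illustrate that step; the actual objective and constraints arising from the linearized dual in the paper are not reproduced here.

```python
from scipy.optimize import linprog

# Illustrative LP (not the paper's): maximize x + 2y
# subject to x + y <= 4, x <= 3, and x, y >= 0.
res = linprog(
    c=[-1.0, -2.0],              # linprog minimizes, so negate to maximize
    A_ub=[[1.0, 1.0], [1.0, 0.0]],
    b_ub=[4.0, 3.0],
    bounds=[(0, None), (0, None)],
)
optimum = -res.fun               # optimal value of the maximization problem
```
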