Search Results for author: Moritz A. Zanger

Found 3 papers, 1 papers with code

Value Improved Actor Critic Algorithms

no code implementations • 3 Jun 2024 • Yaniv Oren, Moritz A. Zanger, Pascal R. van der Vaart, Matthijs T. J. Spaan, Wendelin Bohmer

In this work, we propose a general extension to the AC framework that employs two separate improvement operators: one applied to the policy in the spirit of policy-based algorithms and one applied to the value in the spirit of value-based algorithms, which we dub Value-Improved AC (VI-AC).

Paper
Add Code

Diverse Projection Ensembles for Distributional Reinforcement Learning

no code implementations • 12 Jun 2023 • Moritz A. Zanger, Wendelin Böhmer, Matthijs T. J. Spaan

In contrast to classical reinforcement learning, distributional reinforcement learning algorithms aim to learn the distribution of returns rather than their expected value.

Distributional Reinforcement Learning Inductive Bias +1

Paper
Add Code

Safe Continuous Control with Constrained Model-Based Policy Optimization

1 code implementation • 14 Apr 2021 • Moritz A. Zanger, Karam Daaboul, J. Marius Zöllner

Further, we provide theoretical and empirical analyses regarding the implications of model-usage on constrained policy optimization problems and introduce a practical algorithm that accelerates policy search with model-generated data.

Continuous Control Reinforcement Learning (RL) +1

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.