Search Results for author: Hany Abdulsamad

Found 17 papers, 9 papers with code

Nesting Particle Filters for Experimental Design in Dynamical Systems

no code implementations • 12 Feb 2024 • Sahel Iqbal, Adrien Corenflos, Simo Särkkä, Hany Abdulsamad

In this paper, we propose a novel approach to Bayesian experimental design for non-exchangeable data that formulates it as risk-sensitive policy optimization.

Experimental Design

Paper
Add Code

Risk-Sensitive Stochastic Optimal Control as Rao-Blackwellized Markovian Score Climbing

1 code implementation • 21 Dec 2023 • Hany Abdulsamad, Sahel Iqbal, Adrien Corenflos, Simo Särkkä

Stochastic optimal control of dynamical systems is a crucial challenge in sequential decision-making.

Decision Making

Paper
Code

A Recursive Newton Method for Smoothing in Nonlinear State Space Models

no code implementations • 15 Jun 2023 • Fatemeh Yaghoobi, Hany Abdulsamad, Simo Särkkä

In this paper, we use the optimization formulation of nonlinear Kalman filtering and smoothing problems to develop second-order variants of iterated Kalman smoother (IKS) methods.

Paper
Add Code

Variational Gaussian filtering via Wasserstein gradient flows

1 code implementation • 11 Mar 2023 • Adrien Corenflos, Hany Abdulsamad

We present a novel approach to approximate Gaussian and mixture-of-Gaussians filtering.

Paper
Code

Variational Hierarchical Mixtures for Probabilistic Learning of Inverse Dynamics

no code implementations • 2 Nov 2022 • Hany Abdulsamad, Peter Nickl, Pascal Klink, Jan Peters

We derive two efficient variational inference techniques to learn these representations and highlight the advantages of hierarchical infinite local regression models, such as dealing with non-smooth functions, mitigating catastrophic forgetting, and enabling parameter sharing and fast predictions.

regression Variational Inference

Paper
Add Code

Active Inference for Robotic Manipulation

no code implementations • 1 Jun 2022 • Tim Schneider, Boris Belousov, Hany Abdulsamad, Jan Peters

Robotic manipulation stands as a largely unsolved problem despite significant advances in robotics and machine learning in the last decades.

Paper
Add Code

Model-Based Reinforcement Learning via Stochastic Hybrid Models

no code implementations • 11 Nov 2021 • Hany Abdulsamad, Jan Peters

Optimal control of general nonlinear systems is a central challenge in automation.

Imitation Learning Model-based Reinforcement Learning +4

Paper
Add Code

Efficient Stochastic Optimal Control through Approximate Bayesian Input Inference

1 code implementation • 17 May 2021 • Joe Watson, Hany Abdulsamad, Rolf Findeisen, Jan Peters

Optimal control under uncertainty is a prevailing challenge for many reasons.

Stochastic Optimization

Paper
Code

Distributionally Robust Trajectory Optimization Under Uncertain Dynamics via Relative Entropy Trust-Regions

no code implementations • 29 Mar 2021 • Hany Abdulsamad, Tim Dorau, Boris Belousov, Jia-Jie Zhu, Jan Peters

Trajectory optimization and model predictive control are essential techniques underpinning advanced robotic applications, ranging from autonomous driving to full-body humanoid control.

Autonomous Driving Humanoid Control +1

Paper
Add Code

A Probabilistic Interpretation of Self-Paced Learning with Applications to Reinforcement Learning

1 code implementation • 25 Feb 2021 • Pascal Klink, Hany Abdulsamad, Boris Belousov, Carlo D'Eramo, Jan Peters, Joni Pajarinen

Across machine learning, the use of curricula has shown strong empirical potential to improve learning from data by avoiding local optima of training objectives.

reinforcement-learning Reinforcement Learning (RL)

Paper
Code

A Variational Infinite Mixture for Probabilistic Inverse Dynamics Learning

1 code implementation • 10 Nov 2020 • Hany Abdulsamad, Peter Nickl, Pascal Klink, Jan Peters

Probabilistic regression techniques in control and robotics applications have to fulfill different criteria of data-driven adaptability, computational efficiency, scalability to high dimensions, and the capacity to deal with different modalities in the data.

Computational Efficiency

Paper
Code

Hierarchical Decomposition of Nonlinear Dynamics and Control for System Identification and Policy Distillation

no code implementations • L4DC 2020 • Hany Abdulsamad, Jan Peters

The control of nonlinear dynamical systems remains a major challenge for autonomous agents.

Imitation Learning Reinforcement Learning (RL)

Paper
Add Code

A Nonparametric Off-Policy Policy Gradient

1 code implementation • 8 Jan 2020 • Samuele Tosatto, Joao Carvalho, Hany Abdulsamad, Jan Peters

Reinforcement learning (RL) algorithms still suffer from high sample complexity despite outstanding recent successes.

Density Estimation Policy Gradient Methods +1

Paper
Code

Receding Horizon Curiosity

1 code implementation • 8 Oct 2019 • Matthias Schultheis, Boris Belousov, Hany Abdulsamad, Jan Peters

Sample-efficient exploration is crucial not only for discovering rewarding experiences but also for adapting to environment changes in a task-agnostic fashion.

Efficient Exploration Experimental Design +1

Paper
Code

Stochastic Optimal Control as Approximate Input Inference

1 code implementation • Conference on Robot Learning (CoRL) 2019 2019 • Joe Watson, Hany Abdulsamad, Jan Peters

Optimal control of stochastic nonlinear dynamical systems is a major challenge in the domain of robot learning.

Uncertainty Quantification

Paper
Code

Self-Paced Contextual Reinforcement Learning

1 code implementation • 7 Oct 2019 • Pascal Klink, Hany Abdulsamad, Boris Belousov, Jan Peters

Generalization and adaptation of learned skills to novel situations is a core requirement for intelligent autonomous robots.

reinforcement-learning Reinforcement Learning (RL)

Paper
Code

Model-Free Trajectory-based Policy Optimization with Monotonic Improvement

no code implementations • 29 Jun 2016 • Riad Akrour, Abbas Abdolmaleki, Hany Abdulsamad, Jan Peters, Gerhard Neumann

In order to show the monotonic improvement of our algorithm, we additionally conduct a theoretical analysis of our policy update scheme to derive a lower bound of the change in policy return between successive iterations.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.